QMD Search Engine Upgrade Report: embeddinggemma-300M -> Qwen3-Embedding-0.6B
label: after
timestamp: 20260320T070514Z
host: artbird.taile9945f.ts.net
query_count: 13
error_count: 0
queries:
- category: keyword
label: click group commands
endpoint: search
latency_ms: 484
result_count: 5
results:
- docid: '#9e9280'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/commands-and-groups.md
score: 0.88
title: Basic Commands, Groups, Context
- docid: '#4dc584'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/extending-click.md
score: 0.88
title: Extending Click
- docid: '#3aba7c'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/exceptions.md
score: 0.88
title: Exception Handling and Exit Codes
- docid: '#3c8836'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/quickstart.md
score: 0.88
title: Quickstart
- docid: '#377941'
path: qmd://content-sessions:-Users-mike-code-arthack/e6bd479f-14ed-4f0f-aa9d-92bee99deaf0/agents/a3286b0.md
score: 0.88
title: User
- category: keyword
label: systemd unit file
endpoint: search
latency_ms: 413
result_count: 5
results:
- docid: '#8ee7d4'
path: qmd://content-sessions:-Users-mike-code-arthack/8b328745-0002-4b53-a8d7-f9c05d4972ed/agents/a296336d03badd7f2.md
score: 0.93
title: User
- docid: '#db6bd1'
path: qmd://content-sessions:-Users-mike-code-arthack/785634ee-6622-4566-9fb5-781465b8b2d4/agents/a71ddb717e255ff16.md
score: 0.93
title: User
- docid: '#7cea35'
path: qmd://content-sessions:-Users-mike-code-arthack/b56ff826-a0d6-4f39-a752-31f34b862102/agents/a245e0530f5ba5718.md
score: 0.93
title: User
- docid: '#e9c091'
path: qmd://content-sessions:-Users-mike-code-arthack/cd2c72e5-945b-4d21-975d-ae64593c99ed/agents/a16d4b28b98981dc5.md
score: 0.93
title: User
- docid: '#d2f772'
path: qmd://content-sessions:-Users-mike-code-arthack/a11ede63-4b0e-4ebe-9270-9d617d83d763/agents/abb25daffefbdccf7.md
score: 0.93
title: User
- category: keyword
label: pydantic model validation
endpoint: search
latency_ms: 405
result_count: 5
results:
- docid: '#af6d62'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/alias.md
score: 0.93
title: '`AliasPath` and `AliasChoices`'
- docid: '#afd025'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/errors/usage-errors.md
score: 0.93
title: Class not fully defined {#class-not-fully-defined}
- docid: '#03e970'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/validators.md
score: 0.93
title: Field validators
- docid: '#211166'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/dataclasses.md
score: 0.93
title: Dataclass config
- docid: '#67e386'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/models.md
score: 0.93
title: Basic model usage
- category: semantic
label: auth token handling
endpoint: search
latency_ms: 398
result_count: 5
results:
- docid: '#2e0675'
path: qmd://content-topics:geminicli/content/documents/github-com-google-gemini-gemini-cli/docs/core/remote-agents.md
score: 0.94
title: Remote Subagents (experimental)
- docid: '#5d884a'
path: qmd://content-sessions:-Users-mike-code-arthack/b72a4aac-9d49-4c68-8ddd-c7a7f4acd945/agents/a3e2afee013e96225.md
score: 0.93
title: User
- docid: '#4833e2'
path: qmd://content-topics:geminicli/content/documents/github-com-google-gemini-gemini-cli/docs/tools/mcp-server.md
score: 0.93
title: MCP servers with the Gemini CLI
- docid: '#809ae4'
path: qmd://content-topics:perplexity/content/documents/docs-perplexity-ai-docs-admin-api-key-management.md
score: 0.93
title: API Key Management
- docid: '#7eeda2'
path: qmd://content-topics:swiftpm/content/documents/github-com-swiftlang-swift-package-manager/documentation/packageregistry/packageregistryusage.md
score: 0.93
title: Package Registry Usage
- category: semantic
label: async error handling
endpoint: search
latency_ms: 528
result_count: 5
results:
- docid: '#671bb5'
path: qmd://content-sessions:-Users-mike-code-arthack/23688efc-22b7-4f12-adf6-813cae70f90b/agents/a93d4f4fd5c817698.md
score: 0.89
title: User
- docid: '#fdc30f'
path: qmd://content-sessions:-Users-mike-code-arthack/4bba43c4-7436-4f38-ab54-a983409f185d/agents/a43c0ba.md
score: 0.88
title: User
- docid: '#86579b'
path: qmd://content-sessions:-Users-mike-code-arthack/51dffc22-004f-4989-b958-c3b2da4d3890/agents/ac4714a74a5c7bc8c.md
score: 0.88
title: User
- docid: '#5b700f'
path: qmd://content-sessions:-Users-mike-experiment-linux-browser-farm/58840549-2606-4b0a-9415-d4159c63de43/agents/af174e0.md
score: 0.88
title: User
- docid: '#fccfa0'
path: qmd://content-sessions:-Users-mike-code-arthack/c6bed1e2-1edc-472c-bfef-432f861bda30/agents/a14e447.md
score: 0.88
title: User
- category: semantic
label: systemd deploy workflows
endpoint: search
latency_ms: 409
result_count: 5
results:
- docid: '#e9c091'
path: qmd://content-sessions:-Users-mike-code-arthack/cd2c72e5-945b-4d21-975d-ae64593c99ed/agents/a16d4b28b98981dc5.md
score: 0.97
title: User
- docid: '#8ee7d4'
path: qmd://content-sessions:-Users-mike-code-arthack/8b328745-0002-4b53-a8d7-f9c05d4972ed/agents/a296336d03badd7f2.md
score: 0.97
title: User
- docid: '#dd420f'
path: qmd://content-sessions:-Users-mike-code-arthack/c07d644e-d5eb-4515-8d6f-57e4a2d9901c/agents/a4edb9495187eb6bf.md
score: 0.97
title: User
- docid: '#5ee9ce'
path: qmd://content-sessions:-Users-mike-code-arthack/399e942b-76cb-458f-97fd-4f21e600d4d6/agents/a365dbbde4527858c.md
score: 0.97
title: User
- docid: '#8095a0'
path: qmd://content-sessions:-Users-mike-code-arthack/09c271cc-416e-49ef-92a2-7cabf8a15dec/agents/a0beb7a08e10afd0f.md
score: 0.97
title: User
- category: cross-collection
label: YAML config loading
endpoint: search
latency_ms: 385
result_count: 5
results:
- docid: '#3ee637'
path: qmd://content-sessions:-Users-mike-code-arthack/0227bb8d-63c3-423b-b76a-27dd0aa6b5ce/agents/a6290bf.md
score: 0.9
title: User
- docid: '#371a38'
path: qmd://content-sessions:-Users-mike-code/f25d21f2-53fd-4a80-9f5a-7a26b8376166/agents/a0edc63a1f3beb9eb.md
score: 0.9
title: User
- docid: '#dc3032'
path: qmd://content-sessions:-Users-mike-code-arthack/29e1d66f-2312-4ce6-bff9-f5a94e0f69f3/agents/a99d01e.md
score: 0.9
title: User
- docid: '#c62bd7'
path: qmd://content-sessions:-Users-mike-code/f25d21f2-53fd-4a80-9f5a-7a26b8376166/agents/adc817874c3d37a3d.md
score: 0.9
title: User
- docid: '#468d3c'
path: qmd://content-sessions:-Users-mike-code-arthack-marketplace/3ef9b94b-628d-4279-b9e6-7e738d5421ca/agents/a5438b3.md
score: 0.9
title: User
- category: cross-collection
label: notifications
endpoint: search
latency_ms: 363
result_count: 5
results:
- docid: '#102b5b'
path: qmd://content-sessions:-Users-mike-code-arthack/3194d75e-108e-4116-b374-eb1c0be98cd2/agents/ada9e6a8f9a1220da.md
score: 0.94
title: User
- docid: '#083a0f'
path: qmd://content-sessions:-Users-mike-code-arthack/f413d12f-1c13-476e-ae0c-905fe3f493c0.md
score: 0.94
title: User
- docid: '#d5e6bd'
path: qmd://content-sessions:-Users-mike-code-arthack/9bb479ad-f5f9-4eed-b436-6da4a88edde1/agents/a4387b7813f07f6b3.md
score: 0.94
title: User
- docid: '#7b49f9'
path: qmd://content-sessions:-Users-mike-code-arthack/d1125869-b842-473b-9d5f-d82c014f8fce/agents/ac2865394df533977.md
score: 0.94
title: User
- docid: '#999515'
path: qmd://content-sessions:-Users-mike-code-arthack/eb4cc13e-5825-47fb-98a7-5f71b94e1767/agents/aa715ca981729ab6b.md
score: 0.94
title: User
- category: long-context
label: CLI architecture
endpoint: search
latency_ms: 730
result_count: 1
results:
- docid: '#aaabb9'
path: qmd://content-sessions:-Users-mike-code-voice-of-arthacker/0c85ed6d-6b28-4418-916c-9a7dda33ce68/agents/a5440a6.md
score: 0.97
title: User
- category: long-context
label: daemon architecture
endpoint: search
latency_ms: 524
result_count: 0
results: []
- category: structured
label: 'hyde: Telegram bot security'
endpoint: query
latency_ms: 1570
result_count: 5
results:
- docid: '#5e20e7'
path: qmd://content-sessions:-Users-mike-code-arthack/39db4ded-59aa-4815-83f1-a41f4487b213/agents/a8578482c586ef923.md
score: 0.93
title: User
- docid: '#c65657'
path: qmd://content-sessions:-Users-mike-code-arthack/6b4a8706-a801-46fa-9333-ff5e1755dbe3/agents/ada25b5.md
score: 0.56
title: User
- docid: '#09060a'
path: qmd://content-sessions:-Users-mike-code-arthack/1dd1216c-89fb-46cb-b9e8-c258dbdea80c/agents/ae1ee53d59d70b367.md
score: 0.47
title: User
- docid: '#6eff58'
path: qmd://content-sessions:-Users-mike-code-arthack/1917874d-2e60-47d3-9290-4ed29c1212a6/agents/a8c52dba463844cc4.md
score: 0.47
title: User
- docid: '#9faccf'
path: qmd://content-sessions:-Users-mike-code-arthack/616951ae-2368-4eae-90cd-0c434f3c9bac/agents/aac82e4967770f4a3.md
score: 0.46
title: User
- category: structured
label: 'expand: vector embeddings'
endpoint: query
latency_ms: 2783
result_count: 5
results:
- docid: '#f0fa2f'
path: qmd://content-sessions:-Users-mike-code-arthack-marketplace/375fbc38-a8ea-4283-857a-33b51ed448c3/agents/a96723f.md
score: 0.91
title: User
- docid: '#2c98f1'
path: qmd://content-topics:aisdk/content/documents/github-com-vercel-ai/content/docs/03-ai-sdk-core/30-embeddings.md
score: 0.53
title: Embeddings
- docid: '#a7bf50'
path: qmd://content-topics:haystack/content/documents/github-com-deepset-ai-haystack/docs-website/docs/pipeline-components/embedders/openaitextembedder.md
score: 0.45
title: OpenAITextEmbedder
- docid: '#7754f0'
path: qmd://content-topics:qmd/content/documents/github-com-tobi-qmd/readme.md
score: 0.45
title: QMD - Query Markup Documents
- docid: '#b892fe'
path: qmd://content-topics:haystack/content/documents/github-com-deepset-ai-haystack/docs-website/docs/pipeline-components/embedders/googlegenaitextembedder.md
score: 0.45
title: GoogleGenAITextEmbedder
- category: structured
label: 'expand: file watching'
endpoint: query
latency_ms: 3029
result_count: 5
results:
- docid: '#e43639'
path: qmd://content-sessions:-Users-mike-code-arthack/179b6872-b367-45a3-959d-f0c2563bf8ed.md
score: 0.88
title: User
- docid: '#9b5302'
path: qmd://content-sessions:-Users-mike-code-arthack/0f2e9cf3-8d80-4456-b7d6-d4fa12824a46/agents/a9d62c3e11196239c.md
score: 0.53
title: User
- docid: '#6d7a2f'
path: qmd://content-topics:bunjs/content/documents/github-com-oven-sh-bun/docs/guides/read-file/watch.md
score: 0.43
title: watch
- docid: '#9cf879'
path: qmd://content-sessions:-Users-mike-code-arthack/0685d4ff-965e-4a27-836e-4695c39ea59d.md
score: 0.4
title: User
- docid: '#14d28f'
path: qmd://content-sessions:-Users-mike-code-claudeblast/6404f61b-6108-43e6-9546-37d69bf660f4.md
score: 0.37
title: User
label: before
timestamp: 20260320T030240Z
host: artbird.taile9945f.ts.net
query_count: 13
error_count: 0
queries:
- category: keyword
label: click group commands
endpoint: search
latency_ms: 396
result_count: 5
results:
- docid: '#9e9280'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/commands-and-groups.md
score: 0.88
title: Basic Commands, Groups, Context
- docid: '#4dc584'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/extending-click.md
score: 0.88
title: Extending Click
- docid: '#3aba7c'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/exceptions.md
score: 0.88
title: Exception Handling and Exit Codes
- docid: '#3c8836'
path: qmd://content-topics:click/content/documents/github-com-pallets-click/docs/quickstart.md
score: 0.88
title: Quickstart
- docid: '#377941'
path: qmd://content-sessions:-Users-mike-code-arthack/e6bd479f-14ed-4f0f-aa9d-92bee99deaf0/agents/a3286b0.md
score: 0.88
title: User
- category: keyword
label: systemd unit file
endpoint: search
latency_ms: 413
result_count: 5
results:
- docid: '#8ee7d4'
path: qmd://content-sessions:-Users-mike-code-arthack/8b328745-0002-4b53-a8d7-f9c05d4972ed/agents/a296336d03badd7f2.md
score: 0.93
title: User
- docid: '#db6bd1'
path: qmd://content-sessions:-Users-mike-code-arthack/785634ee-6622-4566-9fb5-781465b8b2d4/agents/a71ddb717e255ff16.md
score: 0.93
title: User
- docid: '#7cea35'
path: qmd://content-sessions:-Users-mike-code-arthack/b56ff826-a0d6-4f39-a752-31f34b862102/agents/a245e0530f5ba5718.md
score: 0.93
title: User
- docid: '#e9c091'
path: qmd://content-sessions:-Users-mike-code-arthack/cd2c72e5-945b-4d21-975d-ae64593c99ed/agents/a16d4b28b98981dc5.md
score: 0.93
title: User
- docid: '#d2f772'
path: qmd://content-sessions:-Users-mike-code-arthack/a11ede63-4b0e-4ebe-9270-9d617d83d763/agents/abb25daffefbdccf7.md
score: 0.93
title: User
- category: keyword
label: pydantic model validation
endpoint: search
latency_ms: 357
result_count: 5
results:
- docid: '#af6d62'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/alias.md
score: 0.93
title: '`AliasPath` and `AliasChoices`'
- docid: '#afd025'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/errors/usage-errors.md
score: 0.93
title: Class not fully defined {#class-not-fully-defined}
- docid: '#03e970'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/validators.md
score: 0.93
title: Field validators
- docid: '#211166'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/dataclasses.md
score: 0.93
title: Dataclass config
- docid: '#67e386'
path: qmd://content-topics:pydantic/content/documents/github-com-pydantic-pydantic/docs/concepts/models.md
score: 0.93
title: Basic model usage
- category: semantic
label: auth token handling
endpoint: search
latency_ms: 392
result_count: 5
results:
- docid: '#2e0675'
path: qmd://content-topics:geminicli/content/documents/github-com-google-gemini-gemini-cli/docs/core/remote-agents.md
score: 0.94
title: Remote Subagents (experimental)
- docid: '#5d884a'
path: qmd://content-sessions:-Users-mike-code-arthack/b72a4aac-9d49-4c68-8ddd-c7a7f4acd945/agents/a3e2afee013e96225.md
score: 0.93
title: User
- docid: '#4833e2'
path: qmd://content-topics:geminicli/content/documents/github-com-google-gemini-gemini-cli/docs/tools/mcp-server.md
score: 0.93
title: MCP servers with the Gemini CLI
- docid: '#809ae4'
path: qmd://content-topics:perplexity/content/documents/docs-perplexity-ai-docs-admin-api-key-management.md
score: 0.93
title: API Key Management
- docid: '#7eeda2'
path: qmd://content-topics:swiftpm/content/documents/github-com-swiftlang-swift-package-manager/documentation/packageregistry/packageregistryusage.md
score: 0.93
title: Package Registry Usage
- category: semantic
label: async error handling
endpoint: search
latency_ms: 576
result_count: 5
results:
- docid: '#671bb5'
path: qmd://content-sessions:-Users-mike-code-arthack/23688efc-22b7-4f12-adf6-813cae70f90b/agents/a93d4f4fd5c817698.md
score: 0.89
title: User
- docid: '#fdc30f'
path: qmd://content-sessions:-Users-mike-code-arthack/4bba43c4-7436-4f38-ab54-a983409f185d/agents/a43c0ba.md
score: 0.88
title: User
- docid: '#86579b'
path: qmd://content-sessions:-Users-mike-code-arthack/51dffc22-004f-4989-b958-c3b2da4d3890/agents/ac4714a74a5c7bc8c.md
score: 0.88
title: User
- docid: '#5b700f'
path: qmd://content-sessions:-Users-mike-experiment-linux-browser-farm/58840549-2606-4b0a-9415-d4159c63de43/agents/af174e0.md
score: 0.88
title: User
- docid: '#fccfa0'
path: qmd://content-sessions:-Users-mike-code-arthack/c6bed1e2-1edc-472c-bfef-432f861bda30/agents/a14e447.md
score: 0.88
title: User
- category: semantic
label: systemd deploy workflows
endpoint: search
latency_ms: 375
result_count: 5
results:
- docid: '#e9c091'
path: qmd://content-sessions:-Users-mike-code-arthack/cd2c72e5-945b-4d21-975d-ae64593c99ed/agents/a16d4b28b98981dc5.md
score: 0.97
title: User
- docid: '#8ee7d4'
path: qmd://content-sessions:-Users-mike-code-arthack/8b328745-0002-4b53-a8d7-f9c05d4972ed/agents/a296336d03badd7f2.md
score: 0.97
title: User
- docid: '#dd420f'
path: qmd://content-sessions:-Users-mike-code-arthack/c07d644e-d5eb-4515-8d6f-57e4a2d9901c/agents/a4edb9495187eb6bf.md
score: 0.97
title: User
- docid: '#5ee9ce'
path: qmd://content-sessions:-Users-mike-code-arthack/399e942b-76cb-458f-97fd-4f21e600d4d6/agents/a365dbbde4527858c.md
score: 0.97
title: User
- docid: '#8095a0'
path: qmd://content-sessions:-Users-mike-code-arthack/09c271cc-416e-49ef-92a2-7cabf8a15dec/agents/a0beb7a08e10afd0f.md
score: 0.97
title: User
- category: cross-collection
label: YAML config loading
endpoint: search
latency_ms: 374
result_count: 5
results:
- docid: '#3ee637'
path: qmd://content-sessions:-Users-mike-code-arthack/0227bb8d-63c3-423b-b76a-27dd0aa6b5ce/agents/a6290bf.md
score: 0.9
title: User
- docid: '#371a38'
path: qmd://content-sessions:-Users-mike-code/f25d21f2-53fd-4a80-9f5a-7a26b8376166/agents/a0edc63a1f3beb9eb.md
score: 0.9
title: User
- docid: '#dc3032'
path: qmd://content-sessions:-Users-mike-code-arthack/29e1d66f-2312-4ce6-bff9-f5a94e0f69f3/agents/a99d01e.md
score: 0.9
title: User
- docid: '#c62bd7'
path: qmd://content-sessions:-Users-mike-code/f25d21f2-53fd-4a80-9f5a-7a26b8376166/agents/adc817874c3d37a3d.md
score: 0.9
title: User
- docid: '#468d3c'
path: qmd://content-sessions:-Users-mike-code-arthack-marketplace/3ef9b94b-628d-4279-b9e6-7e738d5421ca/agents/a5438b3.md
score: 0.9
title: User
- category: cross-collection
label: notifications
endpoint: search
latency_ms: 361
result_count: 5
results:
- docid: '#102b5b'
path: qmd://content-sessions:-Users-mike-code-arthack/3194d75e-108e-4116-b374-eb1c0be98cd2/agents/ada9e6a8f9a1220da.md
score: 0.94
title: User
- docid: '#083a0f'
path: qmd://content-sessions:-Users-mike-code-arthack/f413d12f-1c13-476e-ae0c-905fe3f493c0.md
score: 0.94
title: User
- docid: '#d5e6bd'
path: qmd://content-sessions:-Users-mike-code-arthack/9bb479ad-f5f9-4eed-b436-6da4a88edde1/agents/a4387b7813f07f6b3.md
score: 0.94
title: User
- docid: '#7b49f9'
path: qmd://content-sessions:-Users-mike-code-arthack/d1125869-b842-473b-9d5f-d82c014f8fce/agents/ac2865394df533977.md
score: 0.94
title: User
- docid: '#999515'
path: qmd://content-sessions:-Users-mike-code-arthack/eb4cc13e-5825-47fb-98a7-5f71b94e1767/agents/aa715ca981729ab6b.md
score: 0.94
title: User
- category: long-context
label: CLI architecture
endpoint: search
latency_ms: 703
result_count: 1
results:
- docid: '#aaabb9'
path: qmd://content-sessions:-Users-mike-code-voice-of-arthacker/0c85ed6d-6b28-4418-916c-9a7dda33ce68/agents/a5440a6.md
score: 0.97
title: User
- category: long-context
label: daemon architecture
endpoint: search
latency_ms: 484
result_count: 0
results: []
- category: structured
label: 'hyde: Telegram bot security'
endpoint: query
latency_ms: 5632
result_count: 5
results:
- docid: '#c0f33e'
path: qmd://content-sessions:-Users-mike-code-arthack/179d2f95-9131-44e2-837d-e3a0996331ab/agents/a2e9d62f0ed14278e.md
score: 0.93
title: User
- docid: '#c7fa4a'
path: qmd://content-sessions:-Users-mike-code-arthack/2a4814f7-8ada-4542-8f24-c84ab0b27180/agents/a96d693889707c85d.md
score: 0.56
title: User
- docid: '#5e20e7'
path: qmd://content-sessions:-Users-mike-code-arthack/39db4ded-59aa-4815-83f1-a41f4487b213/agents/a8578482c586ef923.md
score: 0.47
title: User
- docid: '#2c87e1'
path: qmd://content-sessions:-Users-mike-code-arthack/2a4814f7-8ada-4542-8f24-c84ab0b27180/agents/a2ebcdcd9353f3885.md
score: 0.47
title: User
- docid: '#6f460e'
path: qmd://content-sessions:-Users-mike-code-arthack/1719c35c-cc98-421a-b505-672c970277d1/agents/ac73f90293f09e2e5.md
score: 0.46
title: User
- category: structured
label: 'expand: vector embeddings'
endpoint: query
latency_ms: 12830
result_count: 5
results:
- docid: '#44f26a'
path: qmd://content-topics:haystack/content/documents/github-com-deepset-ai-haystack/docs-website/docs/pipeline-components/retrievers/weaviateembeddingretriever.md
score: 0.9
title: WeaviateEmbeddingRetriever
- docid: '#6ddcb7'
path: qmd://content-topics:perplexity/content/documents/docs-perplexity-ai-docs-embeddings-best-practices.md
score: 0.54
title: Best Practices
- docid: '#2f8b36'
path: qmd://content-topics:kit/content/documents/github-com-cased-kit/docs/src/content/docs/core-concepts/semantic-search.md
score: 0.47
title: How it works
- docid: '#996861'
path: qmd://content-topics:haystack/content/documents/github-com-deepset-ai-haystack/docs-website/docs/pipeline-components/retrievers/pgvectorembeddingretriever.md
score: 0.47
title: PgvectorEmbeddingRetriever
- docid: '#51df84'
path: qmd://content-sessions:-Users-mike-code-arthack-marketplace/ca94aa72-91c9-4f0d-91a3-538215572419/agents/ae5e162.md
score: 0.45
title: User
- category: structured
label: 'expand: file watching'
endpoint: query
latency_ms: 11404
result_count: 5
results:
- docid: '#9b5302'
path: qmd://content-sessions:-Users-mike-code-arthack/0f2e9cf3-8d80-4456-b7d6-d4fa12824a46/agents/a9d62c3e11196239c.md
score: 0.9
title: User
- docid: '#52eaed'
path: qmd://content-sessions:-Users-mike-code-docs/2b229db5-49ae-440c-9f56-a316c6d57d73.md
score: 0.53
title: User
- docid: '#9cf879'
path: qmd://content-sessions:-Users-mike-code-arthack/0685d4ff-965e-4a27-836e-4695c39ea59d.md
score: 0.39
title: User
- docid: '#9addae'
path: qmd://content-sessions:-Users-mike-code-arthack/5e81024b-d851-41fe-a022-dca7994b0f7b/agents/ae460c0.md
score: 0.38
title: User
- docid: '#8ed70c'
path: qmd://content-sessions:-Users-mike-code-docs/1e0256c4-12d5-44d5-a283-6e5f8aa24dc3.md
score: 0.35
title: User

QMD Search Engine Upgrade Report

Date: 2026-03-20
Upgrade: embeddinggemma-300M (768-dim) -> Qwen3-Embedding-0.6B (1024-dim)
Host: artbird (RTX 3070, 8GB VRAM, i7-9700)

What was done

Code changes (3 commits)

| Hash | Message |
|---|---|
| 291cf0c8 | feat: add benchmark-search subcommand and upgrade QMD embedding model |
| 4423e081 | fix: restart services when systemd unit files change |
| 5173a9d6 | fix: pass QMD env vars to UI server and its subprocesses |

Step-by-step

  1. Verified GPU health - RTX 3070 healthy, CUDA enabled, all 6 systemd services active
  2. Created benchmark-search subcommand - 13 test queries across 5 categories (keyword, semantic, cross-collection, long-context, structured), YAML output, comparison mode
  3. Ran "before" benchmark - All 13 queries passed, baseline captured
  4. Updated systemd unit - Added QMD_EMBED_MODEL and QMD_EXPAND_CONTEXT_SIZE=4096 env vars to qmdctl.service
  5. Added sandcastles entry - Tracked GGUF model filename as ephemeral
  6. Updated CLAUDE.md - Added benchmark-search to subcommands list
  7. Fixed provisioner - Services now restart when their unit files change (was only starting stopped ones)
  8. Deployed to artbird - Provisioned, restarted MCP server with new env vars
  9. Re-embedded all documents - Cleared index, ran qmd update + qmd embed with Qwen3 model
  10. Fixed dimension mismatch bug - UI server's server.ts hardcoded a subprocess ENV that didn't pass through QMD_* env vars. Also added env vars to qmdctl-ui systemd unit.
  11. Ran "after" benchmark and comparison

Embedding details

  • Total documents: 25,260
  • Total vectors: 144,781 (avg ~5.7 chunks per doc)
  • Model file: Qwen3-Embedding-0.6B-Q8_0.gguf (639 MB)
  • VRAM usage: ~7 GB for model + compute buffers

Embedding challenges

The embed process required 3 passes to complete all vectors:

| Pass | Vectors embedded | Chunks failed | Duration | Notes |
|---|---|---|---|---|
| 1 | 21,216 | 125,555 | 30 min | MCP server was running, competing for VRAM. CUDA OOM on larger chunks. |
| 2 | 30,304 | 65,266 | 30 min | MCP server stopped first. Processed more chunks. |
| 3 | 93,261 | 0 | ~55 min | All remaining chunks completed. |

Root cause of the multi-pass run: The RTX 3070 has 8 GB VRAM, and the Qwen3 model uses ~7 GB of it (model weights + compute buffers). With five Chrome browsers (5 x 28 MB) and Xorg (128 MB) also on the GPU, there wasn't enough headroom for the largest chunks. Each pass cleared more of the backlog, and the final pass (with the MCP server stopped) completed everything.

Total embedding time: ~2 hours across 3 passes.

Throughput

  • Sustained rate: ~1,000 vectors/min (GPU at 94% utilization, 148W)
  • Per-doc rate: ~175 docs/min
  • This is reasonable for a 0.6B Q8_0 model through node-llama-cpp on an RTX 3070.
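
The throughput figures can be cross-checked against the pass table and the embedding details. The following is a minimal Python sketch of that arithmetic; it uses only numbers reported in this document, and the variable names are illustrative rather than taken from the benchmark tooling.

```python
# Figures copied from the report; variable names are illustrative.
total_docs = 25_260
vectors_per_pass = [21_216, 30_304, 93_261]  # vectors embedded in passes 1-3
minutes_per_pass = [30, 30, 55]              # pass 3 duration is approximate
sustained_vectors_per_min = 1_000            # reported sustained rate

total_vectors = sum(vectors_per_pass)
chunks_per_doc = total_vectors / total_docs
total_minutes = sum(minutes_per_pass)
docs_per_min = sustained_vectors_per_min / chunks_per_doc

print(f"total vectors:  {total_vectors:,}")
print(f"chunks per doc: {chunks_per_doc:.1f}")
print(f"total time:     ~{total_minutes} min")
print(f"docs per min:   ~{docs_per_min:.0f}")
```

The per-pass counts add up to the reported vector total, and dividing the sustained vector rate by the average chunks per document lands close to the reported per-doc rate.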

Benchmark results

Summary

| Metric | Value |
|---|---|
| Queries improved | 1 |
| Queries regressed | 1 |
| Queries unchanged | 11 |
| Total latency before | 34,297 ms |
| Total latency after | 12,021 ms |
| Latency improvement | 65% faster |
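
The summary figures can be reproduced from the two benchmark runs. The sketch below is one way to do it in Python, assuming a query counts as "improved" or "regressed" when its top score moves; the report does not state how the comparison mode actually classifies queries, so that rule and the `summarize` helper are assumptions.

```python
# (latency_ms, top_score) per query, in benchmark order, copied from the
# "before" and "after" runs. None marks a query that returned no results.
BEFORE = [
    (396, 0.88), (413, 0.93), (357, 0.93),        # keyword
    (392, 0.94), (576, 0.89), (375, 0.97),        # semantic
    (374, 0.90), (361, 0.94),                     # cross-collection
    (703, 0.97), (484, None),                     # long-context
    (5632, 0.93), (12830, 0.90), (11404, 0.90),   # structured
]
AFTER = [
    (484, 0.88), (413, 0.93), (405, 0.93),
    (398, 0.94), (528, 0.89), (409, 0.97),
    (385, 0.90), (363, 0.94),
    (730, 0.97), (524, None),
    (1570, 0.93), (2783, 0.91), (3029, 0.88),
]


def summarize(before, after):
    """Classify queries by top-score change and total up the latencies.

    Assumption: a missing top score (no results) is treated as 0.0.
    """
    counts = {"improved": 0, "regressed": 0, "unchanged": 0}
    for (_, b_score), (_, a_score) in zip(before, after):
        b, a = b_score or 0.0, a_score or 0.0
        if a > b:
            counts["improved"] += 1
        elif a < b:
            counts["regressed"] += 1
        else:
            counts["unchanged"] += 1
    total_before = sum(latency for latency, _ in before)
    total_after = sum(latency for latency, _ in after)
    pct_faster = round(100 * (1 - total_after / total_before))
    return counts, total_before, total_after, pct_faster


counts, total_before, total_after, pct_faster = summarize(BEFORE, AFTER)
print(counts, total_before, total_after, f"{pct_faster}% faster")
```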

Category breakdown

Keyword queries (3): Identical scores, identical top-5 results, similar latency. These use BM25 (no embedding), so the model change has no effect.

Semantic queries (3): Identical scores and results. The reranker (Qwen3-Reranker, unchanged) dominates scoring here.

Cross-collection queries (2): Identical scores and results.

Long-context queries (2): No change. "CLI architecture" returns 1 result, "daemon architecture" returns 0. These depend more on the reranker and query expansion model than embeddings.

Structured queries (3): This is where the upgrade shines:

  • hyde: Telegram bot security - Same top score (0.93), but 3.6x faster (5632ms -> 1570ms). Different top-5 results (1/5 overlap) - the new model finds different relevant docs.
  • expand: vector embeddings - Score improved 0.90 -> 0.91, 4.6x faster (12830ms -> 2783ms). Completely different top-5 (0/5 overlap).
  • expand: file watching - Score regressed slightly 0.90 -> 0.88, 3.8x faster (11404ms -> 3029ms). Some overlap (2/5).
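
The speedup and overlap figures for the structured queries follow directly from the benchmark data. A small Python sketch, with hypothetical helper names (`top_k_overlap`, `speedup`) and the docids copied from the two runs:

```python
def top_k_overlap(before_ids, after_ids):
    """Number of docids that appear in both top-k result lists."""
    return len(set(before_ids) & set(after_ids))


def speedup(before_ms, after_ms):
    """Latency ratio, rounded to one decimal place as in the report."""
    return round(before_ms / after_ms, 1)


# Top-5 docids for "hyde: Telegram bot security".
hyde_before = ["#c0f33e", "#c7fa4a", "#5e20e7", "#2c87e1", "#6f460e"]
hyde_after = ["#5e20e7", "#c65657", "#09060a", "#6eff58", "#9faccf"]

# Top-5 docids for "expand: file watching".
watch_before = ["#9b5302", "#52eaed", "#9cf879", "#9addae", "#8ed70c"]
watch_after = ["#e43639", "#9b5302", "#6d7a2f", "#9cf879", "#14d28f"]

print(top_k_overlap(hyde_before, hyde_after), speedup(5632, 1570))
print(top_k_overlap(watch_before, watch_after), speedup(11404, 3029))
```

Overlap is computed on docids only, so a document that moves from rank 1 to rank 5 still counts as shared.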

Key takeaway

The upgrade's main impact is on structured queries (hyde/expand), which are 3.6-4.6x faster. This makes sense - the larger context window (32K vs 2K) means the query expansion model can process more context in fewer passes. Score quality is roughly neutral: one top score rose by 0.01 and one fell by 0.02. The basic search path (keyword, semantic) is unaffected since it uses BM25 + reranking rather than embedding similarity.

Bugs discovered and fixed

1. Provisioner doesn't restart changed services

Problem: provision-service only started stopped services. If a service was already running with old config, it stayed on the old config until manually restarted.

Fix: Track which units were rewritten and restart those specifically (commit 4423e081).
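
The provisioner code itself is not shown in this report, so the following is only a sketch of the idea in Python: compare each rendered unit file with what is installed, restart running services whose unit changed, and start anything that is not running. The function name and its arguments are hypothetical.

```python
def plan_service_actions(rendered_units, installed_units, active_services):
    """Decide which services to restart and which to start.

    rendered_units:  {unit name: unit file text the provisioner wants}
    installed_units: {unit name: unit file text currently on disk}
    active_services: set of unit names that are currently running
    """
    # A unit counts as changed if it is new or its text differs from disk.
    changed = {
        name
        for name, text in rendered_units.items()
        if installed_units.get(name) != text
    }
    # Running services with a rewritten unit need a restart to pick it up.
    to_restart = sorted(name for name in changed if name in active_services)
    # Stopped services only need a start (the old behaviour covered this).
    to_start = sorted(
        name for name in rendered_units if name not in active_services
    )
    return to_restart, to_start


rendered = {
    "qmdctl.service": "Environment=QMD_EMBED_MODEL=new",
    "qmdctl-ui.service": "unchanged",
    "other.service": "unchanged",
}
installed = {
    "qmdctl.service": "Environment=QMD_EMBED_MODEL=old",
    "qmdctl-ui.service": "unchanged",
    "other.service": "unchanged",
}
active = {"qmdctl.service", "qmdctl-ui.service"}

to_restart, to_start = plan_service_actions(rendered, installed, active)
print("restart:", to_restart, "start:", to_start)
```

In a real provisioner the two lists would typically be applied with `systemctl daemon-reload` followed by `systemctl restart` and `systemctl start`.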

2. UI server doesn't pass QMD env vars to subprocesses

Problem: server.ts hardcoded a subprocess ENV with only HOME and PATH. The QMD_EMBED_MODEL env var set in the systemd unit was never passed through to qmd query subprocesses, causing 768-dim query embeddings against a 1024-dim index (dimension mismatch error).

Fix: Pass through all QMD_* env vars from process.env to subprocess ENV. Also added the env vars to the qmdctl-ui systemd unit (commit 5173a9d6).
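
The actual fix lives in server.ts and is not reproduced here. As an illustration of the same pass-through logic, here is a Python sketch; `build_subprocess_env` is a hypothetical helper, and the base keys HOME and PATH are taken from the problem description above.

```python
import os


def build_subprocess_env(parent_env, prefix="QMD_"):
    """Build a minimal child environment that keeps every QMD_* variable."""
    # Start from the small fixed set the subprocess always received.
    env = {key: parent_env[key] for key in ("HOME", "PATH") if key in parent_env}
    # Forward every variable with the prefix instead of naming them one by
    # one, so settings added to the unit later are picked up automatically.
    env.update(
        {key: value for key, value in parent_env.items() if key.startswith(prefix)}
    )
    return env


# Example parent environment (values are placeholders).
parent = {
    "HOME": "/home/user",
    "PATH": "/usr/bin",
    "QMD_EMBED_MODEL": "example-model",
    "QMD_EXPAND_CONTEXT_SIZE": "4096",
    "UNRELATED_SECRET": "not forwarded",
}
child_env = build_subprocess_env(parent)
print(sorted(child_env))

# With the real environment, pass the result as env= to subprocess.run().
real_child_env = build_subprocess_env(dict(os.environ))
```

Forwarding by prefix keeps the child environment small while making sure the query subprocess embeds with the same model as the index, which is what avoids the 768-dim versus 1024-dim mismatch.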

Open items

Model persistence in VRAM

QMD's MCP server has disposeModelsOnInactivity: true hardcoded with a 5-minute timeout. After 5 minutes of no queries, the embedding model unloads from VRAM. Reloading takes ~15 seconds. Options:

  1. Upstream QMD change to make this configurable via env var
  2. Keep-warm cron that sends a search query every few minutes

Chrome browsers using discrete GPU

Five browserctl Chrome instances use ~140 MB of RTX 3070 VRAM for compositing. The i7-9700 has an iGPU (UHD 630), but it appears to be disabled in the BIOS. The --disable-gpu flag would move Chrome rendering to software. Low priority, since 140 MB is small relative to 8 GB.

"daemon architecture" query returns 0 results

Both before and after, this long-context query returns no results. May indicate a gap in the indexed content or a reranker threshold issue.
