You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Productionizing ruvector gnn-rerank (#479) — Step 2: CI-guarded recall test
Productionizing ruvector gnn-rerank (#479) — Step 2: CI-guarded recall test
Turned the research win into a regression guard. A deterministic integration test
reproduces the exact regime (N=5000, D=128, noise σ=0.40, seed=42) through the
public reranker API and asserts GnnDiffusion beats the no-rerank baseline.
recall@10: noisy=0.280 gnn=0.384 delta=+0.104 (matches #479 exactly)
test result: ok. 1 passed
What: A 1-hop GNN score-diffusion reranker over approximate-ANN candidate sets that recovers recall lost to quantization. Measured in the source research (#479): recall@10 28.0% → 38.4% (+10.4pp), N=5K D=128, still ~millions of QPS.
Step 1 (implement/integrate):
Pulled crate ruvector-gnn-rerank into the ruvector workspace.
4 reranker variants: NoisyScore (baseline), GnnDiffusion (the win), GnnMincut, ExactL2.
ADR-256: making npx ruvector its own agent harness without adding a dependency (plain-language overview)
ADR-256 in plain language: making npx ruvector its own "agent harness" — without adding a dependency
The question someone asked
"Could we improve the npx ruvector command-line tool by using the npm metaharness SDK?"
metaharness is a brand-new npm package (by the same author as ruvector) that helps you generate a custom AI-agent command-line tool from a repo. The idea sounded appealing: agent tooling, smart model routing, signed releases, memory. Should ruvector adopt it?
What we found (the research)
metaharness is a generator, not a library you plug in. It scaffolds a new tool; it doesn't upgrade an existing, mature one like ruvector.
ruvector already has everything it advertises. The "smart router," the agent tool surface, signed provenance, and memory are all things ruvector already ships.
MetaHarness — what's new + how to use it (June 2026 release): all 9 hosts, 3 real-install schema fixes, @metaharness/harness control plane
MetaHarness — what's new + how to use it (June 2026 release)
MetaHarness mints a custom AI agent harness for any repo, on any host. You run one command, pick a host (Claude Code, Codex, OpenCode, …), and get a ready-to-use, npm-publishable harness with that host's native config files.
This release makes the multi-host promise actually true — and was hardened by installing and running each real host against generated harnesses.
MetaHarness × Ruflo — Integration Research Dossier (ADR-150 companion)
MetaHarness × Ruflo — Integration Research Dossier
Prepared: 2026-06-16
Companion artifacts: ADR-150 (decision) · tracking issue (phases) · this gist (research)
Research method: direct npm registry inspection, tarball extraction + .d.ts audit, live source code read across both repos. All claims graded HIGH / MED / LOW.
DRACO: does a deep-research harness beat a single strong LLM call? Honest, mechanistic answer (ADR-038) — 4 frontier OpenRouter runs, every arm measured & not gamed. The scorer's grounding-as-fraction caps every transform/select/union strategy; a single strong call is at the ceiling.
DRACO: does a deep-research harness beat a single strong LLM call? (honest, mechanistic answer: no — and what to do instead)
DRACO scores a cited research dossier on grounding (fraction of cited URLs that resolve), coverage, balance, cleanliness, and faithfulness, across 5 domains. Full investigation in ruvnet/agent-harness-generator (ADR-037→040), every number measured against live OpenRouter frontier models, key from GCP Secret Manager, nothing gamed.
Phase 1 — Can a harness beat the model? No (mechanistically).
This gist records a full review pass: bug fixes, a native latency optimization, a
live OpenRouter-fusion benchmark wired to a GCP-stored key, and a publishable
agent harness.
A one-command scaffold for a non-interactive AI agent harness that runs on the GitHub Actions runner — the 9th MetaHarness host (ADR-033) and the only one built for CI/CD.
Intro
Every other MetaHarness host is interactive: a human starts a session and watches the agent work. GitHub Actions is different in kind. There's no human at the keyboard — execution is triggered by a webhook (a push, a pull request, an issue comment, or a schedule), the "operator" is the runner, and the harness must complete its task autonomously, emit structured output, and exit cleanly.
@metaharness/github-actions is a thin wrapper around the metaharness CLI. Running it is equivalent to npx metaharness my-bot --template minimal --host github-actions.