Skip to content

Instantly share code, notes, and snippets.

View ruvnet's full-sized avatar
💭
hacking the multiverse.

rUv ruvnet

💭
hacking the multiverse.
View GitHub Profile
@ruvnet
ruvnet / gist479-3.md
Created June 17, 2026 22:14
Productionizing ruvector gnn-rerank (#479) — Step 3: honest benchmark

Productionizing ruvector gnn-rerank (#479) — Step 3: honest benchmark

Measured the rerank hot path under cargo test --release:

variant latency (µs/q) throughput
NoisyScore (no rerank, baseline) ~0.15 ~7.0M QPS
GnnDiffusion (+10.4pp recall) ~400 ~2.5K QPS

The honest correction: the "~millions of QPS" associated with #479 is the

@ruvnet
ruvnet / gist479-2.md
Created June 17, 2026 22:10
Productionizing ruvector gnn-rerank (#479) — Step 2: CI-guarded recall test

Productionizing ruvector gnn-rerank (#479) — Step 2: CI-guarded recall test

Turned the research win into a regression guard. A deterministic integration test reproduces the exact regime (N=5000, D=128, noise σ=0.40, seed=42) through the public reranker API and asserts GnnDiffusion beats the no-rerank baseline.

recall@10:  noisy=0.280   gnn=0.384   delta=+0.104   (matches #479 exactly)
test result: ok. 1 passed
@ruvnet
ruvnet / gist479-1.md
Created June 17, 2026 22:05
Productionizing ruvector gnn-rerank (#479) — Step 1: baseline

Productionizing ruvector gnn-rerank (#479) — Step 1: baseline

What: A 1-hop GNN score-diffusion reranker over approximate-ANN candidate sets that recovers recall lost to quantization. Measured in the source research (#479): recall@10 28.0% → 38.4% (+10.4pp), N=5K D=128, still ~millions of QPS.

Step 1 (implement/integrate):

  • Pulled crate ruvector-gnn-rerank into the ruvector workspace.
  • 4 reranker variants: NoisyScore (baseline), GnnDiffusion (the win), GnnMincut, ExactL2.
  • 14/14 unit tests pass.
  • Branch: feat/productionize-gnn-rerank.
@ruvnet
ruvnet / adr-256-gist.md
Created June 17, 2026 03:15
ADR-256: making npx ruvector its own agent harness without adding a dependency (plain-language overview)

ADR-256 in plain language: making npx ruvector its own "agent harness" — without adding a dependency

The question someone asked

"Could we improve the npx ruvector command-line tool by using the npm metaharness SDK?"

metaharness is a brand-new npm package (by the same author as ruvector) that helps you generate a custom AI-agent command-line tool from a repo. The idea sounded appealing: agent tooling, smart model routing, signed releases, memory. Should ruvector adopt it?

What we found (the research)

  1. metaharness is a generator, not a library you plug in. It scaffolds a new tool; it doesn't upgrade an existing, mature one like ruvector.
  2. ruvector already has everything it advertises. The "smart router," the agent tool surface, signed provenance, and memory are all things ruvector already ships.
@ruvnet
ruvnet / oia-implementation-gist.md
Created June 16, 2026 19:43
Ruflo × MetaHarness Integration: OIA-Layered Walkthrough (ADR-150 implementation across 28 /loop iterations)

Ruflo × MetaHarness Integration: An OIA-Layered Walkthrough

Plain-language guide to what we built across 28 iterations of /loop work.

Companion to ADR-150 · Tracking issue #2399 · Upstream bug #9


What is OIA?

@ruvnet
ruvnet / metaharness-release-guide.md
Created June 16, 2026 19:28
MetaHarness — what's new + how to use it (June 2026 release): all 9 hosts, 3 real-install schema fixes, @metaharness/harness control plane

MetaHarness — what's new + how to use it (June 2026 release)

MetaHarness mints a custom AI agent harness for any repo, on any host. You run one command, pick a host (Claude Code, Codex, OpenCode, …), and get a ready-to-use, npm-publishable harness with that host's native config files.

This release makes the multi-host promise actually true — and was hardened by installing and running each real host against generated harnesses.


Install / use

@ruvnet
ruvnet / metaharness-dossier.md
Created June 16, 2026 16:40
MetaHarness × Ruflo — Integration Research Dossier (ADR-150 companion)

MetaHarness × Ruflo — Integration Research Dossier

Prepared: 2026-06-16 Companion artifacts: ADR-150 (decision) · tracking issue (phases) · this gist (research) Research method: direct npm registry inspection, tarball extraction + .d.ts audit, live source code read across both repos. All claims graded HIGH / MED / LOW.


1. What @metaharness Is

@ruvnet
ruvnet / draco-gist.md
Last active June 15, 2026 13:41
DRACO: does a deep-research harness beat a single strong LLM call? Honest, mechanistic answer (ADR-038) — 4 frontier OpenRouter runs, every arm measured & not gamed. The scorer's grounding-as-fraction caps every transform/select/union strategy; a single strong call is at the ceiling.

DRACO: does a deep-research harness beat a single strong LLM call? (honest, mechanistic answer: no — and what to do instead)

DRACO scores a cited research dossier on grounding (fraction of cited URLs that resolve), coverage, balance, cleanliness, and faithfulness, across 5 domains. Full investigation in ruvnet/agent-harness-generator (ADR-037→040), every number measured against live OpenRouter frontier models, key from GCP Secret Manager, nothing gamed.

Phase 1 — Can a harness beat the model? No (mechanistically).

Frontier tier (opus-4 / gpt-5 / gemini-2.5-pro), n=20:

arm Δ vs vanilla verdict
@ruvnet
ruvnet / GIST.md
Created June 15, 2026 02:23
Symbolic Scribe PR #2 — review, fixes, native optimization (5.4x compress), live openrouter/fusion benchmark, agent harness

Symbolic Scribe — PR #2 review, fixes, benchmark & optimization

PR: ruvnet/symbolic-scribe#2Rust→WASM PromptOps compiler core + Optimizer UI

This gist records a full review pass: bug fixes, a native latency optimization, a live OpenRouter-fusion benchmark wired to a GCP-stored key, and a publishable agent harness.


@ruvnet
ruvnet / gist-gha.md
Created June 14, 2026 21:12
MetaHarness × GitHub Actions: a one-command non-interactive CI/CD agent harness — .github/workflows + composite action, default-deny via permissions (ADR-033). npx @metaharness/github-actions@latest my-bot

MetaHarness × GitHub Actions

A one-command scaffold for a non-interactive AI agent harness that runs on the GitHub Actions runner — the 9th MetaHarness host (ADR-033) and the only one built for CI/CD.

Intro

Every other MetaHarness host is interactive: a human starts a session and watches the agent work. GitHub Actions is different in kind. There's no human at the keyboard — execution is triggered by a webhook (a push, a pull request, an issue comment, or a schedule), the "operator" is the runner, and the harness must complete its task autonomously, emit structured output, and exit cleanly.

@metaharness/github-actions is a thin wrapper around the metaharness CLI. Running it is equivalent to npx metaharness my-bot --template minimal --host github-actions.