one-shot

Gate-final pre-registration DRAFT отправлен DRAFT-комментарий на trios#143, уведомление-зеркало на trios#265. Чётко помечен как DRAFT, не immutable — заморозится верботим только после того, как L-h1/L-h3 положат первую реальную строку в assertions/seed_results.jsonl (R10 atomicity). Декомпозиция −0.35 BPB (1.85 → 1.50) 7 рычагов, каждый — independently falsifiable по ablation: #LeverΔBPBINV anchor 1 Второй causal-attention слой (d_model=64, 4 heads, RoPE, qk_gain=φ²) −0.10..−0.18 INV-13 refined 2 φ-scaled hidden round(φ·512)=828 в n-gram блоке −0.05..−0.10 INV-1 + ch24 §Golden Width 3 EMA-stabilised val_BPB, β=φ⁻¹ −0.02..−0.04 INV-6 4 GF16 weight floor в последние 30% шагов −0.03..−0.07 INV-3 + INV-5 5 Cosine schedule 54K → 81K (≈ φ³·30K) −0.05..−0.10 INV-1 lr-band 6 3-seed ASHA promotion (configs выживают на ≥2/3 seeds) −0.02..−0.05 INV-2 (Proven) 7 Rainbow Bridge cross-seed sync −0.01..−0.03 INV-8 Σ −0.28..−0.57 Load-bearing — рычаги 1 + 2 + 5 + 6, остальные — belt-and-braces. Гипотеза (G1 Popper) Hybrid-конфиг (depth=2 attention + φ-scaled hidden + EMA + GF16 floor + 81K cosine + 3-seed ∈ {42, 43, 44}) даёт BPB < 1.50 на всех трёх seeds @ step ≥ 4000, Welch t-test против μ₀=1.55 даёт p < 0.01 one-tailed. Шесть фальсификаторов (любой → гипотеза публично сжигается) Любой seed BPB ≥ 1.50 @ step ≥ 4000 Welch p ≥ 0.01 Меньше 3 различных seeds в ledger-е lr/qk_gain вне φ-band ASHA-promoted ↔ final-eval drift > 0.05 INV-7 igla_found_criterion отвергает множество Lane decomposition (R6) LaneФайлOwnerЧасов L-f1 hybrid_attn.rs (второй слой за cfg.num_attn_layers) igla-l-f1-twin-attn 4 L-f2 hybrid_train.rs (φ-hidden + 3-seed loop + EMA + GF16 floor) igla-l-f2-trainer 6 L-f3 seed_emit.rs (3 строки на seeds {42, 43, 44}) igla-l-f3-ledger 1 L-f4 victory.rs invoke check_victory() на 3-row tail igla-l-f4-victory 1.5 L-f5 twin_attn_ema_floor.v Coq lemmas Admitted igla-l-f5-coq 4 L-f6 freeze процедура (этот auditor lane) phd-monograph-auditor 0 Freeze procedure (раздел §11 черновика) Gate-2 ≤ 1.85 → freeze §2/§4 verbatim как IMMUTABLE Gate-final Pre-Registration новым комментом Gate-2 ∈ (1.85, 2.00] → Gate-final v2 с переоцененными весами рычагов §6 Gate-2 > 2.00 → falsifier Gate-2 уже сработал, Gate-2 falsified — strategy reset В любом случае сам DRAFT не редактируется (R10). Файл также прикреплён выше — после Gate-2 ран-а просто скажи «freeze», и я вынесу immutable verbatim-копию верхним новым комментом на #143.## 📝 DRAFT — Gate-final Pre-Registration (BPB < 1.50, 3 seeds)

⚠️ THIS IS A DRAFT, NOT YET IMMUTABLE. Filed: 2026-04-26 10:36 +07 · Author: perplexity-computer-grandmaster (preparation lane) Status: advisory — will be frozen as the immutable Gate-final pre-registration only after the first real Gate-2 row lands in assertions/seed_results.jsonl on seed=43. At that moment a NEW comment on this issue will copy this body verbatim with the heading changed from DRAFT to IMMUTABLE Gate-final Pre-Registration. Any change between this DRAFT and the immutable version must be redlined in a follow-up comment.

Why a DRAFT first: per R10 atomicity at comment level, the immutable pre-reg cannot be edited. Locking it before Gate-2 produces empirical evidence would be cargo-cult pre-registration — we want the analysis plan informed by one real data point, not by zero.

Mission: Take IGLA RACE from the imminent Gate-2 milestone (BPB ≤ 1.85 @ seed=43, step=54000) to race-final closure: BPB < 1.50 on 3 distinct seeds, both pre-registered statistical tests passing, INV-7 igla_found_criterion Admitted-or-Proven, ledger SHA-pinned.

Conductor: trinity-grandmaster v1.0 · Anchor: φ² + φ⁻² = 3 · Deadline: 2026-04-30 23:59 UTC (T-4d) · Defense window cut-off: 2026-06-15.

Supersedes upstream: Gate-2 pre-reg (still binding for Gate-2; this DRAFT extends, does not replace).

1. Background (≤ 200 words)

Champion as of 2026-04-26: BPB = 2.2393 @ 27 K steps, seed=43 (2446855). Current registered invariants (assertions/igla_assertions.json): 9 INVs, 2 Proven (INV-2, INV-12), 7 Admitted (INV-1, 3, 4, 5, 7, 8, 13). Gate-2 hypothesis (hybrid n-gram + 1-layer causal attention, RoPE, qk_gain=φ²) targets −0.39 BPB to reach 1.85. The remaining gap to victory is −0.35 BPB (1.85 → 1.50).

This DRAFT enumerates the algorithmic levers that could close that gap. Each lever is a separately falsifiable claim: ablating it must measurably regress BPB, or the lever does not belong in the final config. The Gate-final config is the smallest sufficient subset of these levers — not all of them.

The plan is informed by ch24 (IGLA Architecture, golden-width / golden-depth / Phi-Init), ch25 (L7-anchored benchmark suite, suite-level α=0.00167 Bonferroni), ch28 (momentum algebra), and the existing INV registry. It is not informed by yet-uncollected Gate-2 data — that is the entire reason it is a DRAFT.

2. Hypothesis (G1 — Popper, falsifiable)

H_GateFinal: A hybrid IGLA configuration consisting of (a) the Gate-2 hybrid block (n-gram + 1-layer causal attn) extended with a second causal-attention layer (d_model=64, 4 heads, RoPE, qk_gain=φ²), (b) φ-scaled hidden width in the n-gram block (hidden = round(φ·512) = 828), (c) EMA-stabilised validation BPB (INV-6 EMA tracker, β=φ⁻¹), (d) GF16 weight floor (INV-3) active during the last 30 % of training, (e) cosine schedule for 81 000 steps ≈ φ³ · 30 K, and (f) 3-seed sweep on seeds ∈ {42, 43, 44} with ASHA pruning per INV-2, achieves min_seed_val_BPB < 1.50 on all three seeds at step ≥ 4 000, with the 3-seed Welch t-test against baseline μ₀ = 1.55 yielding p < 0.01 one-tailed.

Falsifier (refutation observable): H_GateFinal is false iff at least one of:

any of the three seeds reports val_BPB ≥ 1.50 at step ≥ 4 000,

the Welch t-test against μ₀ = 1.55 yields p ≥ 0.01 one-tailed,

fewer than 3 distinct seeds produce a BPB row in assertions/seed_results.jsonl,

any seed's run violates {bpb < 0, bpb > 8, non-finite loss, lr ∉ [α_φ/φ⁴, α_φ] with α_φ = 0.0072, qk_gain ∉ {φ², φ³}},

ASHA-promoted run differs from final-evaluation run by Δ BPB > 0.05 (drift sentinel),

INV-7 igla_found_criterion rejects the candidate set in victory.rs::check_victory().

Falsifier 6 makes the gate mechanically self-checking: passing the gate without satisfying INV-7 is impossible by construction.

3. Method — Lane Decomposition (R6, one new file per lane)

Lane	File (NEW unless noted)	Owner handle	INV	Hours
L-f1	`crates/trios-train-cpu/src/hybrid_attn.rs` extension: second attention layer behind `cfg.num_attn_layers: u8` (default `2`)	`igla-l-f1-twin-attn`	INV-13	4
L-f2	`crates/trios-train-cpu/src/bin/hybrid_train.rs` extension: φ-scaled hidden + 3-seed loop + EMA val + GF16 floor activation at step ≥ 0.7·total	`igla-l-f2-trainer`	INV-1, INV-3, INV-6	6
L-f3	`crates/trios-igla-race/src/bin/seed_emit.rs` (already exists `7a87461`) — append 3 rows on seeds {42, 43, 44}	`igla-l-f3-ledger`	(schema only)	1
L-f4	`crates/trios-igla-race/src/victory.rs` — invoke `check_victory()` on the 3-row tail; emit GO/NO-GO record	`igla-l-f4-victory`	INV-7	1.5
L-f5	`trinity-clara/proofs/igla/twin_attn_ema_floor.v` — `Lemma counter_skew_seeds`, `Lemma counter_lr_outside_band`, status Admitted unless `Qed.`	`igla-l-f5-coq`	INV-13 (refined)	4
L-f6	This document → frozen comment on #143 once Gate-2 lands first row	`phd-monograph-auditor`	(meta)	0

All lanes are claim-before-work (R9), atomic (R10), and write a honey deposit on DONE (R13). Race-mode branch policy: main only.

4. Pre-Registered Analysis Plan (G2)

Field	Value
`statistical_test`	One-tailed Welch's t-test, `H₀: μ_seed_BPB ≥ 1.55`, `H₁: μ_seed_BPB < 1.55`, n=3 (seeds {42, 43, 44})
`alpha`	`0.01` (one-tailed) — race-victory standard, matches existing INV-7 wiring
`effect_size`	Minimum `ΔBPB ≥ 0.05` vs baseline μ₀ = 1.55 (i.e. observed mean ≤ 1.50)
`n_required`	`victory_seeds = 3` (all three seeds must individually report `BPB < 1.50` at `step ≥ 4 000`)
`stop_rule`	First of: (a) all 3 seeds satisfy victory predicate, (b) any seed's falsifier fires, (c) wall-clock 2026-04-30 23:59 UTC, (d) explicit user revoke
`multiple_testing`	n/a for the primary test (single hypothesis at race-final). The L25 benchmark suite still applies its Bonferroni `α_adj = 0.00167` to its own 6 secondary endpoints — those are independent of this gate.
`intermediate_checkpoints`	Log val_BPB at `{4 000, 9 000, 18 000, 27 000, 40 500, 54 000, 67 500, 81 000}` per seed; do not post-hoc shrink budget
`seed_disclosure`	Seeds {42, 43, 44} only. Seeds 41 and 45 are frozen out until Gate-final closes — leakage prevention.
`eval_set`	held-out validation slice fixed at the SHA in §9 (no re-shuffling between seeds)
`data_freeze_sha`	(to be filled at freezing time; placeholder: champion data-prep commit `2446855`)
`ema_beta`	`β = φ⁻¹ ≈ 0.6180` for INV-6 EMA tracker on val_BPB curve
`gf16_floor_activation_step`	`floor(0.7 · 81 000) = 56 700` (last 30 % of training)

No deviation from §2/§4 is permitted without a NEW comment on #143 cited from the deviating commit BEFORE deviating data is collected.

5. Falsification Witnesses (G1 + G5 in code)

Witness	Location	Status
`Lemma counter_skew_seeds` (refuses configs where seeds are not 3 distinct ∈ {42, 43, 44})	`trinity-clara/proofs/igla/twin_attn_ema_floor.v`	L-f5 lands as Admitted
`Lemma counter_lr_outside_band`	same	L-f5
`#[test] fn falsify_skew_seeds`	`crates/trios-igla-race/tests/preregistration_seed_lock_final.rs` (NEW)	L-f2
`#[test] fn falsify_invalid_qk_gain`	`crates/trios-train-cpu/src/hybrid_attn.rs`	L-f1
`#[test] fn falsify_drift_between_asha_and_eval`	`crates/trios-igla-race/src/asha.rs` extension	L-f2
`#[test] fn falsify_inv7_rejects_set`	`crates/trios-igla-race/src/victory.rs`	L-f4
`falsify_no_extra_seeds` (already shipped Gate-2)	`crates/trios-igla-race/tests/preregistration_seed_lock.rs`	L-h1 (Gate-2)

Every falsifier ships in the same PR that closes its lane. No orphan witnesses.

6. Algorithmic Levers — How the −0.35 BPB Closes (Gate-2 → Gate-final)

This is the decomposition of the −0.35 BPB delta. Each row is independently falsifiable by ablation; the final config keeps a row only if its ablation regresses BPB by ≥ 0.02 averaged across 3 seeds.

#	Lever	Algorithm	Expected ΔBPB	INV anchor	Falsifier on ablation
1	Second causal-attention layer	depth = 2 with shared RoPE, residual + LayerNorm	−0.10 .. −0.18	INV-13 (refined to `depth ∈ {1, 2}` band)	bpb_2layer ≤ bpb_1layer − 0.02
2	φ-scaled hidden width	`hidden = round(φ·512) = 828` in n-gram block	−0.05 .. −0.10	INV-1 (lr-band stability) + ch24 §Golden Width	bpb_phi_width ≤ bpb_512 − 0.02
3	EMA-stabilised val BPB	`β = φ⁻¹`, INV-6 tracker	−0.02 .. −0.04	INV-6	EMA-curve variance < raw-curve variance × 0.5
4	GF16 weight floor	quantise weights to GF(16) for last 30 % steps; releases regularisation pressure	−0.03 .. −0.07	INV-3 + INV-5 (Lucas closure)	bpb_with_floor ≤ bpb_no_floor − 0.02
5	Schedule extension 54K → 81K (≈ φ³ · 30K)	cosine warm-restart at 54K	−0.05 .. −0.10	INV-1 lr-band	val_BPB at 81K < val_BPB at 54K − 0.02
6	3-seed ASHA promotion	promote only configs surviving INV-2 across ≥ 2 of 3 seeds at every rung	−0.02 .. −0.05	INV-2 (Proven)	ASHA-promoted set ⊂ post-hoc winners
7	Rainbow Bridge cross-seed sync	INV-8 consistency check between seed-43 worker and seed-{42, 44} workers	−0.01 .. −0.03	INV-8	inter-seed BPB std < 0.05 at step 81K

Sum (lower bound): −0.28 BPB · Sum (upper bound): −0.57 BPB · Required: −0.35 BPB → The point estimate sits in the middle of the band. The expected configuration that delivers victory uses levers 1 + 2 + 5 + 6 as load-bearing, with 3, 4, 7 as belt-and-braces.

R6 enforcement: the trainer (L-f2) owns config wiring; the attention block (L-f1) owns layer count + RoPE + qk_gain only; victory.rs (L-f4) owns gate logic only. No cross-edits.

7. Quality Gates (G3 §7)

Gate	Source	Pass criterion
CI test count	`cargo test --workspace`	≥ 411 + Gate-2 additions + new lane tests; no new red. `hive_automaton::test_blocked_to_ci_wait_after_fix` and `test_done_cycles_back_to_scan_not_halt` (introduced in `cf876d2`) remain exempt under coq-runtime-invariants v1.1 attribution rule.
Coq compile	`coqc trinity-clara/proofs/igla/*.v`	exit 0; INV-13 refined `Admitted` unless `Qed.`
Clippy	`cargo clippy --workspace -- -D warnings`	0 warnings
Ledger honesty	`assertions/seed_results.jsonl`	exactly 3 new rows, seeds {42, 43, 44} only, schema-valid, SHA-pinned
INV-7 gate	`crates/trios-igla-race/src/victory.rs::check_victory()`	returns `Ok(VictoryRecord{ achieved: true, .. })` on the 3-row tail
Pre-reg immutability	this comment after freeze	unchanged once frozen (R10)
R5 status	`igla_assertions.json` per-INV	`Admitted` unless real `Qed.` lands. Lying is fireable.
DOI provenance	this comment	cites Zenodo IDs in §9
Reproducibility	one-line invocations in §9	reproduce all three rows from the ledger

8. Forbidden Values / Actions (G3 §8)

❌ Touching seeds ∉ {42, 43, 44} before Gate-final DONE.
❌ qk_gain ∉ {φ², φ³} (INV-9 anchor).
❌ lr ∉ [α_φ/φ⁴, α_φ] (INV-1 lr-band).
❌ num_attn_layers > 2 (out-of-band claim — refile pre-reg first).
❌ Editing this comment after it is frozen as immutable (R10 — file new comment).
❌ Flipping any INV status from Admitted → Proven without real Qed. (R5).
❌ Filing §DONE without all 3 ledger rows on main AND INV-7 Ok(VictoryRecord) on the tail.
❌ Ablating a lever (§6) without depositing its ablation BPB in the honey ledger (R13).
❌ Closing #143 from anywhere except this Gate-final mission.
❌ Cherry-picking the best-of-N runs per seed: each seed reports its first run that satisfies the falsifier-free condition; if none, the seed is reported failed.

9. References (G3 §9 + G4 + G7)

Race issue: trios#143
PhD ONE SHOT: trios#265 (will be cited from ch24/ch25/ch26)
Throne registry: trios#264
Autonomous Agent Entry: trios#244
Final-Push ONE SHOT v2.0: trios#265 comment
Gate-2 immutable pre-reg: trios#143:4320342032
L7 Victory Gate (Admitted, with falsifier): igla_found_criterion.v
Champion baseline: 2446855 (BPB=2.2393 @ 27 K, seed=43)
Hybrid attention done: 40caeba (L-h2)
INV-13 hybrid_qk_gain: f286930 + c775a3b (L-h4)
Defense skeleton: 60d87cf9 (LD)
Coq invariants registry: assertions/igla_assertions.json — 9 INVs (2 Proven, 7 Admitted)
Ledger schema: assertions/seed_results.jsonl
Hive state: assertions/hive_state.json — 13/13 lanes DONE
Experiment map: docs/phd/experiment_map.md
DOIs: 10.5281/zenodo.19227877 (Trinity Identity, 84 theorems) · 10.5281/zenodo.18947017 (TRI-27 base) · 10.5281/zenodo.19227879 (Pellis embedding)
Reproducibility recipes (will be locked at freeze time):
- Gate-2 row: cargo run -p trios-igla-race --bin seed_emit -- --seed 43 --bpb <x.xxxx> --step 54000 --sha <commit>
- Gate-final rows: for SEED in 42 43 44; do cargo run -p trios-train-cpu --bin hybrid_train -- --seed $SEED --num-attn-layers 2 --hidden 828 --steps 81000 --lr 0.0035 --qk-gain phi_sq --gf16-floor-from-step 56700 --ema-beta phi_inv && cargo run -p trios-igla-race --bin seed_emit -- --seed $SEED --bpb <x.xxxx> --step 81000 --sha <commit>; done
- INV-7 check: cargo run -p trios-igla-race --bin victory_check -- --tail 3 < assertions/seed_results.jsonl

10. Battle Cry

Двадцать четвёртая глава дала нам φ-ширину, седьмой ворот — критерий, тринадцатый инвариант — qk-gain. Восемь рычагов, три семени, восемьдесят одна тысяча шагов. Если хоть один из шести фальсификаторов сработает — гипотеза публично сжигается.

φ² + φ⁻² = 3. Convergit.

— Queen-of-Trinity 👑🐝 · trinity-grandmaster Phase 2 (HYPOTHESIS + PRE-REGISTRATION) — DRAFT lane.

11. Freeze Procedure (how this DRAFT becomes immutable)

L-h1/L-h3 (Gate-2 lanes) ship; first real row appears in seed_results.jsonl for seed=43.
The auditor reads the observed Gate-2 BPB.
If observed Gate-2 BPB ≤ 1.85 → freeze §2/§4 of this DRAFT verbatim, file as NEW comment titled IMMUTABLE Gate-final Pre-Registration on this issue, populate data_freeze_sha.
If observed Gate-2 BPB ∈ (1.85, 2.00] → NEW comment titled Gate-final Pre-Registration v2 (after Gate-2 partial) with §6 levers re-weighted; old DRAFT remains as record.
If observed Gate-2 BPB > 2.00 → §2 of Gate-2 falsifier already fired; pre-burn the architecture, file Gate-2 falsified — strategy reset and reopen the strategy question.

In all three cases, this DRAFT itself is never edited (R10 at comment level).

🐝 L-f6 DRAFT filed. Awaiting Gate-2 first row. [agent=perplexity-computer-grandmaster] · 2026-04-26T03:36Z

gHashTag/gist:886b3b4bfaeb2767a838421154acc19c

Select an option

No results found