- Test ID:
timeline_integrity-002-20260522T173600 - Total Scenarios: 1
- Passed Scenarios: 1 (GREEN Production)
- Failed Scenarios: 0
- Compound Player Input Handling: PASS ✅
- Spatial Localization: PASSED ✅ (entire narrative set strictly in Silvershield Annex strategy room).
- Instruction Retention: PASSED ✅ (off-site tailing of Grog-Mar is delegated to Sylas via a nod before departing; prisoner check is handled prior to Annex arrival in a clean transition).
| Claim | File | Key Field-Level Verification |
|---|---|---|
| Perfect spatial localization & delegation | artifacts/repro.log | Output showing clean delegation and localized Strategy Room treaty |
| Fully organic request/response capture | artifacts/llm_request_responses.jsonl | gemini-3-flash-preview full organic prompt and response captured to disk |
Proves:
- The Geographical & Historical Integrity Protocol successfully resolves timeline splicing and instruction dropout for compound inputs on
gemini-3-flash-preview. - The system correctly logs 100% organic LLM requests/responses under
CAPTURE_RAW_LLM="true".
Does NOT Prove:
- Behavior on other LLM providers (e.g. Claude, Llama) which are not tested under this specific iteration.