Skip to content

Instantly share code, notes, and snippets.

@grahama1970
grahama1970 / codex_review_bundle.md
Created September 28, 2025 16:50
Codex Review Bundle (Single) 2025-09-28

Codex Review Bundle (Single File)

  • Repo root: /home/graham/workspace/experiments/codex
  • Date: 2025-09-28

Executive Verdict

Readiness: ✅ Top risks are low and localized to config and CLI glue.

Findings by Focus Area

@grahama1970
grahama1970 / REVIEW_BUNDLE_PROMPT.md
Created September 28, 2025 16:35
LiteLLM Smokes Review Bundle Prompt

External Review Prompt (Canonical v3)

Goal: Deliver a blunt, evidence‑backed production‑readiness assessment and a minimal patch set (unified diffs) with tests and doc updates. No broad refactors. Ship safety first.

Reviewer persona & tone

  • Principal SRE/DevOps + AppSec mindset; fluent with Linux userland (systemd, Docker/OCI, rsync/tar), CI, and Python/Node toolchains.
  • Be terse, specific, and fail‑closed. Unverified claims must be called out and marked 🔴 or 🟡.

Project context (declare at top of your report)

  • Project:
@grahama1970
grahama1970 / codex_review_bundle.md
Created September 28, 2025 16:11
Codex Review Bundle (Single) 2025-09-28

Codex Review Bundle (Single File)

  • Repo root: /home/graham/workspace/experiments/codex
  • Date: 2025-09-28

Executive Verdict

Readiness: ✅ Top risks are low and localized to config mapping and CLI glue.

Findings by Focus Area

@grahama1970
grahama1970 / REVIEW_BUNDLE_PROMPT.md
Created September 28, 2025 16:09
Codex Review Bundle 2025-09-28

Codex Review Bundle Prompt (Project‑Tailored)

Goal

  • Produce a concise, evidence‑backed technical review of Codex (Rust workspace: CLI/TUI/core), with minimal patches (unified diffs), tests, and docs. Return a single public Gist URL that contains the full review bundle.

Reviewer profile & tone

  • Senior Rust engineer with AppSec + CLI/TUI experience (cargo, clippy, ratatui, insta snapshots, MCP). Terse, specific, fail‑closed. Mark unverified claims 🔴/🟡.

Project anchors

  • Repo root:
@grahama1970
grahama1970 / review_bundle
Created September 28, 2025 15:11
Codex Review Bundle 2025-09-28
This file has been truncated, but you can view the full file.
# Project Bundle
- Generated: 2025-09-27T17:43:50Z
- Root: /home/graham/workspace/experiments/codex
- Git: 5c67dc3+dirty
- Files: 205
- Bundle Part: 1
- Context Tokens Limit: 400000
---
@grahama1970
grahama1970 / reviewbundle2.ahrcBc.md
Created September 27, 2025 21:43
All-smokes split still timing out: focused diffs (TTL 15m)

All‑Smokes Gate Still Timing Out/Fails in Split — Targeted Debug + Patch Requests Created: 2025-09-27 TTL: Private, delete within 15 minutes after review

Summary

  • We split the composite all_smokes gate into all_smokes_core + all_smokes_nd and added per‑check timeouts, xdist and PYTEST_ADDOPTS pass‑through.
  • The orchestrator no longer dies universally, but we still see:
    1. Harness timeouts under certain runs (improved but still possible on slow hosts).
    2. True FAILs in all_smokes_nd due to env/base mismatches (see below) — these are not red in isolation.
@grahama1970
grahama1970 / reviewbundle.X3ZU3p.md
Created September 27, 2025 20:54
LiteLLM all-smokes readiness: ask + diffs (TTL 15m)

Mini‑Agent + Router Readiness — All‑Smokes Orchestration Timeout (Request for Focused Help) Created: 2025-09-27 TTL: Please treat as private and ephemeral; delete within 15 minutes after review.

  1. Context and Goal
  • Project: LiteLLM fork with env‑gated codex‑agent provider and Mini‑Agent + Readiness system.
  • Deploy gate we want: make project-ready-all must pass, i.e., EVERY smoke (smoke, smoke_optional, ndsmoke, ndsmoke_e2e) green.
  • Current state: All failing clusters are fixed; individual smokes pass in isolation. The composite readiness check all_smokes times out in the harness despite extending per‑check timeout.
  1. What’s Working (evidence-based)
@grahama1970
grahama1970 / CODEWORLD_REVIEW_QUESTIONS.md
Created September 27, 2025 14:03
CodeWorld — External Review Questions (context + code anchors)

CodeWorld — External Review Questions, Context, and Code Anchors

Context: CodeWorld is a prompt‑driven, multi‑variant orchestrator for agentic code generation. It emits per‑instance prompts, autostarts a tiny FastAPI ingest backend, runs agents (or a local fallback), and aggregates a reproducible scorecard. Observability flows to ArangoDB with a thin proto dashboard. Memory hooks integrate a Graph Memory service for recall and timeline context.

Inspiration: CWM: An Open‑Weights LLM for Research on Code Generation with World Models (Meta AI, Sept 24, 2025). Local copy: docs/papers/CWM_ An Open-Weights LLM for Research on Code Generation with World Models _ Research - AI at Meta.md. Our aim is to explore world‑model style signals for agentic coding by capturing observation→action episodes during runs and enabling recall‑driven guidance.

Objective: Harden the orchestrator for research‑grade iteration while keeping it thin and deterministic by default. We want principled process lifecycle, secure defaults,

@grahama1970
grahama1970 / REVIEW_BUNDLE_PROMPT.md
Created September 26, 2025 21:49
Extractor External Review — 2025-09-26

External Review Prompt (Extractor — Canonical v1)

Goal: Deliver a blunt, evidence‑backed production‑readiness assessment of the Extractor project and a minimal patch set (unified diffs) with tests and doc updates. Keep changes surgical. Ship safety first.

Reviewer persona & tone

  • Principal SRE/DevEx + AppSec; fluent with Python/uv, Typer/FastAPI, Vite/React, ArangoDB, CI.
  • Be terse, specific, and fail‑closed. Unverified claims must be marked 🔴 (blocking) or 🟡 (needs proof). No hand‑waving.

Project context (declare at top of your report)

  • Project: Extractor — Self‑Correcting Agentic Document Processing System (multi‑stage pipeline + tabbed UX)
@grahama1970
grahama1970 / config.toml
Created September 9, 2025 13:05
codex config.toml
# ======================================
# CORE
# ======================================
model = "gpt-5"
model_reasoning_effort = "high"
# Disable all sandboxing (no filesystem/network restrictions)
sandbox_mode = "danger-full-access"
# Never prompt for approvals (Codex will run commands directly)