<!doctype html>
<html lang="en">
<head>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1" />
<title>Codex Worker Self-Repair Adoption Console</title>
<style>
:root{--ink:#17202a;--muted:#627181;--line:#dbe3ea;--bg:#f5f7f9;--panel:#fff;--blue:#285f8f;--green:#20765a;--amber:#b67a18;--red:#ad3f36;--soft:#edf4f8}
*{box-sizing:border-box} body{margin:0;background:var(--bg);color:var(--ink);font-family:Inter,ui-sans-serif,system-ui,-apple-system,BlinkMacSystemFont,"Segoe UI",sans-serif;line-height:1.45}
header{padding:30px 36px 22px;background:#fff;border-bottom:1px solid var(--line)} h1,h2,h3{margin:0;letter-spacing:0} h1{font-size:28px;max-width:1120px} h2{font-size:17px;margin-bottom:12px} h3{font-size:14px;margin-bottom:8px}
p{margin:0;color:var(--muted)} .eyebrow{font-size:12px;font-weight:800;text-transform:uppercase;color:var(--blue);margin-bottom:8px}
.wrap{padding:24px 36px 42px}.grid{display:grid;grid-template-columns:repeat(12,1fr);gap:16px;max-width:1280px;margin:0 auto}
.panel{background:var(--panel);border:1px solid var(--line);border-radius:8px;padding:18px;box-shadow:0 1px 2px rgba(0,0,0,.03)}
.span-12{grid-column:span 12}.span-8{grid-column:span 8}.span-6{grid-column:span 6}.span-4{grid-column:span 4}.span-3{grid-column:span 3}
.kpis{display:grid;grid-template-columns:repeat(4,1fr);gap:12px}.kpi{border:1px solid var(--line);border-radius:8px;padding:14px;min-height:92px;background:#fbfcfd}.kpi strong{display:block;font-size:28px}.kpi span{font-size:13px;color:var(--muted)}
table{width:100%;border-collapse:collapse;font-size:13px} th,td{text-align:left;padding:10px 8px;border-bottom:1px solid var(--line);vertical-align:top} th{font-size:12px;text-transform:uppercase;color:var(--muted);background:#fbfcfd}
.status{display:inline-flex;border-radius:999px;padding:4px 9px;font-size:12px;font-weight:800;white-space:nowrap}.green{background:#e1f0e8;color:var(--green)}.blue{background:#e4eef8;color:var(--blue)}.amber{background:#f7ecd9;color:#81530f}.red{background:#f9e6e3;color:var(--red)}
.timeline{display:grid;grid-template-columns:repeat(4,1fr);gap:12px}.stage{border-left:4px solid var(--blue);background:#fbfcfd;padding:12px;border-radius:0 8px 8px 0;min-height:150px}.stage:nth-child(2){border-color:var(--amber)}.stage:nth-child(3){border-color:var(--green)}.stage:nth-child(4){border-color:var(--red)}
.flow{display:grid;grid-template-columns:repeat(5,1fr);gap:10px}.step{border:1px solid var(--line);border-radius:8px;padding:12px;background:#fbfcfd;min-height:116px}.step b{display:block;margin-bottom:6px}.small{font-size:12px;color:var(--muted)}
ul{margin:8px 0 0 18px;padding:0;color:var(--muted)} li{margin:4px 0}.script{background:var(--soft);border-radius:8px;padding:12px;color:var(--ink);font-size:13px;white-space:pre-wrap}
.code{background:#16212b;color:#eaf1f7;border-radius:8px;padding:12px;font:12px ui-monospace,SFMono-Regular,Menlo,monospace;white-space:pre-wrap;overflow:auto}
@media(max-width:900px){header,.wrap{padding-left:18px;padding-right:18px}.span-8,.span-6,.span-4,.span-3{grid-column:span 12}.kpis,.timeline,.flow{grid-template-columns:1fr}}
</style>
</head>
<body>
<header>
<div class="eyebrow">PLS production artifact · codex-session-stability · 2026-05-24</div>
<h1>Codex Worker Self-Repair Adoption Console</h1>
<p>Turns "worker stability and self-repair" from a top-down requirement into a production loop in which frontline workers report, repair, verify, and accumulate learning memory from the bottom up.</p>
</header>
<main class="wrap">
<section class="grid">
<div class="panel span-12"><div class="kpis">
<div class="kpi"><strong>2</strong><span>Latest shared signals, both pointing to an adjustment in production relations</span></div>
<div class="kpi"><strong>3</strong><span>Related projects to merge into one reliability/adoption lane</span></div>
<div class="kpi"><strong>D7</strong><span>Worker reporting and self-repair acceptance complete</span></div>
<div class="kpi"><strong>D30</strong><span>Integrated into the PLS ops console and worker learning memory</span></div>
</div></div>
<div class="panel span-8">
<h2>Self-Repair Adoption Gates</h2>
<table>
<thead><tr><th>Gate</th><th>Trigger</th><th>Worker Action</th><th>Human Policy</th><th>Pass Evidence</th></tr></thead>
<tbody>
<tr><td><span class="status red">stuck_claim</span></td><td>Claim returns no job, or context fetch failed</td><td>Retry once, touch, record the blocker phase</td><td>Must not force complete</td><td>progress / verification records include the command output</td></tr>
<tr><td><span class="status amber">repair_suggested</span></td><td>Same class of error repeats 2 times</td><td>Generate a backlog/capability repair proposal</td><td>Louis or the PLS owner decides whether to adopt it into worker policy</td><td>decision-record has an adopted/not adopted field</td></tr>
<tr><td><span class="status blue">artifact_guard</span></td><td>Before complete</td><td>Verify the primary URL returns HTTP 200 and upload-files succeeded</td><td>Must not complete without an openable primary artifact</td><td>curl/gh/upload-files output</td></tr>
<tr><td><span class="status green">worker_learning</span></td><td>After each repair</td><td>Update learning_memory and next_round_priority</td><td>Frontline workers may report policy friction</td><td>By D7, 3 adoption/blocker signals are produced</td></tr>
</tbody>
</table>
</div>
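<div class="panel span-4">
<h2>LINE Ask</h2>
<div class="script">This round of Codex worker stability work is not just a watchdog patch; it brings frontline workers' self-repair reports into operations.

Please confirm three things:
1. When context/complete fetch fails, may the worker automatically touch and retry once?
2. When the same class of error occurs 2 times in a row, should a repair proposal be generated automatically?
3. Which self-heal actions require approval from Louis or the PLS owner?

Until there is a reply, workers only do diagnosis / retry once / proposal, and do not release/reclaim.</div>
</div>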
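<div class="panel span-12">
<h2>Gate Decision Sketch</h2>
<p>The gate rules and the LINE Ask above can be sketched as a small decision function. This is a minimal illustration under stated assumptions, not the actual worker implementation: <code>GateState</code>, <code>next_action</code>, <code>artifact_guard_passes</code>, and the action and error names are all hypothetical. It assumes one automatic retry per job, a repair proposal once the same error kind has been seen 2 times, and an artifact guard that needs both an HTTP 200 on the primary URL and a successful upload.</p>

```python
from collections import Counter
from dataclasses import dataclass, field

# Assumptions taken from the gates table: retry once, and suggest a repair
# when the same class of error has been seen 2 times.
MAX_RETRIES = 1
REPEAT_THRESHOLD = 2


@dataclass
class GateState:
    """Hypothetical per-job state; not part of any real PLS schema."""
    retries_used: int = 0
    error_counts: Counter = field(default_factory=Counter)


def next_action(state: GateState, error_kind: str) -> str:
    """Return the worker action for one observed error.

    Results: "touch_and_retry", "propose_repair", or "record_blocker".
    Nothing here force-completes, releases, or reclaims a job.
    """
    state.error_counts[error_kind] += 1
    if state.error_counts[error_kind] >= REPEAT_THRESHOLD:
        return "propose_repair"
    if state.retries_used >= MAX_RETRIES:
        return "record_blocker"
    state.retries_used += 1
    return "touch_and_retry"


def artifact_guard_passes(primary_url_status: int, upload_ok: bool) -> bool:
    """artifact_guard: primary URL answers HTTP 200 and upload-files succeeded."""
    return primary_url_status == 200 and upload_ok
```

<p class="small">Keeping the decision separate from any network call makes the policy easy to review and to change once Louis or the PLS owner answers the three questions.</p>
</div>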
<div class="panel span-12">
<h2>D1 / D7 / D14 / D30</h2>
<div class="timeline">
<div class="stage"><h3>D1 · Console + Policy</h3><ul><li>Publish the self-repair adoption console.</li><li>Define the retry-once, artifact guard, and repair proposal policies.</li><li>Upload the production pack.</li></ul></div>
<div class="stage"><h3>D7 · Worker Adoption</h3><ul><li>Collect 3 frontline worker friction/repair signals.</li><li>Verify stuck_claim and artifact_guard.</li><li>Produce the first reliability scorecard.</li></ul></div>
<div class="stage"><h3>D14 · Workflow App</h3><ul><li>Connect to the PLS job/lease/progress/artifact tables.</li><li>Generate repair proposals automatically.</li><li>Build the policy approval matrix.</li></ul></div>
<div class="stage"><h3>D30 · Operating Model</h3><ul><li>Bring worker self-repair into the PLS ops console.</li><li>Review learning memory weekly.</li><li>Use adoption evidence to revise worker prompts/tools.</li></ul></div>
</div>
</div>
<div class="panel span-12">
<h2>Purpose-to-Purpose E2E</h2>
<div class="flow">
<div class="step"><b>Original purpose</b><span class="small">Codex Session / workers need to advance projects reliably and strengthen the huber avatar.</span></div>
<div class="step"><b>Primary deliverable</b><span class="small">Self-repair adoption console, data model, acceptance tests, policy LINE ask.</span></div>
<div class="step"><b>Human adoption</b><span class="small">The PLS owner decides the self-heal policy; workers report friction.</span></div>
<div class="step"><b>System improvement</b><span class="small">fetch failed, schema error, and artifact missing become verifiable events.</span></div>
<div class="step"><b>Value path</b><span class="small">Less manual monitoring, fewer false completions, and AI-driven throughput that accumulates into institutional practice.</span></div>
</div>
</div>
<div class="panel span-6">
<h2>Established Industry Practice</h2>
<p>Google's SRE monitoring guidance emphasizes managing production reliability through actionable symptoms; McKinsey's work on gen AI transformation argues that value comes from the operating model, workflow redesign, and talent/adoption reinforcement. This round combines the two: beyond monitoring workers, it lets frontline workers raise repair signals, which amounts to an adjustment in production relations.</p>
<ul>
<li>https://sre.google/resources/book-update/monitoring-distributed-systems/</li>
<li>https://www.mckinsey.com/capabilities/people-and-organizational-performance/our-insights/gen-ais-next-inflection-point-from-employee-experimentation-to-organizational-transformation</li>
</ul>
</div>
<div class="panel span-6">
<h2>Data / API / Permission</h2>
<div class="code">tables:
  worker_repair_events
  worker_policy_decisions
  artifact_gate_checks
  learning_memory_updates
apis:
  POST /worker-repair-events
  POST /artifact-gate-checks
  PATCH /worker-policy/:id
roles:
  worker: diagnose/propose
  PLS owner: approve policy
  Louis: approve risky self-heal</div>
</div>
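<div class="panel span-12">
<h2>Role Permission Sketch</h2>
<p>The roles list above can be read as a permission map that is checked before a repair event is recorded. The following is a sketch only, assuming the three roles and their duties as listed: <code>ROLE_ACTIONS</code>, <code>is_allowed</code>, <code>build_repair_event</code>, the action names, and the payload fields are hypothetical and do not describe a real PLS API or schema.</p>

```python
# Hypothetical permission map derived from the roles list; action names
# are illustrative, not identifiers from a real PLS API.
ROLE_ACTIONS = {
    "worker": {"diagnose", "propose"},
    "pls_owner": {"approve_policy"},
    "louis": {"approve_risky_self_heal"},
}


def is_allowed(role: str, action: str) -> bool:
    """Return True when the role may perform the action; unknown roles get nothing."""
    return action in ROLE_ACTIONS.get(role, set())


def build_repair_event(job_id: str, gate: str, error_kind: str, role: str) -> dict:
    """Build a payload for a hypothetical POST /worker-repair-events call.

    The field names are assumptions made for this sketch.
    """
    if not is_allowed(role, "propose"):
        raise PermissionError(f"{role} may not propose repairs")
    return {
        "job_id": job_id,
        "gate": gate,
        "error_kind": error_kind,
        "proposed_by": role,
    }
```

<p class="small">Denying by default for unknown roles keeps a worker limited to diagnose/propose until a policy decision grants more.</p>
</div>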
<div class="panel span-4">
<h2>Production Acceptance</h2>
<p><b>Owner:</b> PLS platform / Louis.</p>
<p><b>Due:</b> D7 worker adoption proof.</p>
<p><b>Pass:</b> 3 repair/friction events, 1 artifact guard pass, and 1 policy decision.</p>
</div>
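<div class="panel span-4">
<h2>Solution Selection</h2>
<p>Choose <code>watchdog + governance + project</code>: it answers this round's "adjustment in production relations" signal better than a watchdog alone. A full agent is not adopted directly because self-heal permissions still require policy approval.</p>
</div>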
<div class="panel span-4">
<h2>Next Upgrade</h2>
<p>After D7, connect real worker repair events to the PLS ops console; at D14, build the policy approval matrix; at D30, use learning memory to automatically improve worker prompts/tooling.</p>
</div>
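<div class="panel span-12">
<h2>Acceptance Check Sketch</h2>
<p>The Production Acceptance pass criteria (3 repair/friction events, 1 artifact guard pass, 1 policy decision) are simple enough to check mechanically. A minimal sketch, assuming the counts are gathered elsewhere by D7: <code>AcceptanceEvidence</code>, <code>missing_evidence</code>, and the field names are hypothetical and not taken from any existing table.</p>

```python
from dataclasses import dataclass


@dataclass
class AcceptanceEvidence:
    """Hypothetical counts gathered by D7; field names are illustrative."""
    repair_or_friction_events: int
    artifact_guard_passes: int
    policy_decisions: int


def missing_evidence(evidence: AcceptanceEvidence) -> list[str]:
    """Return the names of unmet pass criteria; an empty list means pass."""
    required = {
        "repair_or_friction_events": 3,
        "artifact_guard_passes": 1,
        "policy_decisions": 1,
    }
    return [
        name
        for name, needed in required.items()
        if needed > getattr(evidence, name)
    ]
```

<p class="small">Returning the unmet criteria, rather than a bare pass/fail, gives the owner something actionable to follow up on.</p>
</div>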
</section>
</main>
</body>
</html>