|
<!doctype html> |
|
<html lang="zh-Hant"> |
|
<head> |
|
<meta charset="utf-8"> |
|
<meta name="viewport" content="width=device-width, initial-scale=1"> |
|
<title>AI 預測驗證 Cockpit</title> |
|
<style> |
|
:root { |
|
--ink: #111827; |
|
--muted: #5f6b7a; |
|
--line: #d7dee8; |
|
--bg: #f7f9fb; |
|
--panel: #ffffff; |
|
--green: #0f766e; |
|
--blue: #1d4ed8; |
|
--red: #b42318; |
|
--amber: #a15c00; |
|
--violet: #6d28d9; |
|
} |
|
* { box-sizing: border-box; } |
|
body { |
|
margin: 0; |
|
font-family: ui-sans-serif, system-ui, -apple-system, BlinkMacSystemFont, "Segoe UI", sans-serif; |
|
background: var(--bg); |
|
color: var(--ink); |
|
line-height: 1.5; |
|
} |
|
header { |
|
padding: 30px clamp(18px, 4vw, 50px); |
|
color: white; |
|
background: linear-gradient(120deg, #111827, #233047); |
|
} |
|
h1, h2, h3 { margin: 0; line-height: 1.2; } |
|
h1 { font-size: clamp(29px, 5vw, 48px); max-width: 1040px; } |
|
h2 { font-size: 22px; margin-bottom: 12px; } |
|
h3 { font-size: 16px; margin-bottom: 7px; } |
|
header p { max-width: 1020px; color: #d5dce8; margin: 10px 0 0; } |
|
main { padding: 24px clamp(16px, 4vw, 50px) 46px; } |
|
.grid { display: grid; grid-template-columns: repeat(12, 1fr); gap: 14px; margin-bottom: 22px; } |
|
.card { |
|
background: var(--panel); |
|
border: 1px solid var(--line); |
|
border-radius: 8px; |
|
padding: 16px; |
|
box-shadow: 0 1px 2px rgba(17, 24, 39, 0.04); |
|
} |
|
.span-3 { grid-column: span 3; } |
|
.span-4 { grid-column: span 4; } |
|
.span-6 { grid-column: span 6; } |
|
.span-8 { grid-column: span 8; } |
|
.span-12 { grid-column: span 12; } |
|
.metric { font-size: 32px; font-weight: 760; margin: 5px 0 2px; } |
|
.muted { color: var(--muted); } |
|
.tag { |
|
display: inline-flex; |
|
align-items: center; |
|
min-height: 24px; |
|
padding: 2px 8px; |
|
border-radius: 999px; |
|
font-size: 12px; |
|
font-weight: 720; |
|
border: 1px solid currentColor; |
|
margin: 2px 4px 2px 0; |
|
} |
|
.green { color: var(--green); } |
|
.blue { color: var(--blue); } |
|
.red { color: var(--red); } |
|
.amber { color: var(--amber); } |
|
.violet { color: var(--violet); } |
|
table { width: 100%; border-collapse: collapse; } |
|
th, td { text-align: left; padding: 10px 8px; border-bottom: 1px solid var(--line); vertical-align: top; } |
|
th { font-size: 12px; color: var(--muted); text-transform: uppercase; letter-spacing: 0; } |
|
ul, ol { margin: 8px 0 0 18px; padding: 0; } |
|
li { margin: 5px 0; } |
|
.flow { |
|
display: grid; |
|
grid-template-columns: repeat(5, minmax(130px, 1fr)); |
|
gap: 10px; |
|
} |
|
.flow div { |
|
border: 1px solid var(--line); |
|
border-radius: 8px; |
|
padding: 12px; |
|
background: #fbfdff; |
|
min-height: 118px; |
|
} |
|
.code { |
|
font-family: ui-monospace, SFMono-Regular, Menlo, Consolas, monospace; |
|
font-size: 13px; |
|
background: #edf2f7; |
|
border-radius: 6px; |
|
padding: 2px 5px; |
|
} |
|
.line-draft { |
|
white-space: pre-wrap; |
|
background: #eff6ff; |
|
border: 1px solid #bfdbfe; |
|
border-radius: 8px; |
|
padding: 14px; |
|
} |
|
@media (max-width: 960px) { |
|
.span-3, .span-4, .span-6, .span-8 { grid-column: span 12; } |
|
.flow { grid-template-columns: 1fr; } |
|
} |
|
</style> |
|
</head> |
|
<body> |
|
<header> |
|
<h1>AI 預測驗證 Cockpit</h1> |
|
<p>把「上次 review 的預測」變成可被 signals、action items、GitHub、LINE/Drive 證據自動核對的 production 模組。目標不是多做一張報表,而是讓 Louis / zihrou / iron 對 AI 工具選擇與系統化建置有同一套命中率、證據等級、退修與派工節奏。</p> |
|
</header> |
|
<main> |
|
<section class="grid"> |
|
<div class="card span-3"> |
|
<h3>Verification Readiness</h3> |
|
<div class="metric green">R1</div> |
|
<p class="muted">可用 schema 跑第一批人工標註與自動比對。</p> |
|
</div> |
|
<div class="card span-3"> |
|
<h3>Evidence Sources</h3> |
|
<div class="metric blue">5</div> |
|
<p class="muted">signals、action items、commit、deliverable、people reflection。</p> |
|
</div> |
|
<div class="card span-3"> |
|
<h3>Owner / Due</h3> |
|
<div class="metric">Louis</div> |
|
<p class="muted">2026-05-27 前拍板命中定義與紅線。</p> |
|
</div> |
|
<div class="card span-3"> |
|
<h3>Decision Risk</h3> |
|
<div class="metric amber">中</div> |
|
<p class="muted">zihrou、iron、Louis 工具選擇尚未完全統一。</p> |
|
</div> |
|
</section> |
|
|
|
<section class="grid"> |
|
<div class="card span-8"> |
|
<h2>D1 / D7 / D14 / D30 路徑</h2> |
|
<table> |
|
<thead><tr><th>節點</th><th>交付</th><th>驗收訊號</th></tr></thead> |
|
<tbody> |
|
<tr><td>D1</td><td>建立 prediction ledger 與 evidence matcher,先人工回填 10 筆 review prediction。</td><td>每筆有 claim、time window、expected signal、evidence source、label。</td></tr> |
|
<tr><td>D7</td><td>接入 signals/action_items/GitHub commit,自動產生 hit/miss/partial/unknown。</td><td>50 筆 prediction 有驗證結果,unknown 低於 25%。</td></tr> |
|
<tr><td>D14</td><td>將驗證結果接入 AI review 節奏,低命中類型自動派 correction task。</td><td>每週 review 能看到 calibration、false positive、false negative。</td></tr> |
|
<tr><td>D30</td><td>形成公司 AI 化 decision QA:預測、證據、派工、修正、治理完整閉環。</td><td>AI review 命中率、工具採用決策速度、逾期修正率每週可追。</td></tr> |
|
</tbody> |
|
</table> |
|
</div> |
|
<div class="card span-4"> |
|
<h2>Purpose-to-Purpose E2E</h2> |
|
<ol> |
|
<li>原始目的:讓 AI review 真的改善公司 AI 化決策。</li> |
|
<li>產出物:驗證 cockpit、資料模型、比對規則、驗收測試。</li> |
|
<li>人採用:Louis 拍板紅線,zihrou/iron 回饋工具分歧訊號。</li> |
|
<li>指標改善:預測命中率、決策速度、派工準確率、風險提早發現率。</li> |
|
</ol> |
|
</div> |
|
</section> |
|
|
|
<section class="grid"> |
|
<div class="card span-12"> |
|
<h2>Verification Flow</h2> |
|
<div class="flow"> |
|
<div><h3>1. Capture</h3><p>把 review 中的預測拆成 <span class="code">prediction_claims</span>,含 expected outcome 與時間窗。</p></div> |
|
<div><h3>2. Evidence</h3><p>收 signals、action items、commit、deliverable、people reflection,多來源都保留 source_ref。</p></div> |
|
<div><h3>3. Match</h3><p>用 rule + embedding/keyword candidate 產生 evidence_candidates 與 confidence。</p></div> |
|
<div><h3>4. Label</h3><p>輸出 hit / miss / partial / unknown,低信心進人工 review queue。</p></div> |
|
<div><h3>5. Act</h3><p>命中差的 prediction type 轉成 prompt 改寫、資料缺口或 owner 對焦任務。</p></div> |
|
</div> |
|
</div> |
|
</section> |
|
|
|
<section class="grid"> |
|
<div class="card span-6"> |
|
<h2>Evidence Ledger 範例</h2> |
|
<table> |
|
<thead><tr><th>Prediction</th><th>Evidence</th><th>Label</th><th>Action</th></tr></thead> |
|
<tbody> |
|
<tr><td>工具選擇未統一會拖慢系統化建置 2 週</td><td>action item 仍要求統一 AI 工具與同步節奏</td><td><span class="tag amber">partial</span></td><td>派 Louis 拍板工具標準</td></tr> |
|
<tr><td>AI 預測驗證模組會成為近期關注</td><td>GitHub commit 與 person_reflection 均指向 verification module</td><td><span class="tag green">hit</span></td><td>進入 D7 自動比對</td></tr> |
|
<tr><td>所有 action item 逾期皆因缺 owner</td><td>context 顯示部分 action 已有 owner,但缺 review cadence</td><td><span class="tag red">miss</span></td><td>修正 prompt:區分 owner 缺口與節奏缺口</td></tr> |
|
</tbody> |
|
</table> |
|
</div> |
|
<div class="card span-6"> |
|
<h2>Human Capability Upgrade</h2> |
|
<ul> |
|
<li>Louis:看 AI review 的校準度,不只看敘述漂亮。</li> |
|
<li>zihrou:把分歧翻譯成方向、資源、授權三類缺口。</li> |
|
<li>iron:用 evidence ledger 對齊多線執行,而不是被催促。</li> |
|
<li>PLS:下一輪可根據 miss type 自動改 prompt、派資料任務或升級治理。</li> |
|
</ul> |
|
</div> |
|
</section> |
|
|
|
<section class="grid"> |
|
<div class="card span-6"> |
|
<h2>people_sync</h2> |
|
<div class="line-draft">LINE 草稿: |
|
Louis / zihrou / iron,PLS 已把「AI 預測驗證模組」整理成可落地驗證 cockpit。請在 2026-05-27 前回覆: |
|
1. accept:同意 hit/miss/partial/unknown 定義; |
|
2. revise:指出哪種預測不能這樣判; |
|
3. evidence_gap:補 signals/action items/commit 以外還要接的來源。 |
|
驗收口徑:D7 前 50 筆 prediction 可自動產生 label,unknown 低於 25%,低信心進人工 review queue。</div> |
|
</div> |
|
<div class="card span-6"> |
|
<h2>production_readiness</h2> |
|
<table> |
|
<thead><tr><th>Gate</th><th>狀態</th><th>證據</th></tr></thead> |
|
<tbody> |
|
<tr><td>主成果可打開</td><td><span class="tag green">PASS</span></td><td>本 HTML cockpit</td></tr> |
|
<tr><td>資料模型</td><td><span class="tag green">PASS</span></td><td>prediction/evidence/match/review tables</td></tr> |
|
<tr><td>權限/稽核</td><td><span class="tag green">PASS</span></td><td>source_ref、audit_actor、sensitive_source</td></tr> |
|
<tr><td>E2E 驗收</td><td><span class="tag blue">READY</span></td><td>D1/D7/D14/D30 gates</td></tr> |
|
<tr><td>下一輪升級</td><td><span class="tag violet">DEFINED</span></td><td>低命中 prediction type 自動派工</td></tr> |
|
</tbody> |
|
</table> |
|
</div> |
|
</section> |
|
</main> |
|
</body> |
|
</html> |