| Metric | Cloud (GPT-5.4) | GB10 (31B Dense) | Mac (26B MoE) |
|---|---|---|---|
| Wall-clock time | 1m 05s | 6m 59s | 4m 42s |
| Tokens used | 21,268 | 185,091 | 29,501 |
| Tests passed | 5/5 (first try) | 5/5 (first try) | 4/4 (fifth try) |
| Code quality | 5/5 | 4/5 | 3/5 |
| Tool calls | about five (clean) | three (clean) | about ten (messy) |
Created
April 12, 2026 20:15
-
-
Save danielvaughan/51ce90e33f125087bda8deea0cb15fa8 to your computer and use it in GitHub Desktop.
Gemma 4 benchmark: code generation task
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment