danielvaughan/benchmark-table.md

Created April 12, 2026 20:15

Star (0) You must be signed in to star a gist
Fork (0) You must be signed in to fork a gist

Select an option

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/danielvaughan/51ce90e33f125087bda8deea0cb15fa8.js"></script>
Save danielvaughan/51ce90e33f125087bda8deea0cb15fa8 to your computer and use it in GitHub Desktop.

Download ZIP

Gemma 4 benchmark: code generation task

Raw

benchmark-table.md

Metric	Cloud (GPT-5.4)	GB10 (31B Dense)	Mac (26B MoE)
Wall-clock time	1m 05s	6m 59s	4m 42s
Tokens used	21,268	185,091	29,501
Tests passed	5/5 (first try)	5/5 (first try)	4/4 (fifth try)
Code quality	5/5	4/5	3/5
Tool calls	about five (clean)	three (clean)	about ten (messy)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment