Skip to content

Instantly share code, notes, and snippets.

@danielvaughan
Created April 12, 2026 20:15
Show Gist options
  • Select an option

  • Save danielvaughan/5d4206f8f351b147cabf65ee7a7f46fa to your computer and use it in GitHub Desktop.

Select an option

Save danielvaughan/5d4206f8f351b147cabf65ee7a7f46fa to your computer and use it in GitHub Desktop.
Gemma 4 benchmark: raw speed (llama-bench)
Metric Mac (26B MoE Q4_K_M) GB10 (31B Dense Q4_K_M) GB10 (31B Dense Q8_0)
pp512 (tok/s) 590 674 499
pp8192 (tok/s) 531 548 426
tg128 (tok/s) 51.73 10.18 6.74
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment