llama.cpp version: https://github.com/ggerganov/llama.cpp/commit/925e5584a058afb612f9c20bc472c130f5d0f891
LLM: https://huggingface.co/TheBloke/Llama-2-7B-Chat-GGUF/blob/main/llama-2-7b-chat.Q4_K_M.gguf
```
llama-bench -m ../models/llama-2-7b-chat.Q4_K_M.gguf
```

(`pp 512` = prompt processing of a 512-token prompt; `tg 128` = text generation of 128 tokens; `t/s` = tokens per second, mean ± standard deviation.)

CPU (BLAS backend):

| model | size | params | backend | threads | test | t/s |
| --- | ---: | ---: | --- | ---: | ---: | ---: |
| llama 7B Q4_K - Medium | 3.80 GiB | 6.74 B | BLAS | 4 | pp 512 | 7.58 ± 0.08 |
| llama 7B Q4_K - Medium | 3.80 GiB | 6.74 B | BLAS | 4 | tg 128 | 6.27 ± 0.01 |
| llama 7B Q4_K - Medium | 3.80 GiB | 6.74 B | BLAS | 8 | pp 512 | 27.12 ± 0.39 |
| llama 7B Q4_K - Medium | 3.80 GiB | 6.74 B | BLAS | 8 | tg 128 | 11.31 ± 0.01 |

GPU (Metal backend, `ngl 99` offloads all 32 layers of the 7B model to the GPU):

| model | size | params | backend | ngl | test | t/s |
| --- | ---: | ---: | --- | ---: | ---: | ---: |
| llama 7B Q4_K - Medium | 3.80 GiB | 6.74 B | Metal | 99 | pp 512 | 229.66 ± 7.05 |
| llama 7B Q4_K - Medium | 3.80 GiB | 6.74 B | Metal | 99 | tg 128 | 28.99 ± 0.19 |
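Only the first invocation is shown above; the 8-thread and Metal rows were presumably produced by varying llama-bench's `-t` (threads) and `-ngl` (GPU layers to offload) flags. A sketch of the three runs, assuming the binary sits in the current directory and the model path from the first command:

```shell
# CPU run, 4 threads (pp 512 / tg 128 are llama-bench's default tests)
./llama-bench -m ../models/llama-2-7b-chat.Q4_K_M.gguf -t 4

# CPU run, 8 threads
./llama-bench -m ../models/llama-2-7b-chat.Q4_K_M.gguf -t 8

# Metal run: -ngl 99 offloads all layers (7B has 32) to the GPU
./llama-bench -m ../models/llama-2-7b-chat.Q4_K_M.gguf -ngl 99
```

Each invocation prints one markdown table like those above; `-t` and `-ngl` also accept comma-separated lists (e.g. `-t 4,8`) to sweep several configurations in a single run.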