@gslin
Created March 10, 2025 21:20
build/bin/llama-server -hf ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF -ngl 99 -fa -ub 1024 -b 1024 -dt 0.1 --ctx-size 0 --cache-reuse 256
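
For readability, the same invocation can be laid out with one option per line. This is only a sketch: the flag descriptions in the comments reflect my reading of the `llama-server --help` output and may differ between llama.cpp versions, and the `MODEL` variable is a name introduced here for illustration. The script prints the command rather than running it; the launch line is left commented out.

```shell
#!/bin/sh
# Flag notes (assumptions; verify with `build/bin/llama-server --help`):
#   -hf            Hugging Face repo to download the GGUF model from
#   -ngl 99        number of layers to offload to the GPU (99 = effectively all)
#   -fa            enable flash attention
#   -ub / -b       physical / logical maximum batch size
#   -dt 0.1        KV cache defragmentation threshold
#   --ctx-size 0   take the context size from the model
#   --cache-reuse  minimum chunk size for reusing the prompt cache
MODEL="ggml-org/Qwen2.5-Coder-7B-Q8_0-GGUF"  # illustrative variable name

# Store the arguments as positional parameters so quoting is preserved.
set -- \
  -hf "$MODEL" \
  -ngl 99 \
  -fa \
  -ub 1024 \
  -b 1024 \
  -dt 0.1 \
  --ctx-size 0 \
  --cache-reuse 256

# Dry run: show the command that would be executed.
echo "build/bin/llama-server $*"

# Uncomment to actually start the server:
# exec build/bin/llama-server "$@"
```

Keeping the arguments in `"$@"` means the printed command and the launched command stay in sync when an option is changed.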