@Jeffwan
Last active August 27, 2024 13:39
single node benchmark
```
python3 -m vllm.entrypoints.openai.api_server --host "0.0.0.0" --port "8000" --model deepseek-ai/deepseek-coder-6.7b-instruct --max-model-len 32768
```
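The server loads the model weights before it accepts requests, so it can help to wait until it responds before starting the benchmark. A minimal sketch, assuming `curl` is installed and that the server answers on its OpenAI-compatible `/v1/models` route; `wait_for_server` is a hypothetical helper name, not part of vLLM:

```shell
# Hypothetical helper (not part of vLLM): poll a URL once per second
# until it returns a successful HTTP response, or give up after N tries.
wait_for_server() {
  url="$1"
  tries="${2:-60}"   # default: about 60 seconds
  i=0
  while [ "$i" -lt "$tries" ]; do
    # -s: silent, -f: treat HTTP errors (4xx/5xx) as failure
    if curl -sf "$url" > /dev/null 2>&1; then
      return 0
    fi
    i=$((i + 1))
    sleep 1
  done
  return 1
}
```

For example, `wait_for_server http://127.0.0.1:8000/v1/models 300` returns 0 as soon as the server started above responds, and 1 if it is still unreachable after roughly five minutes.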
```
git clone https://github.com/vllm-project/vllm.git
cd vllm/benchmarks
```
```
python3 benchmark_serving.py --backend openai --base-url http://127.0.0.1:8000 --dataset-name=random --model deepseek-ai/deepseek-coder-6.7b-instruct --seed 12345
```