Last active
August 27, 2024 13:39
-
-
Save Jeffwan/44cf970bd035c2443a9eb2e88f0fb96a to your computer and use it in GitHub Desktop.
single node benchmark
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
``` | |
python3 -m vllm.entrypoints.openai.api_server --host "0.0.0.0" --port "8000" --model deepseek-ai/deepseek-coder-6.7b-instruct --max-model-len 32768 | |
``` | |
``` | |
git clone https://github.com/vllm-project/vllm.git | |
``` | |
``` | |
python3 benchmark_serving.py --backend openai --base-url http://127.0.0.1:8000 --dataset-name=random --model deepseek-ai/deepseek-coder-6.7b-instruct --seed 12345 | |
``` |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment