Skip to content

Instantly share code, notes, and snippets.

@JenZhao
JenZhao / benchmark_serving.sh
Last active March 8, 2025 23:31
benchmarking scripts for vllm
#!/bin/bash
# first start it in another terminal
# vllm serve Qwen/Qwen2-VL-7B-Instruct --swap-space 16 --disable-log-requests
# Define model variables
MODEL_NAME="Qwen/Qwen2-VL-7B-Instruct"
NUM_PROMPTS=10
# Define the output markdown file
OUTPUT_FILE="benchmark_output.md"