@AmosLewis
Created September 20, 2025 04:59
# The format of this config file is 'key = value'.
# The key has the format 'model.scenario.key'. Value is mostly int64_t.
# Model may be '*' as a wildcard. In that case the value applies to all models.
# All times are in milliseconds.
#
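# Illustration of the key format only. The entry below is a hypothetical
# example, not part of the original settings, and is left commented out so
# it has no effect:
#   llama3_1-405b.Offline.min_query_count = 8
# Here 'llama3_1-405b' is the model, 'Offline' is the scenario, and
# 'min_query_count' is the key. Replacing the model with '*' would apply the
# value to all models.
#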
*.Offline.min_duration = 6000
*.Offline.min_query_count = 4
*.Offline.max_query_count = 4
*.Server.target_qps = 0.5
*.Server.min_duration = 120000
*.Server.min_query_count = 10
llama3_1-405b.Server.sample_concatenate_permutation = 1