Skip to content

Instantly share code, notes, and snippets.

View rizar's full-sized avatar

Dzmitry Bahdanau rizar

  • ServiceNow Research
View GitHub Profile
@rizar
rizar / log.txt
Created September 26, 2025 16:08
vllm main B200 oom
INFO 09-26 08:47:55 [__init__.py:216] Automatically detected platform cuda.
INFO 09-26 08:47:55 [__init__.py:216] Automatically detected platform cuda.
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'eplb_log_balancedness' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'eplb_window_size' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'eplb_step_interval' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'num_redundant_experts' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'num_redundant_experts' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'eplb_window_size' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'eplb_log_balancedness' is deprecated
WARNING 09-26 08:47:57 [__init__.py:1748] argument 'eplb_step_interval' is deprecated
@rizar
rizar / bench.py
Created September 26, 2025 01:06
bench
#!/usr/bin/env python3
"""
vLLM Load Test Script
Makes 512 concurrent requests to vLLM server at localhost:8000
"""
import sys
import asyncio
import os
from pprint import pprint
@rizar
rizar / residual_mlp.py
Created January 2, 2022 21:16
Bespoke MLP Implementation for Measuring Throughput
import os
import time
import torch
from torch.nn import Linear, ReLU
from torch.profiler import profile, ProfilerActivity
from codeparrot.build_table import build_table
device = torch.device('cuda')
@rizar
rizar / megatron.csv
Last active January 2, 2022 21:07
MegaTron throughput
system GPU count Training time Model size Tokens teraWFLOP/s
MegaTron + DeepSpeed 2240 60.1 5.3E+11 3932160 92.9
HyperCLOVA 1024 1157760 8.2E+10 1.5E+11 62.2
MegaTron LM GPT-3 Example 1024 32 1.75E+11 3145728 100.8
@rizar
rizar / throughput.csv
Last active January 9, 2022 18:54
Throughput table
model precision throughput in teraWFLOPs
matmul 8192x1600x6400 bfloat16 237
linear MLP B=8192; 1600 -> 6400 -> 1600 -> 1 bfloat16 230
linear MLP B=8192; 1600 -> 6400 -> 1600 -> 1 mixed 207.6
residual ReLU MLP B=8192; 1600 -> 6400 -> 1600 -> 1 mixed 185.7
HuggingFace GPT2 B=32 L=256; d=1600 mixed 68
@rizar
rizar / frisbee.py
Created July 1, 2019 18:00
Sub selection script (run with the seed as the only argument)
import numpy
import sys
players = [
'Alex Lacoste',
'Karl Janelle',
'JF Marcil',
'Pat Steeves',
'Dzmitry Bahdanau',
'Philippe Guay',
@rizar
rizar / error.txt
Created November 19, 2018 17:41
error again
> [email protected] dev-build /home/dzmitry/Dist/arl-omnishop/static
> webpack --progress -d --config webpack.config.js
clean-webpack-plugin: /home/dzmitry/Dist/arl-omnishop/static/dist has been removed.
10% building modules 4/11 modules 7 active .../Dist/arl-omnishop/static/sass/site.cssℹ 「atl」: Using [email protected] from typescript
ℹ 「atl」: Using tsconfig.json from /home/dzmitry/Dist/arl-omnishop/static/tsconfig.json
94% after sealℹ 「atl」: Checking started in a separate process...
✖ 「atl」: Checking finished with 47 errors
Hash: 248bef2f47830ed686dd
Version: webpack 4.20.2
@rizar
rizar / dev_build.txt
Created November 19, 2018 14:48
errors
╰─$ npm run dev-build 2 ↵
> [email protected] dev-build /home/dzmitry/Dist/arl-omnishop/static
> webpack --progress -d --config webpack.config.js
clean-webpack-plugin: /home/dzmitry/Dist/arl-omnishop/static/dist has been removed.
10% building modules 4/11 modules 7 active .../Dist/arl-omnishop/static/sass/site.cssℹ 「atl」: Using [email protected] from typescript
ℹ 「atl」: Using tsconfig.json from /home/dzmitry/Dist/arl-omnishop/static/tsconfig.json
94% after sealℹ 「atl」: Checking started in a separate process...
✖ 「atl」: Checking finished with 26 errors
@rizar
rizar / dev_build.txt
Created November 19, 2018 14:48
errors
╰─$ npm run dev-build 2 ↵
> [email protected] dev-build /home/dzmitry/Dist/arl-omnishop/static
> webpack --progress -d --config webpack.config.js
clean-webpack-plugin: /home/dzmitry/Dist/arl-omnishop/static/dist has been removed.
10% building modules 4/11 modules 7 active .../Dist/arl-omnishop/static/sass/site.cssℹ 「atl」: Using [email protected] from typescript
ℹ 「atl」: Using tsconfig.json from /home/dzmitry/Dist/arl-omnishop/static/tsconfig.json
94% after sealℹ 「atl」: Checking started in a separate process...
✖ 「atl」: Checking finished with 26 errors
@rizar
rizar / beautiful_plots.py
Created September 26, 2018 20:31
Beautifult plots
def compare_rl_and_gail(metric, window, ylabel=None, xlim=None, fs=15, ncol=2):
df1 = df_rl[df_rl['config.rp_weight'] == 1.0]
df2 = df_rl[df_rl['config.rp_weight'] == 0.0]
df3 = df_gail[df_gail['config.rmt.top_k_percent'] == 0.]
df4 = df_gail[df_gail['config.rmt.top_k_percent'] == 0.25]
df5 = df_gail[df_gail['config.rmt.top_k_percent'] == 0.5]
df6 = df_gail[df_gail['config.rmt.top_k_percent'] == 0.75]
for i, df in enumerate([df1, df2, df3, df4, df5, df6]):
value = 100 * pandas.rolling_mean(average_value(df, metric), window)
print(value.iloc[-1])