@sshleifer
Last active June 16, 2020 11:53
| model | batch_size | sequence_length | memory (MB) |
|---|---|---|---|
| t5-large | 1 | 128 | 6558 |
| t5-large | 1 | 512 | 9568 |
| t5-large | 1 | 1024 | 23124 |
| facebook/bart-large | 1 | 128 | 3758 |
| facebook/bart-large | 1 | 512 | 4670 |
| facebook/bart-large | 1 | 1024 | 8888 |
| t5-base | 1 | 128 | 2242 |
| t5-base | 1 | 512 | 3776 |
| t5-base | 1 | 1024 | 9056 |
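The jump from 512 to 1024 tokens is superlinear for every model, consistent with attention's quadratic score matrices. As a rough sanity check (a minimal sketch; the head count of 16 for t5-large is taken from the public model config, not measured here), the size of a single fp32 `(bs, n_heads, qlen, klen)` score tensor grows like this:

```python
def attn_scores_mb(batch_size, n_heads, qlen, klen, bytes_per_elem=4):
    """MiB occupied by one (bs, n_heads, qlen, klen) fp32 attention-score tensor."""
    return batch_size * n_heads * qlen * klen * bytes_per_elem / 2**20

# t5-large uses 16 attention heads (assumption from the model config)
for seqlen in (128, 512, 1024):
    print(seqlen, attn_scores_mb(1, 16, seqlen, seqlen))
```

One such tensor per layer, materialized several times (scores, softmax weights, dropout output, plus gradients when training), accounts for much of the superlinear growth in the table.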
sshleifer commented Jun 12, 2020

T5 Lines with top memory consumption:
```
=> 376: mem 1956:         weights = F.dropout(weights, p=self.dropout, training=self.training)  # (bs, n_heads, qlen, klen)
=> 359: mem 1728:         scores = torch.einsum("bnqd,bnkd->bnqk", q, k)  # (bs, n_heads, qlen, klen)
=> 375: mem 1728:         weights = F.softmax(scores.float(), dim=-1).type_as(scores)  # (bs, n_heads, qlen, klen)
=> 382: mem 600:         context = torch.matmul(weights, v)  # (bs, n_heads, qlen, dim_per_head)
=> 188: mem 396:         layer_output = hidden_states + self.dropout(y)
=> 172: mem 288:         h = F.relu(h)
```
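The three largest allocations are all attention score/weight tensors of shape `(bs, n_heads, qlen, klen)`. A minimal NumPy sketch of the same einsum contraction (head count 16 and head dimension 64 are assumptions matching t5-large's config) shows why a single such tensor already costs 64 MiB at qlen=1024:

```python
import numpy as np

bs, n_heads, qlen, dim_per_head = 1, 16, 1024, 64
q = np.random.randn(bs, n_heads, qlen, dim_per_head).astype(np.float32)
k = np.random.randn(bs, n_heads, qlen, dim_per_head).astype(np.float32)

# Same contraction as the traced T5 line 359; output is (bs, n_heads, qlen, klen),
# i.e. quadratic in sequence length, while q and k themselves are only linear in it.
scores = np.einsum("bnqd,bnkd->bnqk", q, k)
print(scores.shape, scores.nbytes / 2**20)
```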

@sshleifer

```sh
python run_benchmark.py --models t5-large facebook/bart-large t5-base \
   --save_to_csv --inference_time_csv_file plots/time.csv --train_memory_csv_file plots/train_memory.csv \
   --inference_memory_csv_file plots/memory.csv --training --batch_size 1 --sequence_lengths 128 512 1024 \
   --trace_memory_line_by_line | tee -a trace.txt
```
