Last active
September 23, 2022 00:49
-
-
Save davidberard98/fc12ac6b42b4e2ab7ad5a1711797966d to your computer and use it in GitHub Desktop.
hf_T5, 2 nodes, dynamo+inductor, verbose=True, log_level=DEBUG; functorch..debug_graphs is FALSE.
This file has been truncated, but you can view the full file.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
submitit INFO (2022-09-22 18:42:53,293) - Starting with JobEnvironment(job_id=67434, hostname=a100-st-p4d24xlarge-3, local_rank=0(8), node=0(2), global_rank=0(16)) | |
submitit INFO (2022-09-22 18:42:53,294) - Loading pickle: /fsx/users/dberard/scratch-local/bench-fast/benchmark/logs/67434_submitted.pkl | |
Process group: 16 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-3 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/path/to/aws-ofi-nccl:/opt/amazon/efa/lib:/path/to/aws-ofi-nccl:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : TRACE | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
a100-st-p4d24xlarge-3:69371:69371 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-3:69371:69371 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-3:69371:69371 [0] NCCL INFO Bootstrap : Using ens32:10.200.90.130<0> | |
a100-st-p4d24xlarge-3:69371:69371 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-3:69371:69371 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-3:69371:69371 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.13.4+cuda11.6 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-3:69371:69878 [0] 28.771366 get_platform_type:1118 NCCL TRACE NET/OFI Read 12 bytes. EC2 platform type is p4d.24xlarge | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-3:69371:69878 [0] 40.029316 ofi_init:1342 NCCL TRACE NET/OFI Provider efa requires registration of local memory buffers | |
a100-st-p4d24xlarge-3:69371:69878 [0] 40.040884 ofi_init:1349 NCCL TRACE NET/OFI Provider efa requires registration of device buffers | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 00/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 01/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 02/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 03/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 04/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 05/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 06/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 07/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Trees [0] 1/8/-1->0->-1 [1] 1/-1/-1->0->7 [2] 1/-1/-1->0->7 [3] 1/-1/-1->0->7 [4] 1/-1/-1->0->8 [5] 1/-1/-1->0->7 [6] 1/-1/-1->0->7 [7] 1/-1/-1->0->7 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 01 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002d20 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 05 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002d58 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 03 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002d90 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 07 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002dc8 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002e00 | |
a100-st-p4d24xlarge-3:69371:69926 [0] 2957.355429 create_nccl_ofi_comp_for_dev:882 NCCL TRACE NET/OFI OFI component #0 is created | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 00/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002e38 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 04/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 00 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002e70 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 02 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002ea8 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 04 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002ee0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 06 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002f18 | |
a100-st-p4d24xlarge-3:69371:69926 [0] 3150.481758 alloc_and_reg_flush_buff:2275 NCCL TRACE NET/OFI Registering buffer for flush operations | |
a100-st-p4d24xlarge-3:69371:69926 [0] 3154.596521 alloc_and_reg_flush_buff:2275 NCCL TRACE NET/OFI Registering buffer for flush operations | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002f50 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002f88 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002fc0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00002ff8 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003030 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003068 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 00 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000030a0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 01 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000030d8 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 02 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003110 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 03 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003148 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 04 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003180 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 05 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000031b8 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 06 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000031f0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 07 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003228 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 01 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003260 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 03 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003298 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 05 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000032d0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 07 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003308 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 28 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 29 from local rank 5, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 30 from local rank 2, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 31 from local rank 3, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003340 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 00/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 32 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003420 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 04/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003458 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003490 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-3:69371:69926 [0] 3340.151999 alloc_and_reg_flush_buff:2275 NCCL TRACE NET/OFI Registering buffer for flush operations | |
a100-st-p4d24xlarge-3:69371:69926 [0] 3341.826148 alloc_and_reg_flush_buff:2275 NCCL TRACE NET/OFI Registering buffer for flush operations | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000034c8 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003500 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003538 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003570 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000035a8 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e000035e0 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003618 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003650 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy recv connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO threadThresholds 8/8/64 | 128/8/64 | 8/8/512 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO 8 coll channels, 8 p2p channels, 2 p2p channels per peer | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 44 from local rank 1, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 45 from local rank 7, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 46 from local rank 6, transport 2 | |
a100-st-p4d24xlarge-3:69371:69926 [0] NCCL INFO New proxy send connection 47 from local rank 4, transport 2 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3e00003688 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 2 -> connection 0x7fec90003688 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 4 -> connection 0x7fbee4003768 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO Connection to proxy localRank 6 -> connection 0x7f1294003688 | |
a100-st-p4d24xlarge-3:69371:69878 [0] NCCL INFO comm 0x7f3e04002c30 rank 0 nranks 16 cudaDev 0 busId 101c0 - Init COMPLETE | |
torchdynamo.eval_frame: [DEBUG] skipping __init__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/contextlib.py | |
torchdynamo.eval_frame: [DEBUG] skipping __enter__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/contextlib.py | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model [UserDefinedObjectVariable(Model)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR example_inputs [NNModuleVariable(), TupleVariable(), UserDefinedObjectVariable(Model)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_EX 1 [NNModuleVariable(), TupleVariable(), ConstDictVariable()] | |
torchdynamo.symbolic_convert: [WARNING] Graph break: data dependent operator: aten._local_scalar_dense.default from user code at File "/fsx/users/dberard/scratch-local/bench-fast/benchmark/torchbenchmark/util/framework/huggingface/model_factory.py", line 123, in train | |
outputs = self.model(**self.example_inputs) | |
torchdynamo.convert_frame: [INFO] ORIGINAL BYTECODE train /fsx/users/dberard/scratch-local/bench-fast/benchmark/torchbenchmark/util/framework/huggingface/model_factory.py line 122 | |
123 0 LOAD_FAST 0 (self) | |
2 LOAD_ATTR 0 (model) | |
4 BUILD_TUPLE 0 | |
6 LOAD_FAST 0 (self) | |
8 LOAD_ATTR 1 (example_inputs) | |
10 CALL_FUNCTION_EX 1 | |
12 STORE_FAST 1 (outputs) | |
124 14 LOAD_FAST 1 (outputs) | |
16 LOAD_ATTR 2 (loss) | |
18 STORE_FAST 2 (loss) | |
125 20 LOAD_FAST 2 (loss) | |
22 LOAD_METHOD 3 (backward) | |
24 CALL_METHOD 0 | |
26 POP_TOP | |
126 28 LOAD_FAST 0 (self) | |
30 LOAD_ATTR 4 (optimizer) | |
32 LOAD_METHOD 5 (step) | |
34 CALL_METHOD 0 | |
36 POP_TOP | |
38 LOAD_CONST 0 (None) | |
40 RETURN_VALUE | |
torchdynamo.convert_frame: [INFO] MODIFIED BYTECODE train /fsx/users/dberard/scratch-local/bench-fast/benchmark/torchbenchmark/util/framework/huggingface/model_factory.py line 122 | |
122 0 LOAD_FAST 0 (self) | |
2 LOAD_ATTR 0 (model) | |
4 LOAD_CONST 1 (()) | |
6 LOAD_FAST 0 (self) | |
8 LOAD_ATTR 1 (example_inputs) | |
10 CALL_FUNCTION_EX 1 | |
12 LOAD_GLOBAL 6 (__resume_at_12_0) | |
14 ROT_TWO | |
16 LOAD_FAST 0 (self) | |
18 CALL_FUNCTION 2 | |
20 RETURN_VALUE | |
torchdynamo.convert_frame: [INFO] GUARDS: | |
- | |
local 'self' TYPE_MATCH" | |
{ | |
'guard_types': ['TYPE_MATCH'], | |
'code': ['___check_type_id(self, 94256889894848)'], | |
'obj_weakref': <weakref at 0x7f3f1bbcf590; to 'Model' at 0x7f3f1620f8e0> | |
'guarded_class': <weakref at 0x7f40e12ea360; to 'PostInitProcessor' at 0x55b9e47097c0 (Model)> | |
} | |
- | |
local 'self.model' NN_MODULE" | |
{ | |
'guard_types': ['ID_MATCH'], | |
'code': ['___check_obj_id(self.model, 139916925672032)'], | |
'obj_weakref': <weakref at 0x7f3ef6b17c70; to 'DistributedDataParallel' at 0x7f40f2a71a60> | |
'guarded_class': <weakref at 0x7f3f1bda6220; to 'ABCMeta' at 0x55b9e32746f0 (DistributedDataParallel)> | |
} | |
- | |
local 'self.example_inputs' DICT_KEYS" | |
{ | |
'guard_types': ['DICT_KEYS'], | |
'code': ['___check_type_id(self.example_inputs, 94256774751424)', "set(self.example_inputs.keys()) == {'input_ids', 'labels'}"], | |
'obj_weakref': None | |
'guarded_class': <weakref at 0x7f40fc1a00e0; to 'type' at 0x55b9dd93a4c0 (dict)> | |
} | |
- | |
local "self.example_inputs['labels']" TENSOR_MATCH" | |
{ | |
'guard_types': ['TENSOR_MATCH'], | |
'code': None, | |
'obj_weakref': <weakref at 0x7f40f01de450; to 'Tensor' at 0x7f3f1a5de720> | |
'guarded_class': <weakref at 0x7f3f1c308590; to 'torch._C._TensorMeta' at 0x55b9e2db1e00 (Tensor)> | |
} | |
- | |
local "self.example_inputs['input_ids']" TENSOR_MATCH" | |
{ | |
'guard_types': ['TENSOR_MATCH'], | |
'code': None, | |
'obj_weakref': <weakref at 0x7f40f01c9ef0; to 'Tensor' at 0x7f3f1ad67950> | |
'guarded_class': <weakref at 0x7f3f1c308590; to 'torch._C._TensorMeta' at 0x55b9e2db1e00 (Tensor)> | |
} | |
torchdynamo.eval_frame: [DEBUG] skipping _call_impl /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py | |
torchdynamo.eval_frame: [DEBUG] skipping forward /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py | |
torchdynamo.eval_frame: [DEBUG] skipping __setattr__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py | |
torchdynamo.eval_frame: [DEBUG] skipping __instancecheck__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parameter.py | |
torchdynamo.eval_frame: [DEBUG] skipping notify_join_context /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/distributed/algorithms/join.py | |
torchdynamo.eval_frame: [DEBUG] skipping __getattr__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py | |
torchdynamo.eval_frame: [DEBUG] skipping _check_sync_bufs_pre_fwd /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py | |
torchdynamo.eval_frame: [DEBUG] skipping will_sync_module_buffers /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py | |
torchdynamo.eval_frame: [DEBUG] skipping _run_ddp_forward /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py | |
torchdynamo.eval_frame: [DEBUG] skipping _to_kwargs /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/distributed/utils.py | |
torchdynamo.eval_frame: [DEBUG] skipping _recursive_to /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/distributed/utils.py | |
torchdynamo.eval_frame: [DEBUG] skipping to_map /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/distributed/utils.py | |
torchdynamo.eval_frame: [DEBUG] skipping _is_namedtuple /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/scatter_gather.py | |
torchdynamo.eval_frame: [DEBUG] skipping <listcomp> /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/distributed/utils.py | |
torchdynamo.eval_frame: [DEBUG] skipping <listcomp> /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/distributed/utils.py | |
torchdynamo.eval_frame: [DEBUG] skipping helper /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/contextlib.py | |
torchdynamo.eval_frame: [DEBUG] skipping __init__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/contextlib.py | |
torchdynamo.eval_frame: [DEBUG] skipping __enter__ /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/contextlib.py | |
torchdynamo.eval_frame: [DEBUG] skipping _inside_ddp_forward /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 12 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR use_cache [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST use_cache [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST return_dict [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 32 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR use_return_dict [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST return_dict [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 88 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR encoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST inputs_embeds [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST return_dict [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('input_ids', 'attention_mask', 'inputs_embeds', 'head_mask', 'output_attentions', 'output_hidden_states', 'return_dict') [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ba030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 892> | |
908 0 LOAD_FAST 0 (self) | |
2 LOAD_ATTR 0 (model_parallel) | |
4 POP_JUMP_IF_FALSE 36 | |
909 6 LOAD_GLOBAL 1 (torch) | |
8 LOAD_ATTR 2 (cuda) | |
10 LOAD_METHOD 3 (set_device) | |
12 LOAD_FAST 0 (self) | |
14 LOAD_ATTR 4 (first_device) | |
16 CALL_METHOD 1 | |
18 POP_TOP | |
910 20 LOAD_FAST 0 (self) | |
22 LOAD_ATTR 5 (embed_tokens) | |
24 LOAD_METHOD 6 (to) | |
26 LOAD_FAST 0 (self) | |
28 LOAD_ATTR 4 (first_device) | |
30 CALL_METHOD 1 | |
32 LOAD_FAST 0 (self) | |
34 STORE_ATTR 5 (embed_tokens) | |
911 >> 36 LOAD_DEREF 1 (use_cache) | |
38 LOAD_CONST 0 (None) | |
40 COMPARE_OP 9 (is not) | |
42 POP_JUMP_IF_FALSE 48 | |
44 LOAD_DEREF 1 (use_cache) | |
46 JUMP_FORWARD 6 (to 54) | |
>> 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (config) | |
52 LOAD_ATTR 8 (use_cache) | |
>> 54 STORE_DEREF 1 (use_cache) | |
912 56 LOAD_DEREF 0 (output_attentions) | |
58 LOAD_CONST 0 (None) | |
60 COMPARE_OP 9 (is not) | |
62 POP_JUMP_IF_FALSE 68 | |
64 LOAD_DEREF 0 (output_attentions) | |
66 JUMP_FORWARD 6 (to 74) | |
>> 68 LOAD_FAST 0 (self) | |
70 LOAD_ATTR 7 (config) | |
72 LOAD_ATTR 9 (output_attentions) | |
>> 74 STORE_DEREF 0 (output_attentions) | |
914 76 LOAD_FAST 11 (output_hidden_states) | |
78 LOAD_CONST 0 (None) | |
80 COMPARE_OP 9 (is not) | |
82 POP_JUMP_IF_FALSE 88 | |
84 LOAD_FAST 11 (output_hidden_states) | |
86 JUMP_FORWARD 6 (to 94) | |
>> 88 LOAD_FAST 0 (self) | |
90 LOAD_ATTR 7 (config) | |
92 LOAD_ATTR 10 (output_hidden_states) | |
913 >> 94 STORE_FAST 11 (output_hidden_states) | |
916 96 LOAD_FAST 12 (return_dict) | |
98 LOAD_CONST 0 (None) | |
100 COMPARE_OP 9 (is not) | |
102 POP_JUMP_IF_FALSE 108 | |
104 LOAD_FAST 12 (return_dict) | |
106 JUMP_FORWARD 6 (to 114) | |
>> 108 LOAD_FAST 0 (self) | |
110 LOAD_ATTR 7 (config) | |
112 LOAD_ATTR 11 (use_return_dict) | |
>> 114 STORE_FAST 12 (return_dict) | |
918 116 LOAD_FAST 1 (input_ids) | |
118 LOAD_CONST 0 (None) | |
120 COMPARE_OP 9 (is not) | |
122 POP_JUMP_IF_FALSE 170 | |
124 LOAD_FAST 5 (inputs_embeds) | |
126 LOAD_CONST 0 (None) | |
128 COMPARE_OP 9 (is not) | |
130 POP_JUMP_IF_FALSE 170 | |
919 132 LOAD_FAST 0 (self) | |
134 LOAD_ATTR 12 (is_decoder) | |
136 POP_JUMP_IF_FALSE 142 | |
138 LOAD_CONST 1 ('decoder_') | |
140 JUMP_FORWARD 2 (to 144) | |
>> 142 LOAD_CONST 2 ('') | |
>> 144 STORE_FAST 13 (err_msg_prefix) | |
920 146 LOAD_GLOBAL 13 (ValueError) | |
921 148 LOAD_CONST 3 ('You cannot specify both ') | |
150 LOAD_FAST 13 (err_msg_prefix) | |
152 FORMAT_VALUE 0 | |
154 LOAD_CONST 4 ('input_ids and ') | |
156 LOAD_FAST 13 (err_msg_prefix) | |
158 FORMAT_VALUE 0 | |
160 LOAD_CONST 5 ('inputs_embeds at the same time') | |
162 BUILD_STRING 5 | |
920 164 CALL_FUNCTION 1 | |
166 RAISE_VARARGS 1 | |
168 JUMP_FORWARD 96 (to 266) | |
923 >> 170 LOAD_FAST 1 (input_ids) | |
172 LOAD_CONST 0 (None) | |
174 COMPARE_OP 9 (is not) | |
176 POP_JUMP_IF_FALSE 204 | |
924 178 LOAD_FAST 1 (input_ids) | |
180 LOAD_METHOD 14 (size) | |
182 CALL_METHOD 0 | |
184 STORE_FAST 14 (input_shape) | |
925 186 LOAD_FAST 1 (input_ids) | |
188 LOAD_METHOD 15 (view) | |
190 LOAD_CONST 6 (-1) | |
192 LOAD_FAST 14 (input_shape) | |
194 LOAD_CONST 6 (-1) | |
196 BINARY_SUBSCR | |
198 CALL_METHOD 2 | |
200 STORE_FAST 1 (input_ids) | |
202 JUMP_FORWARD 62 (to 266) | |
926 >> 204 LOAD_FAST 5 (inputs_embeds) | |
206 LOAD_CONST 0 (None) | |
208 COMPARE_OP 9 (is not) | |
210 POP_JUMP_IF_FALSE 230 | |
927 212 LOAD_FAST 5 (inputs_embeds) | |
214 LOAD_METHOD 14 (size) | |
216 CALL_METHOD 0 | |
218 LOAD_CONST 0 (None) | |
220 LOAD_CONST 6 (-1) | |
222 BUILD_SLICE 2 | |
224 BINARY_SUBSCR | |
226 STORE_FAST 14 (input_shape) | |
228 JUMP_FORWARD 36 (to 266) | |
929 >> 230 LOAD_FAST 0 (self) | |
232 LOAD_ATTR 12 (is_decoder) | |
234 POP_JUMP_IF_FALSE 240 | |
236 LOAD_CONST 1 ('decoder_') | |
238 JUMP_FORWARD 2 (to 242) | |
>> 240 LOAD_CONST 2 ('') | |
>> 242 STORE_FAST 13 (err_msg_prefix) | |
930 244 LOAD_GLOBAL 13 (ValueError) | |
246 LOAD_CONST 7 ('You have to specify either ') | |
248 LOAD_FAST 13 (err_msg_prefix) | |
250 FORMAT_VALUE 0 | |
252 LOAD_CONST 8 ('input_ids or ') | |
254 LOAD_FAST 13 (err_msg_prefix) | |
256 FORMAT_VALUE 0 | |
258 LOAD_CONST 9 ('inputs_embeds') | |
260 BUILD_STRING 5 | |
262 CALL_FUNCTION 1 | |
264 RAISE_VARARGS 1 | |
932 >> 266 LOAD_FAST 5 (inputs_embeds) | |
268 LOAD_CONST 0 (None) | |
270 COMPARE_OP 8 (is) | |
272 EXTENDED_ARG 1 | |
274 POP_JUMP_IF_FALSE 306 | |
933 276 LOAD_FAST 0 (self) | |
278 LOAD_ATTR 5 (embed_tokens) | |
280 LOAD_CONST 0 (None) | |
282 COMPARE_OP 9 (is not) | |
284 EXTENDED_ARG 1 | |
286 POP_JUMP_IF_TRUE 296 | |
288 LOAD_GLOBAL 16 (AssertionError) | |
290 LOAD_CONST 10 ('You have to initialize the model with valid token embeddings') | |
292 CALL_FUNCTION 1 | |
294 RAISE_VARARGS 1 | |
934 >> 296 LOAD_FAST 0 (self) | |
298 LOAD_METHOD 5 (embed_tokens) | |
300 LOAD_FAST 1 (input_ids) | |
302 CALL_METHOD 1 | |
304 STORE_FAST 5 (inputs_embeds) | |
936 >> 306 LOAD_FAST 14 (input_shape) | |
308 UNPACK_SEQUENCE 2 | |
310 STORE_FAST 15 (batch_size) | |
312 STORE_FAST 16 (seq_length) | |
939 314 LOAD_FAST 8 (past_key_values) | |
316 LOAD_CONST 0 (None) | |
318 COMPARE_OP 9 (is not) | |
320 EXTENDED_ARG 1 | |
322 POP_JUMP_IF_FALSE 346 | |
324 LOAD_FAST 8 (past_key_values) | |
326 LOAD_CONST 11 (0) | |
328 BINARY_SUBSCR | |
330 LOAD_CONST 11 (0) | |
332 BINARY_SUBSCR | |
334 LOAD_ATTR 17 (shape) | |
336 LOAD_CONST 12 (2) | |
338 BINARY_SUBSCR | |
340 LOAD_FAST 16 (seq_length) | |
342 BINARY_ADD | |
344 JUMP_FORWARD 2 (to 348) | |
>> 346 LOAD_FAST 16 (seq_length) | |
>> 348 STORE_FAST 17 (mask_seq_length) | |
941 350 LOAD_DEREF 1 (use_cache) | |
352 LOAD_CONST 13 (True) | |
354 COMPARE_OP 8 (is) | |
356 EXTENDED_ARG 1 | |
358 POP_JUMP_IF_FALSE 384 | |
942 360 LOAD_FAST 0 (self) | |
362 LOAD_ATTR 12 (is_decoder) | |
364 EXTENDED_ARG 1 | |
366 POP_JUMP_IF_TRUE 384 | |
368 LOAD_GLOBAL 16 (AssertionError) | |
370 LOAD_CONST 14 ('`use_cache` can only be set to `True` if ') | |
372 LOAD_FAST 0 (self) | |
374 FORMAT_VALUE 0 | |
376 LOAD_CONST 15 (' is used as a decoder') | |
378 BUILD_STRING 3 | |
380 CALL_FUNCTION 1 | |
382 RAISE_VARARGS 1 | |
944 >> 384 LOAD_FAST 2 (attention_mask) | |
386 LOAD_CONST 0 (None) | |
388 COMPARE_OP 8 (is) | |
390 EXTENDED_ARG 1 | |
392 POP_JUMP_IF_FALSE 414 | |
945 394 LOAD_GLOBAL 1 (torch) | |
396 LOAD_METHOD 18 (ones) | |
398 LOAD_FAST 15 (batch_size) | |
400 LOAD_FAST 17 (mask_seq_length) | |
402 CALL_METHOD 2 | |
404 LOAD_METHOD 6 (to) | |
406 LOAD_FAST 5 (inputs_embeds) | |
408 LOAD_ATTR 19 (device) | |
410 CALL_METHOD 1 | |
412 STORE_FAST 2 (attention_mask) | |
946 >> 414 LOAD_FAST 0 (self) | |
416 LOAD_ATTR 12 (is_decoder) | |
418 EXTENDED_ARG 1 | |
420 POP_JUMP_IF_FALSE 474 | |
422 LOAD_FAST 4 (encoder_attention_mask) | |
424 LOAD_CONST 0 (None) | |
426 COMPARE_OP 8 (is) | |
428 EXTENDED_ARG 1 | |
430 POP_JUMP_IF_FALSE 474 | |
432 LOAD_FAST 3 (encoder_hidden_states) | |
434 LOAD_CONST 0 (None) | |
436 COMPARE_OP 9 (is not) | |
438 EXTENDED_ARG 1 | |
440 POP_JUMP_IF_FALSE 474 | |
947 442 LOAD_FAST 3 (encoder_hidden_states) | |
444 LOAD_ATTR 17 (shape) | |
446 LOAD_CONST 16 (1) | |
448 BINARY_SUBSCR | |
450 STORE_FAST 18 (encoder_seq_length) | |
948 452 LOAD_GLOBAL 1 (torch) | |
454 LOAD_ATTR 18 (ones) | |
949 456 LOAD_FAST 15 (batch_size) | |
458 LOAD_FAST 18 (encoder_seq_length) | |
460 LOAD_FAST 5 (inputs_embeds) | |
462 LOAD_ATTR 19 (device) | |
464 LOAD_GLOBAL 1 (torch) | |
466 LOAD_ATTR 20 (long) | |
948 468 LOAD_CONST 17 (('device', 'dtype')) | |
470 CALL_FUNCTION_KW 4 | |
472 STORE_FAST 4 (encoder_attention_mask) | |
953 >> 474 LOAD_FAST 8 (past_key_values) | |
476 LOAD_CONST 0 (None) | |
478 COMPARE_OP 8 (is) | |
480 EXTENDED_ARG 1 | |
482 POP_JUMP_IF_FALSE 500 | |
954 484 LOAD_CONST 0 (None) | |
486 BUILD_LIST 1 | |
488 LOAD_GLOBAL 21 (len) | |
490 LOAD_FAST 0 (self) | |
492 LOAD_ATTR 22 (block) | |
494 CALL_FUNCTION 1 | |
496 BINARY_MULTIPLY | |
498 STORE_FAST 8 (past_key_values) | |
958 >> 500 LOAD_FAST 0 (self) | |
502 LOAD_METHOD 23 (get_extended_attention_mask) | |
504 LOAD_FAST 2 (attention_mask) | |
506 LOAD_FAST 14 (input_shape) | |
508 CALL_METHOD 2 | |
510 STORE_FAST 19 (extended_attention_mask) | |
962 512 LOAD_FAST 0 (self) | |
514 LOAD_ATTR 12 (is_decoder) | |
516 EXTENDED_ARG 2 | |
518 POP_JUMP_IF_FALSE 590 | |
520 LOAD_FAST 3 (encoder_hidden_states) | |
522 LOAD_CONST 0 (None) | |
524 COMPARE_OP 9 (is not) | |
526 EXTENDED_ARG 2 | |
528 POP_JUMP_IF_FALSE 590 | |
963 530 LOAD_FAST 3 (encoder_hidden_states) | |
532 LOAD_METHOD 14 (size) | |
534 CALL_METHOD 0 | |
536 UNPACK_SEQUENCE 3 | |
538 STORE_FAST 20 (encoder_batch_size) | |
540 STORE_FAST 21 (encoder_sequence_length) | |
542 STORE_FAST 22 (_) | |
964 544 LOAD_FAST 20 (encoder_batch_size) | |
546 LOAD_FAST 21 (encoder_sequence_length) | |
548 BUILD_TUPLE 2 | |
550 STORE_FAST 23 (encoder_hidden_shape) | |
965 552 LOAD_FAST 4 (encoder_attention_mask) | |
554 LOAD_CONST 0 (None) | |
556 COMPARE_OP 8 (is) | |
558 EXTENDED_ARG 2 | |
560 POP_JUMP_IF_FALSE 578 | |
966 562 LOAD_GLOBAL 1 (torch) | |
564 LOAD_ATTR 18 (ones) | |
566 LOAD_FAST 23 (encoder_hidden_shape) | |
568 LOAD_FAST 5 (inputs_embeds) | |
570 LOAD_ATTR 19 (device) | |
572 LOAD_CONST 18 (('device',)) | |
574 CALL_FUNCTION_KW 2 | |
576 STORE_FAST 4 (encoder_attention_mask) | |
967 >> 578 LOAD_FAST 0 (self) | |
580 LOAD_METHOD 24 (invert_attention_mask) | |
582 LOAD_FAST 4 (encoder_attention_mask) | |
584 CALL_METHOD 1 | |
586 STORE_FAST 24 (encoder_extended_attention_mask) | |
588 JUMP_FORWARD 4 (to 594) | |
969 >> 590 LOAD_CONST 0 (None) | |
592 STORE_FAST 24 (encoder_extended_attention_mask) | |
972 >> 594 LOAD_FAST 0 (self) | |
596 LOAD_METHOD 25 (get_head_mask) | |
598 LOAD_FAST 6 (head_mask) | |
600 LOAD_FAST 0 (self) | |
602 LOAD_ATTR 7 (config) | |
604 LOAD_ATTR 26 (num_layers) | |
606 CALL_METHOD 2 | |
608 STORE_FAST 6 (head_mask) | |
973 610 LOAD_FAST 0 (self) | |
612 LOAD_METHOD 25 (get_head_mask) | |
614 LOAD_FAST 7 (cross_attn_head_mask) | |
616 LOAD_FAST 0 (self) | |
618 LOAD_ATTR 7 (config) | |
620 LOAD_ATTR 26 (num_layers) | |
622 CALL_METHOD 2 | |
624 STORE_FAST 7 (cross_attn_head_mask) | |
974 626 LOAD_DEREF 1 (use_cache) | |
628 EXTENDED_ARG 2 | |
630 POP_JUMP_IF_FALSE 636 | |
632 LOAD_CONST 19 (()) | |
634 JUMP_FORWARD 2 (to 638) | |
>> 636 LOAD_CONST 0 (None) | |
>> 638 STORE_FAST 25 (present_key_value_states) | |
975 640 LOAD_FAST 11 (output_hidden_states) | |
642 EXTENDED_ARG 2 | |
644 POP_JUMP_IF_FALSE 650 | |
646 LOAD_CONST 19 (()) | |
648 JUMP_FORWARD 2 (to 652) | |
>> 650 LOAD_CONST 0 (None) | |
>> 652 STORE_FAST 26 (all_hidden_states) | |
976 654 LOAD_DEREF 0 (output_attentions) | |
656 EXTENDED_ARG 2 | |
658 POP_JUMP_IF_FALSE 664 | |
660 LOAD_CONST 19 (()) | |
662 JUMP_FORWARD 2 (to 666) | |
>> 664 LOAD_CONST 0 (None) | |
>> 666 STORE_FAST 27 (all_attentions) | |
977 668 LOAD_DEREF 0 (output_attentions) | |
670 EXTENDED_ARG 2 | |
672 POP_JUMP_IF_FALSE 686 | |
674 LOAD_FAST 0 (self) | |
676 LOAD_ATTR 12 (is_decoder) | |
678 EXTENDED_ARG 2 | |
680 POP_JUMP_IF_FALSE 686 | |
682 LOAD_CONST 19 (()) | |
684 JUMP_FORWARD 2 (to 688) | |
>> 686 LOAD_CONST 0 (None) | |
>> 688 STORE_FAST 28 (all_cross_attentions) | |
978 690 LOAD_CONST 0 (None) | |
692 STORE_FAST 29 (position_bias) | |
979 694 LOAD_CONST 0 (None) | |
696 STORE_FAST 30 (encoder_decoder_position_bias) | |
981 698 LOAD_FAST 0 (self) | |
700 LOAD_METHOD 27 (dropout) | |
702 LOAD_FAST 5 (inputs_embeds) | |
704 CALL_METHOD 1 | |
706 STORE_FAST 31 (hidden_states) | |
983 708 LOAD_GLOBAL 28 (enumerate) | |
710 LOAD_GLOBAL 29 (zip) | |
712 LOAD_FAST 0 (self) | |
714 LOAD_ATTR 22 (block) | |
716 LOAD_FAST 8 (past_key_values) | |
718 CALL_FUNCTION 2 | |
720 CALL_FUNCTION 1 | |
722 GET_ITER | |
>> 724 EXTENDED_ARG 2 | |
726 FOR_ITER 576 (to 1304) | |
728 UNPACK_SEQUENCE 2 | |
730 STORE_FAST 32 (i) | |
732 UNPACK_SEQUENCE 2 | |
734 STORE_FAST 33 (layer_module) | |
736 STORE_FAST 34 (past_key_value) | |
984 738 LOAD_FAST 6 (head_mask) | |
740 LOAD_FAST 32 (i) | |
742 BINARY_SUBSCR | |
744 STORE_FAST 35 (layer_head_mask) | |
985 746 LOAD_FAST 7 (cross_attn_head_mask) | |
748 LOAD_FAST 32 (i) | |
750 BINARY_SUBSCR | |
752 STORE_FAST 36 (cross_attn_layer_head_mask) | |
987 754 LOAD_FAST 0 (self) | |
756 LOAD_ATTR 0 (model_parallel) | |
758 EXTENDED_ARG 3 | |
760 POP_JUMP_IF_FALSE 930 | |
988 762 LOAD_GLOBAL 1 (torch) | |
764 LOAD_ATTR 2 (cuda) | |
766 LOAD_METHOD 3 (set_device) | |
768 LOAD_FAST 31 (hidden_states) | |
770 LOAD_ATTR 19 (device) | |
772 CALL_METHOD 1 | |
774 POP_TOP | |
990 776 LOAD_FAST 2 (attention_mask) | |
778 LOAD_CONST 0 (None) | |
780 COMPARE_OP 9 (is not) | |
782 EXTENDED_ARG 3 | |
784 POP_JUMP_IF_FALSE 798 | |
991 786 LOAD_FAST 2 (attention_mask) | |
788 LOAD_METHOD 6 (to) | |
790 LOAD_FAST 31 (hidden_states) | |
792 LOAD_ATTR 19 (device) | |
794 CALL_METHOD 1 | |
796 STORE_FAST 2 (attention_mask) | |
992 >> 798 LOAD_FAST 29 (position_bias) | |
800 LOAD_CONST 0 (None) | |
802 COMPARE_OP 9 (is not) | |
804 EXTENDED_ARG 3 | |
806 POP_JUMP_IF_FALSE 820 | |
993 808 LOAD_FAST 29 (position_bias) | |
810 LOAD_METHOD 6 (to) | |
812 LOAD_FAST 31 (hidden_states) | |
814 LOAD_ATTR 19 (device) | |
816 CALL_METHOD 1 | |
818 STORE_FAST 29 (position_bias) | |
994 >> 820 LOAD_FAST 3 (encoder_hidden_states) | |
822 LOAD_CONST 0 (None) | |
824 COMPARE_OP 9 (is not) | |
826 EXTENDED_ARG 3 | |
828 POP_JUMP_IF_FALSE 842 | |
995 830 LOAD_FAST 3 (encoder_hidden_states) | |
832 LOAD_METHOD 6 (to) | |
834 LOAD_FAST 31 (hidden_states) | |
836 LOAD_ATTR 19 (device) | |
838 CALL_METHOD 1 | |
840 STORE_FAST 3 (encoder_hidden_states) | |
996 >> 842 LOAD_FAST 24 (encoder_extended_attention_mask) | |
844 LOAD_CONST 0 (None) | |
846 COMPARE_OP 9 (is not) | |
848 EXTENDED_ARG 3 | |
850 POP_JUMP_IF_FALSE 864 | |
997 852 LOAD_FAST 24 (encoder_extended_attention_mask) | |
854 LOAD_METHOD 6 (to) | |
856 LOAD_FAST 31 (hidden_states) | |
858 LOAD_ATTR 19 (device) | |
860 CALL_METHOD 1 | |
862 STORE_FAST 24 (encoder_extended_attention_mask) | |
998 >> 864 LOAD_FAST 30 (encoder_decoder_position_bias) | |
866 LOAD_CONST 0 (None) | |
868 COMPARE_OP 9 (is not) | |
870 EXTENDED_ARG 3 | |
872 POP_JUMP_IF_FALSE 886 | |
999 874 LOAD_FAST 30 (encoder_decoder_position_bias) | |
876 LOAD_METHOD 6 (to) | |
878 LOAD_FAST 31 (hidden_states) | |
880 LOAD_ATTR 19 (device) | |
882 CALL_METHOD 1 | |
884 STORE_FAST 30 (encoder_decoder_position_bias) | |
1000 >> 886 LOAD_FAST 35 (layer_head_mask) | |
888 LOAD_CONST 0 (None) | |
890 COMPARE_OP 9 (is not) | |
892 EXTENDED_ARG 3 | |
894 POP_JUMP_IF_FALSE 908 | |
1001 896 LOAD_FAST 35 (layer_head_mask) | |
898 LOAD_METHOD 6 (to) | |
900 LOAD_FAST 31 (hidden_states) | |
902 LOAD_ATTR 19 (device) | |
904 CALL_METHOD 1 | |
906 STORE_FAST 35 (layer_head_mask) | |
1002 >> 908 LOAD_FAST 36 (cross_attn_layer_head_mask) | |
910 LOAD_CONST 0 (None) | |
912 COMPARE_OP 9 (is not) | |
914 EXTENDED_ARG 3 | |
916 POP_JUMP_IF_FALSE 930 | |
1003 918 LOAD_FAST 36 (cross_attn_layer_head_mask) | |
920 LOAD_METHOD 6 (to) | |
922 LOAD_FAST 31 (hidden_states) | |
924 LOAD_ATTR 19 (device) | |
926 CALL_METHOD 1 | |
928 STORE_FAST 36 (cross_attn_layer_head_mask) | |
1004 >> 930 LOAD_FAST 11 (output_hidden_states) | |
932 EXTENDED_ARG 3 | |
934 POP_JUMP_IF_FALSE 946 | |
1005 936 LOAD_FAST 26 (all_hidden_states) | |
938 LOAD_FAST 31 (hidden_states) | |
940 BUILD_TUPLE 1 | |
942 BINARY_ADD | |
944 STORE_FAST 26 (all_hidden_states) | |
1007 >> 946 LOAD_FAST 0 (self) | |
948 LOAD_ATTR 30 (gradient_checkpointing) | |
950 EXTENDED_ARG 4 | |
952 POP_JUMP_IF_FALSE 1028 | |
954 LOAD_FAST 0 (self) | |
956 LOAD_ATTR 31 (training) | |
958 EXTENDED_ARG 4 | |
960 POP_JUMP_IF_FALSE 1028 | |
1008 962 LOAD_DEREF 1 (use_cache) | |
964 EXTENDED_ARG 3 | |
966 POP_JUMP_IF_FALSE 982 | |
1009 968 LOAD_GLOBAL 32 (logger) | |
970 LOAD_METHOD 33 (warning) | |
1010 972 LOAD_CONST 20 ('`use_cache=True` is incompatible with gradient checkpointing. Setting `use_cache=False`...') | |
1009 974 CALL_METHOD 1 | |
976 POP_TOP | |
1012 978 LOAD_CONST 21 (False) | |
980 STORE_DEREF 1 (use_cache) | |
1014 >> 982 LOAD_CLOSURE 0 (output_attentions) | |
984 LOAD_CLOSURE 1 (use_cache) | |
986 BUILD_TUPLE 2 | |
988 LOAD_CONST 22 (<code object create_custom_forward at 0x7f3f100b4ea0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 1014>) | |
990 LOAD_CONST 23 ('T5Stack.forward.<locals>.create_custom_forward') | |
992 MAKE_FUNCTION 8 (closure) | |
994 STORE_FAST 37 (create_custom_forward) | |
1020 996 LOAD_GLOBAL 34 (checkpoint) | |
1021 998 LOAD_FAST 37 (create_custom_forward) | |
1000 LOAD_FAST 33 (layer_module) | |
1002 CALL_FUNCTION 1 | |
1022 1004 LOAD_FAST 31 (hidden_states) | |
1023 1006 LOAD_FAST 19 (extended_attention_mask) | |
1024 1008 LOAD_FAST 29 (position_bias) | |
1025 1010 LOAD_FAST 3 (encoder_hidden_states) | |
1026 1012 LOAD_FAST 24 (encoder_extended_attention_mask) | |
1027 1014 LOAD_FAST 30 (encoder_decoder_position_bias) | |
1028 1016 LOAD_FAST 35 (layer_head_mask) | |
1029 1018 LOAD_FAST 36 (cross_attn_layer_head_mask) | |
1030 1020 LOAD_CONST 0 (None) | |
1020 1022 CALL_FUNCTION 10 | |
1024 STORE_FAST 38 (layer_outputs) | |
1026 JUMP_FORWARD 30 (to 1058) | |
1033 >> 1028 LOAD_FAST 33 (layer_module) | |
1034 1030 LOAD_FAST 31 (hidden_states) | |
1035 1032 LOAD_FAST 19 (extended_attention_mask) | |
1036 1034 LOAD_FAST 29 (position_bias) | |
1037 1036 LOAD_FAST 3 (encoder_hidden_states) | |
1038 1038 LOAD_FAST 24 (encoder_extended_attention_mask) | |
1039 1040 LOAD_FAST 30 (encoder_decoder_position_bias) | |
1040 1042 LOAD_FAST 35 (layer_head_mask) | |
1041 1044 LOAD_FAST 36 (cross_attn_layer_head_mask) | |
1042 1046 LOAD_FAST 34 (past_key_value) | |
1043 1048 LOAD_DEREF 1 (use_cache) | |
1044 1050 LOAD_DEREF 0 (output_attentions) | |
1033 1052 LOAD_CONST 24 (('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
1054 CALL_FUNCTION_KW 11 | |
1056 STORE_FAST 38 (layer_outputs) | |
1049 >> 1058 LOAD_DEREF 1 (use_cache) | |
1060 LOAD_CONST 21 (False) | |
1062 COMPARE_OP 8 (is) | |
1064 EXTENDED_ARG 4 | |
1066 POP_JUMP_IF_FALSE 1096 | |
1050 1068 LOAD_FAST 38 (layer_outputs) | |
1070 LOAD_CONST 0 (None) | |
1072 LOAD_CONST 16 (1) | |
1074 BUILD_SLICE 2 | |
1076 BINARY_SUBSCR | |
1078 LOAD_CONST 25 ((None,)) | |
1080 BINARY_ADD | |
1082 LOAD_FAST 38 (layer_outputs) | |
1084 LOAD_CONST 16 (1) | |
1086 LOAD_CONST 0 (None) | |
1088 BUILD_SLICE 2 | |
1090 BINARY_SUBSCR | |
1092 BINARY_ADD | |
1094 STORE_FAST 38 (layer_outputs) | |
1052 >> 1096 LOAD_FAST 38 (layer_outputs) | |
1098 LOAD_CONST 0 (None) | |
1100 LOAD_CONST 12 (2) | |
1102 BUILD_SLICE 2 | |
1104 BINARY_SUBSCR | |
1106 UNPACK_SEQUENCE 2 | |
1108 STORE_FAST 31 (hidden_states) | |
1110 STORE_FAST 39 (present_key_value_state) | |
1057 1112 LOAD_FAST 38 (layer_outputs) | |
1114 LOAD_CONST 12 (2) | |
1116 BINARY_SUBSCR | |
1118 STORE_FAST 29 (position_bias) | |
1058 1120 LOAD_FAST 0 (self) | |
1122 LOAD_ATTR 12 (is_decoder) | |
1124 EXTENDED_ARG 4 | |
1126 POP_JUMP_IF_FALSE 1156 | |
1128 LOAD_FAST 3 (encoder_hidden_states) | |
1130 LOAD_CONST 0 (None) | |
1132 COMPARE_OP 9 (is not) | |
1134 EXTENDED_ARG 4 | |
1136 POP_JUMP_IF_FALSE 1156 | |
1059 1138 LOAD_FAST 38 (layer_outputs) | |
1140 LOAD_DEREF 0 (output_attentions) | |
1142 EXTENDED_ARG 4 | |
1144 POP_JUMP_IF_FALSE 1150 | |
1146 LOAD_CONST 26 (4) | |
1148 JUMP_FORWARD 2 (to 1152) | |
>> 1150 LOAD_CONST 27 (3) | |
>> 1152 BINARY_SUBSCR | |
1154 STORE_FAST 30 (encoder_decoder_position_bias) | |
1061 >> 1156 LOAD_DEREF 1 (use_cache) | |
1158 EXTENDED_ARG 4 | |
1160 POP_JUMP_IF_FALSE 1172 | |
1062 1162 LOAD_FAST 25 (present_key_value_states) | |
1164 LOAD_FAST 39 (present_key_value_state) | |
1166 BUILD_TUPLE 1 | |
1168 BINARY_ADD | |
1170 STORE_FAST 25 (present_key_value_states) | |
1064 >> 1172 LOAD_DEREF 0 (output_attentions) | |
1174 EXTENDED_ARG 4 | |
1176 POP_JUMP_IF_FALSE 1214 | |
1065 1178 LOAD_FAST 27 (all_attentions) | |
1180 LOAD_FAST 38 (layer_outputs) | |
1182 LOAD_CONST 27 (3) | |
1184 BINARY_SUBSCR | |
1186 BUILD_TUPLE 1 | |
1188 BINARY_ADD | |
1190 STORE_FAST 27 (all_attentions) | |
1066 1192 LOAD_FAST 0 (self) | |
1194 LOAD_ATTR 12 (is_decoder) | |
1196 EXTENDED_ARG 4 | |
1198 POP_JUMP_IF_FALSE 1214 | |
1067 1200 LOAD_FAST 28 (all_cross_attentions) | |
1202 LOAD_FAST 38 (layer_outputs) | |
1204 LOAD_CONST 28 (5) | |
1206 BINARY_SUBSCR | |
1208 BUILD_TUPLE 1 | |
1210 BINARY_ADD | |
1212 STORE_FAST 28 (all_cross_attentions) | |
1070 >> 1214 LOAD_FAST 0 (self) | |
1216 LOAD_ATTR 0 (model_parallel) | |
1218 EXTENDED_ARG 2 | |
1220 POP_JUMP_IF_FALSE 724 | |
1071 1222 LOAD_FAST 0 (self) | |
1224 LOAD_ATTR 35 (device_map) | |
1226 LOAD_METHOD 36 (items) | |
1228 CALL_METHOD 0 | |
1230 GET_ITER | |
>> 1232 FOR_ITER 66 (to 1300) | |
1234 UNPACK_SEQUENCE 2 | |
1236 STORE_FAST 40 (k) | |
1238 STORE_FAST 41 (v) | |
1072 1240 LOAD_FAST 32 (i) | |
1242 LOAD_FAST 41 (v) | |
1244 LOAD_CONST 6 (-1) | |
1246 BINARY_SUBSCR | |
1248 COMPARE_OP 2 (==) | |
1250 EXTENDED_ARG 4 | |
1252 POP_JUMP_IF_FALSE 1232 | |
1254 LOAD_CONST 29 ('cuda:') | |
1256 LOAD_GLOBAL 37 (str) | |
1258 LOAD_FAST 40 (k) | |
1260 CALL_FUNCTION 1 | |
1262 BINARY_ADD | |
1264 LOAD_FAST 0 (self) | |
1266 LOAD_ATTR 38 (last_device) | |
1268 COMPARE_OP 3 (!=) | |
1270 EXTENDED_ARG 4 | |
1272 POP_JUMP_IF_FALSE 1232 | |
1073 1274 LOAD_FAST 31 (hidden_states) | |
1276 LOAD_METHOD 6 (to) | |
1278 LOAD_CONST 29 ('cuda:') | |
1280 LOAD_GLOBAL 37 (str) | |
1282 LOAD_FAST 40 (k) | |
1284 LOAD_CONST 16 (1) | |
1286 BINARY_ADD | |
1288 CALL_FUNCTION 1 | |
1290 BINARY_ADD | |
1292 CALL_METHOD 1 | |
1294 STORE_FAST 31 (hidden_states) | |
1296 EXTENDED_ARG 4 | |
1298 JUMP_ABSOLUTE 1232 | |
>> 1300 EXTENDED_ARG 2 | |
1302 JUMP_ABSOLUTE 724 | |
1075 >> 1304 LOAD_FAST 0 (self) | |
1306 LOAD_METHOD 39 (final_layer_norm) | |
1308 LOAD_FAST 31 (hidden_states) | |
1310 CALL_METHOD 1 | |
1312 STORE_FAST 31 (hidden_states) | |
1076 1314 LOAD_FAST 0 (self) | |
1316 LOAD_METHOD 27 (dropout) | |
1318 LOAD_FAST 31 (hidden_states) | |
1320 CALL_METHOD 1 | |
1322 STORE_FAST 31 (hidden_states) | |
1079 1324 LOAD_FAST 11 (output_hidden_states) | |
1326 EXTENDED_ARG 5 | |
1328 POP_JUMP_IF_FALSE 1340 | |
1080 1330 LOAD_FAST 26 (all_hidden_states) | |
1332 LOAD_FAST 31 (hidden_states) | |
1334 BUILD_TUPLE 1 | |
1336 BINARY_ADD | |
1338 STORE_FAST 26 (all_hidden_states) | |
1082 >> 1340 LOAD_FAST 12 (return_dict) | |
1342 EXTENDED_ARG 5 | |
1344 POP_JUMP_IF_TRUE 1374 | |
1083 1346 LOAD_GLOBAL 40 (tuple) | |
1348 LOAD_CONST 30 (<code object <genexpr> at 0x7f3f100b4f50, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 1083>) | |
1350 LOAD_CONST 31 ('T5Stack.forward.<locals>.<genexpr>') | |
1352 MAKE_FUNCTION 0 | |
1086 1354 LOAD_FAST 31 (hidden_states) | |
1087 1356 LOAD_FAST 25 (present_key_value_states) | |
1088 1358 LOAD_FAST 26 (all_hidden_states) | |
1089 1360 LOAD_FAST 27 (all_attentions) | |
1090 1362 LOAD_FAST 28 (all_cross_attentions) | |
1085 1364 BUILD_TUPLE 5 | |
1083 1366 GET_ITER | |
1368 CALL_FUNCTION 1 | |
1370 CALL_FUNCTION 1 | |
1372 RETURN_VALUE | |
1094 >> 1374 LOAD_GLOBAL 41 (BaseModelOutputWithPastAndCrossAttentions) | |
1095 1376 LOAD_FAST 31 (hidden_states) | |
1096 1378 LOAD_FAST 25 (present_key_value_states) | |
1097 1380 LOAD_FAST 26 (all_hidden_states) | |
1098 1382 LOAD_FAST 27 (all_attentions) | |
1099 1384 LOAD_FAST 28 (all_cross_attentions) | |
1094 1386 LOAD_CONST 32 (('last_hidden_state', 'past_key_values', 'hidden_states', 'attentions', 'cross_attentions')) | |
1388 CALL_FUNCTION_KW 5 | |
1390 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 36 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 48 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR use_cache [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF use_cache [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 68 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR output_attentions [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF output_attentions [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 88 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR output_hidden_states [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST output_hidden_states [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST return_dict [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(bool), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 108 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST return_dict [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 114 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST return_dict [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 170 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST inputs_embeds [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 170 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 204 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR size [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), size)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST input_shape [SizeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_shape [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), SizeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), SizeVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST input_ids [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 266 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST inputs_embeds [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 306 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR embed_tokens [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_TRUE 296 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR embed_tokens [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST inputs_embeds [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [SizeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_values [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 346 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST mask_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 384 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 414 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR ones [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST batch_size [TorchVariable(<built-in method ones of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST mask_seq_length [TorchVariable(<built-in method ones of type object at 0x7f40b0102e20>), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method ones of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST inputs_embeds [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR device [GetAttrVariable(TensorVariable(), to), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), TorchVariable(cuda:0)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_mask [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 474 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_values [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 500 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_LIST 1 [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL len [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListVariable(), BuiltinVariable(len)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR block [ListVariable(), BuiltinVariable(len), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [ListVariable(), BuiltinVariable(len), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_values [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR get_extended_attention_mask [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [UserMethodVariable(<function ModuleUtilsMixin.get_extended_attention_mask at 0x7f3ef770e280>, NNModuleVariable())] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_shape [UserMethodVariable(<function ModuleUtilsMixin.get_extended_attention_mask at 0x7f3ef770e280>, NNModuleVariable()), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [UserMethodVariable(<function ModuleUtilsMixin.get_extended_attention_mask at 0x7f3ef770e280>, NNModuleVariable()), TensorVariable(), SizeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object get_extended_attention_mask at 0x7f3f10087be0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 794> | |
809 0 LOAD_FAST 1 (attention_mask) | |
2 LOAD_METHOD 0 (dim) | |
4 CALL_METHOD 0 | |
6 LOAD_CONST 1 (2) | |
8 COMPARE_OP 2 (==) | |
10 POP_JUMP_IF_FALSE 20 | |
12 LOAD_FAST 0 (self) | |
14 LOAD_ATTR 1 (config) | |
16 LOAD_ATTR 2 (is_decoder) | |
18 POP_JUMP_IF_TRUE 40 | |
811 >> 20 LOAD_FAST 3 (device) | |
22 LOAD_CONST 2 (None) | |
24 COMPARE_OP 9 (is not) | |
26 POP_JUMP_IF_FALSE 40 | |
812 28 LOAD_GLOBAL 3 (warnings) | |
30 LOAD_METHOD 4 (warn) | |
813 32 LOAD_CONST 3 ('The `device` argument is deprecated and will be removed in v5 of Transformers.') | |
34 LOAD_GLOBAL 5 (FutureWarning) | |
812 36 CALL_METHOD 2 | |
38 POP_TOP | |
817 >> 40 LOAD_FAST 1 (attention_mask) | |
42 LOAD_METHOD 0 (dim) | |
44 CALL_METHOD 0 | |
46 LOAD_CONST 4 (3) | |
48 COMPARE_OP 2 (==) | |
50 POP_JUMP_IF_FALSE 82 | |
818 52 LOAD_FAST 1 (attention_mask) | |
54 LOAD_CONST 2 (None) | |
56 LOAD_CONST 2 (None) | |
58 BUILD_SLICE 2 | |
60 LOAD_CONST 2 (None) | |
62 LOAD_CONST 2 (None) | |
64 LOAD_CONST 2 (None) | |
66 BUILD_SLICE 2 | |
68 LOAD_CONST 2 (None) | |
70 LOAD_CONST 2 (None) | |
72 BUILD_SLICE 2 | |
74 BUILD_TUPLE 4 | |
76 BINARY_SUBSCR | |
78 STORE_FAST 4 (extended_attention_mask) | |
80 JUMP_FORWARD 86 (to 168) | |
819 >> 82 LOAD_FAST 1 (attention_mask) | |
84 LOAD_METHOD 0 (dim) | |
86 CALL_METHOD 0 | |
88 LOAD_CONST 1 (2) | |
90 COMPARE_OP 2 (==) | |
92 POP_JUMP_IF_FALSE 144 | |
823 94 LOAD_FAST 0 (self) | |
96 LOAD_ATTR 1 (config) | |
98 LOAD_ATTR 2 (is_decoder) | |
100 POP_JUMP_IF_FALSE 118 | |
824 102 LOAD_GLOBAL 6 (ModuleUtilsMixin) | |
104 LOAD_METHOD 7 (create_extended_attention_mask_for_decoder) | |
825 106 LOAD_FAST 2 (input_shape) | |
108 LOAD_FAST 1 (attention_mask) | |
110 LOAD_FAST 3 (device) | |
824 112 CALL_METHOD 3 | |
114 STORE_FAST 4 (extended_attention_mask) | |
116 JUMP_ABSOLUTE 168 | |
828 >> 118 LOAD_FAST 1 (attention_mask) | |
120 LOAD_CONST 2 (None) | |
122 LOAD_CONST 2 (None) | |
124 BUILD_SLICE 2 | |
126 LOAD_CONST 2 (None) | |
128 LOAD_CONST 2 (None) | |
130 LOAD_CONST 2 (None) | |
132 LOAD_CONST 2 (None) | |
134 BUILD_SLICE 2 | |
136 BUILD_TUPLE 4 | |
138 BINARY_SUBSCR | |
140 STORE_FAST 4 (extended_attention_mask) | |
142 JUMP_FORWARD 24 (to 168) | |
830 >> 144 LOAD_GLOBAL 8 (ValueError) | |
831 146 LOAD_CONST 5 ('Wrong shape for input_ids (shape ') | |
148 LOAD_FAST 2 (input_shape) | |
150 FORMAT_VALUE 0 | |
152 LOAD_CONST 6 (') or attention_mask (shape ') | |
154 LOAD_FAST 1 (attention_mask) | |
156 LOAD_ATTR 9 (shape) | |
158 FORMAT_VALUE 0 | |
160 LOAD_CONST 7 (')') | |
162 BUILD_STRING 5 | |
830 164 CALL_FUNCTION 1 | |
166 RAISE_VARARGS 1 | |
839 >> 168 LOAD_FAST 4 (extended_attention_mask) | |
170 LOAD_ATTR 10 (to) | |
172 LOAD_FAST 0 (self) | |
174 LOAD_ATTR 11 (dtype) | |
176 LOAD_CONST 8 (('dtype',)) | |
178 CALL_FUNCTION_KW 1 | |
180 STORE_FAST 4 (extended_attention_mask) | |
840 182 LOAD_CONST 9 (1.0) | |
184 LOAD_FAST 4 (extended_attention_mask) | |
186 BINARY_SUBTRACT | |
188 LOAD_CONST 10 (-10000.0) | |
190 BINARY_MULTIPLY | |
192 STORE_FAST 4 (extended_attention_mask) | |
841 194 LOAD_FAST 4 (extended_attention_mask) | |
196 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dim [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), dim)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 20 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_TRUE 40 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST device [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 40 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dim [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), dim)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dim [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), dim)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 144 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 118 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), SliceVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), SliceVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), SliceVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TensorVariable(), SliceVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 4 [TensorVariable(), SliceVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST extended_attention_mask [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 168 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [GetAttrVariable(TensorVariable(), to), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object dtype at 0x7f3f100879d0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 727> | |
732 0 LOAD_GLOBAL 0 (get_parameter_dtype) | |
2 LOAD_FAST 0 (self) | |
4 CALL_FUNCTION 1 | |
6 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL get_parameter_dtype [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [UserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [UserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object get_parameter_dtype at 0x7f3f1005d190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 153> | |
157 0 LOAD_CONST 1 (None) | |
2 STORE_FAST 1 (last_dtype) | |
158 4 LOAD_FAST 0 (parameter) | |
6 LOAD_METHOD 0 (parameters) | |
8 CALL_METHOD 0 | |
10 GET_ITER | |
>> 12 FOR_ITER 28 (to 42) | |
14 STORE_FAST 2 (t) | |
159 16 LOAD_FAST 2 (t) | |
18 LOAD_ATTR 1 (dtype) | |
20 STORE_FAST 1 (last_dtype) | |
160 22 LOAD_FAST 2 (t) | |
24 LOAD_METHOD 2 (is_floating_point) | |
26 CALL_METHOD 0 | |
28 POP_JUMP_IF_FALSE 12 | |
161 30 LOAD_FAST 2 (t) | |
32 LOAD_ATTR 1 (dtype) | |
34 ROT_TWO | |
36 POP_TOP | |
38 RETURN_VALUE | |
40 JUMP_ABSOLUTE 12 | |
163 >> 42 LOAD_FAST 1 (last_dtype) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 54 | |
165 50 LOAD_FAST 1 (last_dtype) | |
52 RETURN_VALUE | |
169 >> 54 LOAD_GLOBAL 3 (nn) | |
56 LOAD_ATTR 4 (Module) | |
58 LOAD_GLOBAL 5 (List) | |
60 LOAD_GLOBAL 6 (Tuple) | |
62 LOAD_GLOBAL 7 (str) | |
64 LOAD_GLOBAL 8 (Tensor) | |
66 BUILD_TUPLE 2 | |
68 BINARY_SUBSCR | |
70 BINARY_SUBSCR | |
72 LOAD_CONST 2 (('module', 'return')) | |
74 BUILD_CONST_KEY_MAP 2 | |
76 LOAD_CONST 3 (<code object find_tensor_attributes at 0x7f3f1005d030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 169>) | |
78 LOAD_CONST 4 ('get_parameter_dtype.<locals>.find_tensor_attributes') | |
80 MAKE_FUNCTION 4 (annotations) | |
82 STORE_FAST 3 (find_tensor_attributes) | |
173 84 LOAD_FAST 0 (parameter) | |
86 LOAD_ATTR 9 (_named_members) | |
88 LOAD_FAST 3 (find_tensor_attributes) | |
90 LOAD_CONST 5 (('get_members_fn',)) | |
92 CALL_FUNCTION_KW 1 | |
94 STORE_FAST 4 (gen) | |
174 96 LOAD_CONST 1 (None) | |
98 STORE_FAST 5 (last_tuple) | |
175 100 LOAD_FAST 4 (gen) | |
102 GET_ITER | |
>> 104 FOR_ITER 34 (to 140) | |
106 STORE_FAST 6 (tuple) | |
176 108 LOAD_FAST 6 (tuple) | |
110 STORE_FAST 5 (last_tuple) | |
177 112 LOAD_FAST 6 (tuple) | |
114 LOAD_CONST 6 (1) | |
116 BINARY_SUBSCR | |
118 LOAD_METHOD 2 (is_floating_point) | |
120 CALL_METHOD 0 | |
122 POP_JUMP_IF_FALSE 104 | |
178 124 LOAD_FAST 6 (tuple) | |
126 LOAD_CONST 6 (1) | |
128 BINARY_SUBSCR | |
130 LOAD_ATTR 1 (dtype) | |
132 ROT_TWO | |
134 POP_TOP | |
136 RETURN_VALUE | |
138 JUMP_ABSOLUTE 104 | |
181 >> 140 LOAD_FAST 5 (last_tuple) | |
142 LOAD_CONST 6 (1) | |
144 BINARY_SUBSCR | |
146 LOAD_ATTR 1 (dtype) | |
148 RETURN_VALUE | |
150 LOAD_CONST 1 (None) | |
152 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST last_dtype [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST parameter [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR parameters [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [UserMethodVariable(<function Module.parameters at 0x7f3f1c0fb0d0>, NNModuleVariable())] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE GET_ITER None [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 42 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST t [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST t [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST last_dtype [ListIteratorVariable(), TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST t [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_floating_point [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [ListIteratorVariable(), GetAttrVariable(TensorVariable(), is_floating_point)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 12 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST t [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE ROT_TWO None [ListIteratorVariable(), TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_TOP None [TorchVariable(torch.float32), ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object get_parameter_dtype at 0x7f3f1005d190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 153> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object dtype at 0x7f3f100879d0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 727> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dtype',) [GetAttrVariable(TensorVariable(), to), TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 1 [GetAttrVariable(TensorVariable(), to), TorchVariable(torch.float32), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST extended_attention_mask [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1.0 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBTRACT None [ConstantVariable(float), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -10000.0 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST extended_attention_mask [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object get_extended_attention_mask at 0x7f3f10087be0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 794> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST extended_attention_mask [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 590 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST encoder_extended_attention_mask [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR get_head_mask [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable())] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR num_layers [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType), HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object get_head_mask at 0x7f3f10087c90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 843> | |
861 0 LOAD_FAST 1 (head_mask) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 40 | |
862 8 LOAD_FAST 0 (self) | |
10 LOAD_METHOD 0 (_convert_head_mask_to_5d) | |
12 LOAD_FAST 1 (head_mask) | |
14 LOAD_FAST 2 (num_hidden_layers) | |
16 CALL_METHOD 2 | |
18 STORE_FAST 1 (head_mask) | |
863 20 LOAD_FAST 3 (is_attention_chunked) | |
22 LOAD_CONST 2 (True) | |
24 COMPARE_OP 8 (is) | |
26 POP_JUMP_IF_FALSE 50 | |
864 28 LOAD_FAST 1 (head_mask) | |
30 LOAD_METHOD 1 (unsqueeze) | |
32 LOAD_CONST 3 (-1) | |
34 CALL_METHOD 1 | |
36 STORE_FAST 1 (head_mask) | |
38 JUMP_FORWARD 10 (to 50) | |
866 >> 40 LOAD_CONST 1 (None) | |
42 BUILD_LIST 1 | |
44 LOAD_FAST 2 (num_hidden_layers) | |
46 BINARY_MULTIPLY | |
48 STORE_FAST 1 (head_mask) | |
868 >> 50 LOAD_FAST 1 (head_mask) | |
52 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 40 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_LIST 1 [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_hidden_layers [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST head_mask [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object get_head_mask at 0x7f3f10087c90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 843> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST head_mask [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR get_head_mask [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable())] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR num_layers [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType), HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [UserMethodVariable(<function ModuleUtilsMixin.get_head_mask at 0x7f3ef770e310>, NNModuleVariable()), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object get_head_mask at 0x7f3f10087c90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 843> | |
861 0 LOAD_FAST 1 (head_mask) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 40 | |
862 8 LOAD_FAST 0 (self) | |
10 LOAD_METHOD 0 (_convert_head_mask_to_5d) | |
12 LOAD_FAST 1 (head_mask) | |
14 LOAD_FAST 2 (num_hidden_layers) | |
16 CALL_METHOD 2 | |
18 STORE_FAST 1 (head_mask) | |
863 20 LOAD_FAST 3 (is_attention_chunked) | |
22 LOAD_CONST 2 (True) | |
24 COMPARE_OP 8 (is) | |
26 POP_JUMP_IF_FALSE 50 | |
864 28 LOAD_FAST 1 (head_mask) | |
30 LOAD_METHOD 1 (unsqueeze) | |
32 LOAD_CONST 3 (-1) | |
34 CALL_METHOD 1 | |
36 STORE_FAST 1 (head_mask) | |
38 JUMP_FORWARD 10 (to 50) | |
866 >> 40 LOAD_CONST 1 (None) | |
42 BUILD_LIST 1 | |
44 LOAD_FAST 2 (num_hidden_layers) | |
46 BINARY_MULTIPLY | |
48 STORE_FAST 1 (head_mask) | |
868 >> 50 LOAD_FAST 1 (head_mask) | |
52 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 40 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_LIST 1 [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_hidden_layers [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST head_mask [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object get_head_mask at 0x7f3f10087c90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/modeling_utils.py", line 843> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_head_mask [ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 636 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_states [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 650 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST all_hidden_states [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 664 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST all_attentions [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 686 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST all_cross_attentions [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST encoder_decoder_position_bias [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST inputs_embeds [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL enumerate [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL zip [BuiltinVariable(enumerate)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [BuiltinVariable(enumerate), BuiltinVariable(zip)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR block [BuiltinVariable(enumerate), BuiltinVariable(zip), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_values [BuiltinVariable(enumerate), BuiltinVariable(zip), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [BuiltinVariable(enumerate), BuiltinVariable(zip), NNModuleVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [BuiltinVariable(enumerate), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE GET_ITER None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST i [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_module [ListIteratorVariable(), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_value [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 930 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 946 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR gradient_checkpointing [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1028 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_module [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_hidden_states [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_decoder_position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 11 [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
647 0 LOAD_FAST 9 (past_key_value) | |
2 LOAD_CONST 0 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 122 | |
648 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (is_decoder) | |
12 POP_JUMP_IF_TRUE 24 | |
649 14 LOAD_GLOBAL 1 (logger) | |
16 LOAD_METHOD 2 (warning) | |
18 LOAD_CONST 1 ('`past_key_values` is passed to the encoder. Please make sure this is intended.') | |
20 CALL_METHOD 1 | |
22 POP_TOP | |
650 >> 24 LOAD_FAST 4 (encoder_hidden_states) | |
26 LOAD_CONST 0 (None) | |
28 COMPARE_OP 8 (is) | |
30 POP_JUMP_IF_FALSE 36 | |
32 LOAD_CONST 2 (2) | |
34 JUMP_FORWARD 2 (to 38) | |
>> 36 LOAD_CONST 3 (4) | |
>> 38 STORE_FAST 13 (expected_num_past_key_values) | |
652 40 LOAD_GLOBAL 3 (len) | |
42 LOAD_FAST 9 (past_key_value) | |
44 CALL_FUNCTION 1 | |
46 LOAD_FAST 13 (expected_num_past_key_values) | |
48 COMPARE_OP 3 (!=) | |
50 POP_JUMP_IF_FALSE 96 | |
653 52 LOAD_GLOBAL 4 (ValueError) | |
654 54 LOAD_CONST 4 ('There should be ') | |
56 LOAD_FAST 13 (expected_num_past_key_values) | |
58 FORMAT_VALUE 0 | |
60 LOAD_CONST 5 (' past states. ') | |
62 LOAD_FAST 13 (expected_num_past_key_values) | |
64 LOAD_CONST 3 (4) | |
66 COMPARE_OP 2 (==) | |
68 POP_JUMP_IF_FALSE 74 | |
70 LOAD_CONST 6 ('2 (past / key) for cross attention. ') | |
72 JUMP_FORWARD 2 (to 76) | |
>> 74 LOAD_CONST 7 ('') | |
>> 76 FORMAT_VALUE 0 | |
78 LOAD_CONST 8 ('Got ') | |
80 LOAD_GLOBAL 3 (len) | |
82 LOAD_FAST 9 (past_key_value) | |
84 CALL_FUNCTION 1 | |
86 FORMAT_VALUE 0 | |
88 LOAD_CONST 9 (' past key / value states') | |
90 BUILD_STRING 7 | |
653 92 CALL_FUNCTION 1 | |
94 RAISE_VARARGS 1 | |
659 >> 96 LOAD_FAST 9 (past_key_value) | |
98 LOAD_CONST 0 (None) | |
100 LOAD_CONST 2 (2) | |
102 BUILD_SLICE 2 | |
104 BINARY_SUBSCR | |
106 STORE_FAST 14 (self_attn_past_key_value) | |
660 108 LOAD_FAST 9 (past_key_value) | |
110 LOAD_CONST 2 (2) | |
112 LOAD_CONST 0 (None) | |
114 BUILD_SLICE 2 | |
116 BINARY_SUBSCR | |
118 STORE_FAST 15 (cross_attn_past_key_value) | |
120 JUMP_FORWARD 8 (to 130) | |
662 >> 122 LOAD_CONST 10 ((None, None)) | |
124 UNPACK_SEQUENCE 2 | |
126 STORE_FAST 14 (self_attn_past_key_value) | |
128 STORE_FAST 15 (cross_attn_past_key_value) | |
664 >> 130 LOAD_FAST 0 (self) | |
132 LOAD_ATTR 5 (layer) | |
134 LOAD_CONST 11 (0) | |
136 BINARY_SUBSCR | |
665 138 LOAD_FAST 1 (hidden_states) | |
666 140 LOAD_FAST 2 (attention_mask) | |
667 142 LOAD_FAST 3 (position_bias) | |
668 144 LOAD_FAST 7 (layer_head_mask) | |
669 146 LOAD_FAST 14 (self_attn_past_key_value) | |
670 148 LOAD_FAST 10 (use_cache) | |
671 150 LOAD_FAST 11 (output_attentions) | |
664 152 LOAD_CONST 12 (('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
154 CALL_FUNCTION_KW 7 | |
156 STORE_FAST 16 (self_attention_outputs) | |
673 158 LOAD_FAST 16 (self_attention_outputs) | |
160 LOAD_CONST 0 (None) | |
162 LOAD_CONST 2 (2) | |
164 BUILD_SLICE 2 | |
166 BINARY_SUBSCR | |
168 UNPACK_SEQUENCE 2 | |
170 STORE_FAST 1 (hidden_states) | |
172 STORE_FAST 17 (present_key_value_state) | |
674 174 LOAD_FAST 16 (self_attention_outputs) | |
176 LOAD_CONST 2 (2) | |
178 LOAD_CONST 0 (None) | |
180 BUILD_SLICE 2 | |
182 BINARY_SUBSCR | |
184 STORE_FAST 18 (attention_outputs) | |
677 186 LOAD_FAST 1 (hidden_states) | |
188 LOAD_ATTR 6 (dtype) | |
190 LOAD_GLOBAL 7 (torch) | |
192 LOAD_ATTR 8 (float16) | |
194 COMPARE_OP 2 (==) | |
196 POP_JUMP_IF_FALSE 248 | |
198 LOAD_GLOBAL 7 (torch) | |
200 LOAD_METHOD 9 (isinf) | |
202 LOAD_FAST 1 (hidden_states) | |
204 CALL_METHOD 1 | |
206 LOAD_METHOD 10 (any) | |
208 CALL_METHOD 0 | |
210 POP_JUMP_IF_FALSE 248 | |
678 212 LOAD_GLOBAL 7 (torch) | |
214 LOAD_METHOD 11 (finfo) | |
216 LOAD_FAST 1 (hidden_states) | |
218 LOAD_ATTR 6 (dtype) | |
220 CALL_METHOD 1 | |
222 LOAD_ATTR 12 (max) | |
224 LOAD_CONST 13 (1000) | |
226 BINARY_SUBTRACT | |
228 STORE_FAST 19 (clamp_value) | |
679 230 LOAD_GLOBAL 7 (torch) | |
232 LOAD_ATTR 13 (clamp) | |
234 LOAD_FAST 1 (hidden_states) | |
236 LOAD_FAST 19 (clamp_value) | |
238 UNARY_NEGATIVE | |
240 LOAD_FAST 19 (clamp_value) | |
242 LOAD_CONST 14 (('min', 'max')) | |
244 CALL_FUNCTION_KW 3 | |
246 STORE_FAST 1 (hidden_states) | |
681 >> 248 LOAD_FAST 0 (self) | |
250 LOAD_ATTR 0 (is_decoder) | |
252 EXTENDED_ARG 1 | |
254 JUMP_IF_FALSE_OR_POP 262 | |
256 LOAD_FAST 4 (encoder_hidden_states) | |
258 LOAD_CONST 0 (None) | |
260 COMPARE_OP 9 (is not) | |
>> 262 STORE_FAST 20 (do_cross_attention) | |
682 264 LOAD_FAST 20 (do_cross_attention) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_FALSE 444 | |
685 270 LOAD_FAST 17 (present_key_value_state) | |
272 LOAD_CONST 0 (None) | |
274 COMPARE_OP 9 (is not) | |
276 EXTENDED_ARG 1 | |
278 POP_JUMP_IF_FALSE 296 | |
686 280 LOAD_FAST 17 (present_key_value_state) | |
282 LOAD_CONST 11 (0) | |
284 BINARY_SUBSCR | |
286 LOAD_ATTR 14 (shape) | |
288 LOAD_CONST 2 (2) | |
290 BINARY_SUBSCR | |
292 STORE_FAST 21 (query_length) | |
294 JUMP_FORWARD 4 (to 300) | |
688 >> 296 LOAD_CONST 0 (None) | |
298 STORE_FAST 21 (query_length) | |
690 >> 300 LOAD_FAST 0 (self) | |
302 LOAD_ATTR 5 (layer) | |
304 LOAD_CONST 15 (1) | |
306 BINARY_SUBSCR | |
691 308 LOAD_FAST 1 (hidden_states) | |
692 310 LOAD_FAST 4 (encoder_hidden_states) | |
693 312 LOAD_FAST 5 (encoder_attention_mask) | |
694 314 LOAD_FAST 6 (encoder_decoder_position_bias) | |
695 316 LOAD_FAST 8 (cross_attn_layer_head_mask) | |
696 318 LOAD_FAST 15 (cross_attn_past_key_value) | |
697 320 LOAD_FAST 21 (query_length) | |
698 322 LOAD_FAST 10 (use_cache) | |
699 324 LOAD_FAST 11 (output_attentions) | |
690 326 LOAD_CONST 16 (('key_value_states', 'attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'query_length', 'use_cache', 'output_attentions')) | |
328 CALL_FUNCTION_KW 9 | |
330 STORE_FAST 22 (cross_attention_outputs) | |
701 332 LOAD_FAST 22 (cross_attention_outputs) | |
334 LOAD_CONST 11 (0) | |
336 BINARY_SUBSCR | |
338 STORE_FAST 1 (hidden_states) | |
704 340 LOAD_FAST 1 (hidden_states) | |
342 LOAD_ATTR 6 (dtype) | |
344 LOAD_GLOBAL 7 (torch) | |
346 LOAD_ATTR 8 (float16) | |
348 COMPARE_OP 2 (==) | |
350 EXTENDED_ARG 1 | |
352 POP_JUMP_IF_FALSE 406 | |
354 LOAD_GLOBAL 7 (torch) | |
356 LOAD_METHOD 9 (isinf) | |
358 LOAD_FAST 1 (hidden_states) | |
360 CALL_METHOD 1 | |
362 LOAD_METHOD 10 (any) | |
364 CALL_METHOD 0 | |
366 EXTENDED_ARG 1 | |
368 POP_JUMP_IF_FALSE 406 | |
705 370 LOAD_GLOBAL 7 (torch) | |
372 LOAD_METHOD 11 (finfo) | |
374 LOAD_FAST 1 (hidden_states) | |
376 LOAD_ATTR 6 (dtype) | |
378 CALL_METHOD 1 | |
380 LOAD_ATTR 12 (max) | |
382 LOAD_CONST 13 (1000) | |
384 BINARY_SUBTRACT | |
386 STORE_FAST 19 (clamp_value) | |
706 388 LOAD_GLOBAL 7 (torch) | |
390 LOAD_ATTR 13 (clamp) | |
392 LOAD_FAST 1 (hidden_states) | |
394 LOAD_FAST 19 (clamp_value) | |
396 UNARY_NEGATIVE | |
398 LOAD_FAST 19 (clamp_value) | |
400 LOAD_CONST 14 (('min', 'max')) | |
402 CALL_FUNCTION_KW 3 | |
404 STORE_FAST 1 (hidden_states) | |
709 >> 406 LOAD_FAST 17 (present_key_value_state) | |
408 LOAD_CONST 0 (None) | |
410 COMPARE_OP 9 (is not) | |
412 EXTENDED_ARG 1 | |
414 POP_JUMP_IF_FALSE 428 | |
710 416 LOAD_FAST 17 (present_key_value_state) | |
418 LOAD_FAST 22 (cross_attention_outputs) | |
420 LOAD_CONST 15 (1) | |
422 BINARY_SUBSCR | |
424 BINARY_ADD | |
426 STORE_FAST 17 (present_key_value_state) | |
713 >> 428 LOAD_FAST 18 (attention_outputs) | |
430 LOAD_FAST 22 (cross_attention_outputs) | |
432 LOAD_CONST 2 (2) | |
434 LOAD_CONST 0 (None) | |
436 BUILD_SLICE 2 | |
438 BINARY_SUBSCR | |
440 BINARY_ADD | |
442 STORE_FAST 18 (attention_outputs) | |
716 >> 444 LOAD_FAST 0 (self) | |
446 LOAD_ATTR 5 (layer) | |
448 LOAD_CONST 17 (-1) | |
450 BINARY_SUBSCR | |
452 LOAD_FAST 1 (hidden_states) | |
454 CALL_FUNCTION 1 | |
456 STORE_FAST 1 (hidden_states) | |
719 458 LOAD_FAST 1 (hidden_states) | |
460 LOAD_ATTR 6 (dtype) | |
462 LOAD_GLOBAL 7 (torch) | |
464 LOAD_ATTR 8 (float16) | |
466 COMPARE_OP 2 (==) | |
468 EXTENDED_ARG 2 | |
470 POP_JUMP_IF_FALSE 524 | |
472 LOAD_GLOBAL 7 (torch) | |
474 LOAD_METHOD 9 (isinf) | |
476 LOAD_FAST 1 (hidden_states) | |
478 CALL_METHOD 1 | |
480 LOAD_METHOD 10 (any) | |
482 CALL_METHOD 0 | |
484 EXTENDED_ARG 2 | |
486 POP_JUMP_IF_FALSE 524 | |
720 488 LOAD_GLOBAL 7 (torch) | |
490 LOAD_METHOD 11 (finfo) | |
492 LOAD_FAST 1 (hidden_states) | |
494 LOAD_ATTR 6 (dtype) | |
496 CALL_METHOD 1 | |
498 LOAD_ATTR 12 (max) | |
500 LOAD_CONST 13 (1000) | |
502 BINARY_SUBTRACT | |
504 STORE_FAST 19 (clamp_value) | |
721 506 LOAD_GLOBAL 7 (torch) | |
508 LOAD_ATTR 13 (clamp) | |
510 LOAD_FAST 1 (hidden_states) | |
512 LOAD_FAST 19 (clamp_value) | |
514 UNARY_NEGATIVE | |
516 LOAD_FAST 19 (clamp_value) | |
518 LOAD_CONST 14 (('min', 'max')) | |
520 CALL_FUNCTION_KW 3 | |
522 STORE_FAST 1 (hidden_states) | |
723 >> 524 LOAD_FAST 1 (hidden_states) | |
526 BUILD_TUPLE 1 | |
528 STORE_FAST 23 (outputs) | |
725 530 LOAD_FAST 10 (use_cache) | |
532 EXTENDED_ARG 2 | |
534 POP_JUMP_IF_FALSE 552 | |
726 536 LOAD_FAST 23 (outputs) | |
538 LOAD_FAST 17 (present_key_value_state) | |
540 BUILD_TUPLE 1 | |
542 BINARY_ADD | |
544 LOAD_FAST 18 (attention_outputs) | |
546 BINARY_ADD | |
548 STORE_FAST 23 (outputs) | |
550 JUMP_FORWARD 8 (to 560) | |
728 >> 552 LOAD_FAST 23 (outputs) | |
554 LOAD_FAST 18 (attention_outputs) | |
556 BINARY_ADD | |
558 STORE_FAST 23 (outputs) | |
730 >> 560 LOAD_FAST 23 (outputs) | |
562 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None, None) [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attn_past_key_value [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_past_key_value [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attn_past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
569 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 8 (normed_hidden_states) | |
570 10 LOAD_FAST 0 (self) | |
12 LOAD_ATTR 1 (SelfAttention) | |
571 14 LOAD_FAST 8 (normed_hidden_states) | |
572 16 LOAD_FAST 2 (attention_mask) | |
573 18 LOAD_FAST 3 (position_bias) | |
574 20 LOAD_FAST 4 (layer_head_mask) | |
575 22 LOAD_FAST 5 (past_key_value) | |
576 24 LOAD_FAST 6 (use_cache) | |
577 26 LOAD_FAST 7 (output_attentions) | |
570 28 LOAD_CONST 1 (('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
30 CALL_FUNCTION_KW 7 | |
32 STORE_FAST 9 (attention_output) | |
579 34 LOAD_FAST 1 (hidden_states) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_METHOD 2 (dropout) | |
40 LOAD_FAST 9 (attention_output) | |
42 LOAD_CONST 2 (0) | |
44 BINARY_SUBSCR | |
46 CALL_METHOD 1 | |
48 BINARY_ADD | |
50 STORE_FAST 1 (hidden_states) | |
580 52 LOAD_FAST 1 (hidden_states) | |
54 BUILD_TUPLE 1 | |
56 LOAD_FAST 9 (attention_output) | |
58 LOAD_CONST 3 (1) | |
60 LOAD_CONST 0 (None) | |
62 BUILD_SLICE 2 | |
64 BINARY_SUBSCR | |
66 BINARY_ADD | |
68 STORE_FAST 10 (outputs) | |
581 70 LOAD_FAST 10 (outputs) | |
72 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST normed_hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR SelfAttention [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST normed_hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
454 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_ATTR 0 (shape) | |
4 LOAD_CONST 1 (None) | |
6 LOAD_CONST 2 (2) | |
8 BUILD_SLICE 2 | |
10 BINARY_SUBSCR | |
12 UNPACK_SEQUENCE 2 | |
14 STORE_DEREF 0 (batch_size) | |
16 STORE_FAST 10 (seq_length) | |
456 18 LOAD_FAST 10 (seq_length) | |
20 STORE_FAST 11 (real_seq_length) | |
458 22 LOAD_FAST 5 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 9 (is not) | |
28 POP_JUMP_IF_FALSE 92 | |
460 30 LOAD_GLOBAL 1 (len) | |
32 LOAD_FAST 5 (past_key_value) | |
34 CALL_FUNCTION 1 | |
36 LOAD_CONST 2 (2) | |
38 COMPARE_OP 2 (==) | |
459 40 POP_JUMP_IF_TRUE 62 | |
42 LOAD_GLOBAL 2 (AssertionError) | |
461 44 LOAD_CONST 3 ('past_key_value should have 2 past states: keys and values. Got ') | |
46 LOAD_GLOBAL 1 (len) | |
48 LOAD_FAST 5 (past_key_value) | |
50 CALL_FUNCTION 1 | |
52 FORMAT_VALUE 0 | |
54 LOAD_CONST 4 (' past states') | |
56 BUILD_STRING 3 | |
459 58 CALL_FUNCTION 1 | |
60 RAISE_VARARGS 1 | |
462 >> 62 LOAD_FAST 11 (real_seq_length) | |
64 LOAD_FAST 7 (query_length) | |
66 LOAD_CONST 1 (None) | |
68 COMPARE_OP 8 (is) | |
70 POP_JUMP_IF_FALSE 86 | |
72 LOAD_FAST 5 (past_key_value) | |
74 LOAD_CONST 5 (0) | |
76 BINARY_SUBSCR | |
78 LOAD_ATTR 0 (shape) | |
80 LOAD_CONST 2 (2) | |
82 BINARY_SUBSCR | |
84 JUMP_FORWARD 2 (to 88) | |
>> 86 LOAD_FAST 7 (query_length) | |
>> 88 INPLACE_ADD | |
90 STORE_FAST 11 (real_seq_length) | |
464 >> 92 LOAD_FAST 3 (key_value_states) | |
94 LOAD_CONST 1 (None) | |
96 COMPARE_OP 8 (is) | |
98 POP_JUMP_IF_FALSE 104 | |
100 LOAD_FAST 11 (real_seq_length) | |
102 JUMP_FORWARD 8 (to 112) | |
>> 104 LOAD_FAST 3 (key_value_states) | |
106 LOAD_ATTR 0 (shape) | |
108 LOAD_CONST 6 (1) | |
110 BINARY_SUBSCR | |
>> 112 STORE_FAST 12 (key_length) | |
466 114 LOAD_CLOSURE 0 (batch_size) | |
116 LOAD_CLOSURE 1 (self) | |
118 BUILD_TUPLE 2 | |
120 LOAD_CONST 7 (<code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466>) | |
122 LOAD_CONST 8 ('T5Attention.forward.<locals>.shape') | |
124 MAKE_FUNCTION 8 (closure) | |
126 STORE_DEREF 2 (shape) | |
470 128 LOAD_CLOSURE 0 (batch_size) | |
130 LOAD_CLOSURE 1 (self) | |
132 BUILD_TUPLE 2 | |
134 LOAD_CONST 9 (<code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470>) | |
136 LOAD_CONST 10 ('T5Attention.forward.<locals>.unshape') | |
138 MAKE_FUNCTION 8 (closure) | |
140 STORE_FAST 13 (unshape) | |
474 142 LOAD_CLOSURE 2 (shape) | |
144 BUILD_TUPLE 1 | |
146 LOAD_CONST 11 (<code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474>) | |
148 LOAD_CONST 12 ('T5Attention.forward.<locals>.project') | |
150 MAKE_FUNCTION 8 (closure) | |
152 STORE_FAST 14 (project) | |
496 154 LOAD_DEREF 2 (shape) | |
156 LOAD_DEREF 1 (self) | |
158 LOAD_METHOD 3 (q) | |
160 LOAD_FAST 1 (hidden_states) | |
162 CALL_METHOD 1 | |
164 CALL_FUNCTION 1 | |
166 STORE_FAST 15 (query_states) | |
499 168 LOAD_FAST 14 (project) | |
500 170 LOAD_FAST 1 (hidden_states) | |
172 LOAD_DEREF 1 (self) | |
174 LOAD_ATTR 4 (k) | |
176 LOAD_FAST 3 (key_value_states) | |
178 LOAD_FAST 5 (past_key_value) | |
180 LOAD_CONST 1 (None) | |
182 COMPARE_OP 9 (is not) | |
184 POP_JUMP_IF_FALSE 194 | |
186 LOAD_FAST 5 (past_key_value) | |
188 LOAD_CONST 5 (0) | |
190 BINARY_SUBSCR | |
192 JUMP_FORWARD 2 (to 196) | |
>> 194 LOAD_CONST 1 (None) | |
499 >> 196 CALL_FUNCTION 4 | |
198 STORE_FAST 16 (key_states) | |
502 200 LOAD_FAST 14 (project) | |
503 202 LOAD_FAST 1 (hidden_states) | |
204 LOAD_DEREF 1 (self) | |
206 LOAD_ATTR 5 (v) | |
208 LOAD_FAST 3 (key_value_states) | |
210 LOAD_FAST 5 (past_key_value) | |
212 LOAD_CONST 1 (None) | |
214 COMPARE_OP 9 (is not) | |
216 POP_JUMP_IF_FALSE 226 | |
218 LOAD_FAST 5 (past_key_value) | |
220 LOAD_CONST 6 (1) | |
222 BINARY_SUBSCR | |
224 JUMP_FORWARD 2 (to 228) | |
>> 226 LOAD_CONST 1 (None) | |
502 >> 228 CALL_FUNCTION 4 | |
230 STORE_FAST 17 (value_states) | |
507 232 LOAD_GLOBAL 6 (torch) | |
234 LOAD_METHOD 7 (matmul) | |
508 236 LOAD_FAST 15 (query_states) | |
238 LOAD_FAST 16 (key_states) | |
240 LOAD_METHOD 8 (transpose) | |
242 LOAD_CONST 13 (3) | |
244 LOAD_CONST 2 (2) | |
246 CALL_METHOD 2 | |
507 248 CALL_METHOD 2 | |
250 STORE_FAST 18 (scores) | |
511 252 LOAD_FAST 4 (position_bias) | |
254 LOAD_CONST 1 (None) | |
256 COMPARE_OP 8 (is) | |
258 EXTENDED_ARG 1 | |
260 POP_JUMP_IF_FALSE 410 | |
512 262 LOAD_DEREF 1 (self) | |
264 LOAD_ATTR 9 (has_relative_attention_bias) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_TRUE 324 | |
513 270 LOAD_GLOBAL 6 (torch) | |
272 LOAD_ATTR 10 (zeros) | |
514 274 LOAD_CONST 6 (1) | |
276 LOAD_DEREF 1 (self) | |
278 LOAD_ATTR 11 (n_heads) | |
280 LOAD_FAST 11 (real_seq_length) | |
282 LOAD_FAST 12 (key_length) | |
284 BUILD_TUPLE 4 | |
286 LOAD_FAST 18 (scores) | |
288 LOAD_ATTR 12 (device) | |
290 LOAD_FAST 18 (scores) | |
292 LOAD_ATTR 13 (dtype) | |
513 294 LOAD_CONST 14 (('device', 'dtype')) | |
296 CALL_FUNCTION_KW 3 | |
298 STORE_FAST 4 (position_bias) | |
516 300 LOAD_DEREF 1 (self) | |
302 LOAD_ATTR 14 (gradient_checkpointing) | |
304 EXTENDED_ARG 1 | |
306 POP_JUMP_IF_FALSE 342 | |
308 LOAD_DEREF 1 (self) | |
310 LOAD_ATTR 15 (training) | |
312 EXTENDED_ARG 1 | |
314 POP_JUMP_IF_FALSE 342 | |
517 316 LOAD_CONST 15 (True) | |
318 LOAD_FAST 4 (position_bias) | |
320 STORE_ATTR 16 (requires_grad) | |
322 JUMP_FORWARD 18 (to 342) | |
519 >> 324 LOAD_DEREF 1 (self) | |
326 LOAD_ATTR 17 (compute_bias) | |
328 LOAD_FAST 11 (real_seq_length) | |
330 LOAD_FAST 12 (key_length) | |
332 LOAD_FAST 18 (scores) | |
334 LOAD_ATTR 12 (device) | |
336 LOAD_CONST 16 (('device',)) | |
338 CALL_FUNCTION_KW 3 | |
340 STORE_FAST 4 (position_bias) | |
523 >> 342 LOAD_FAST 5 (past_key_value) | |
344 LOAD_CONST 1 (None) | |
346 COMPARE_OP 9 (is not) | |
348 EXTENDED_ARG 1 | |
350 POP_JUMP_IF_FALSE 392 | |
524 352 LOAD_FAST 4 (position_bias) | |
354 LOAD_CONST 1 (None) | |
356 LOAD_CONST 1 (None) | |
358 BUILD_SLICE 2 | |
360 LOAD_CONST 1 (None) | |
362 LOAD_CONST 1 (None) | |
364 BUILD_SLICE 2 | |
366 LOAD_FAST 1 (hidden_states) | |
368 LOAD_METHOD 18 (size) | |
370 LOAD_CONST 6 (1) | |
372 CALL_METHOD 1 | |
374 UNARY_NEGATIVE | |
376 LOAD_CONST 1 (None) | |
378 BUILD_SLICE 2 | |
380 LOAD_CONST 1 (None) | |
382 LOAD_CONST 1 (None) | |
384 BUILD_SLICE 2 | |
386 BUILD_TUPLE 4 | |
388 BINARY_SUBSCR | |
390 STORE_FAST 4 (position_bias) | |
526 >> 392 LOAD_FAST 2 (mask) | |
394 LOAD_CONST 1 (None) | |
396 COMPARE_OP 9 (is not) | |
398 EXTENDED_ARG 1 | |
400 POP_JUMP_IF_FALSE 410 | |
527 402 LOAD_FAST 4 (position_bias) | |
404 LOAD_FAST 2 (mask) | |
406 BINARY_ADD | |
408 STORE_FAST 4 (position_bias) | |
529 >> 410 LOAD_FAST 18 (scores) | |
412 LOAD_FAST 4 (position_bias) | |
414 INPLACE_ADD | |
416 STORE_FAST 18 (scores) | |
530 418 LOAD_GLOBAL 19 (nn) | |
420 LOAD_ATTR 20 (functional) | |
422 LOAD_ATTR 21 (softmax) | |
424 LOAD_FAST 18 (scores) | |
426 LOAD_METHOD 22 (float) | |
428 CALL_METHOD 0 | |
430 LOAD_CONST 17 (-1) | |
432 LOAD_CONST 18 (('dim',)) | |
434 CALL_FUNCTION_KW 2 | |
436 LOAD_METHOD 23 (type_as) | |
531 438 LOAD_FAST 18 (scores) | |
530 440 CALL_METHOD 1 | |
442 STORE_FAST 19 (attn_weights) | |
533 444 LOAD_GLOBAL 19 (nn) | |
446 LOAD_ATTR 20 (functional) | |
448 LOAD_ATTR 24 (dropout) | |
534 450 LOAD_FAST 19 (attn_weights) | |
452 LOAD_DEREF 1 (self) | |
454 LOAD_ATTR 24 (dropout) | |
456 LOAD_DEREF 1 (self) | |
458 LOAD_ATTR 15 (training) | |
533 460 LOAD_CONST 19 (('p', 'training')) | |
462 CALL_FUNCTION_KW 3 | |
464 STORE_FAST 19 (attn_weights) | |
538 466 LOAD_FAST 6 (layer_head_mask) | |
468 LOAD_CONST 1 (None) | |
470 COMPARE_OP 9 (is not) | |
472 EXTENDED_ARG 1 | |
474 POP_JUMP_IF_FALSE 484 | |
539 476 LOAD_FAST 19 (attn_weights) | |
478 LOAD_FAST 6 (layer_head_mask) | |
480 BINARY_MULTIPLY | |
482 STORE_FAST 19 (attn_weights) | |
541 >> 484 LOAD_FAST 13 (unshape) | |
486 LOAD_GLOBAL 6 (torch) | |
488 LOAD_METHOD 7 (matmul) | |
490 LOAD_FAST 19 (attn_weights) | |
492 LOAD_FAST 17 (value_states) | |
494 CALL_METHOD 2 | |
496 CALL_FUNCTION 1 | |
498 STORE_FAST 20 (attn_output) | |
542 500 LOAD_DEREF 1 (self) | |
502 LOAD_METHOD 25 (o) | |
504 LOAD_FAST 20 (attn_output) | |
506 CALL_METHOD 1 | |
508 STORE_FAST 20 (attn_output) | |
544 510 LOAD_DEREF 1 (self) | |
512 LOAD_ATTR 26 (is_decoder) | |
514 EXTENDED_ARG 2 | |
516 POP_JUMP_IF_FALSE 532 | |
518 LOAD_FAST 8 (use_cache) | |
520 EXTENDED_ARG 2 | |
522 POP_JUMP_IF_FALSE 532 | |
524 LOAD_FAST 16 (key_states) | |
526 LOAD_FAST 17 (value_states) | |
528 BUILD_TUPLE 2 | |
530 JUMP_FORWARD 2 (to 534) | |
>> 532 LOAD_CONST 1 (None) | |
>> 534 STORE_FAST 21 (present_key_value_state) | |
545 536 LOAD_FAST 20 (attn_output) | |
538 BUILD_TUPLE 1 | |
540 LOAD_FAST 21 (present_key_value_state) | |
542 BUILD_TUPLE 1 | |
544 BINARY_ADD | |
546 LOAD_FAST 4 (position_bias) | |
548 BUILD_TUPLE 1 | |
550 BINARY_ADD | |
552 STORE_FAST 22 (outputs) | |
547 554 LOAD_FAST 9 (output_attentions) | |
556 EXTENDED_ARG 2 | |
558 POP_JUMP_IF_FALSE 570 | |
548 560 LOAD_FAST 22 (outputs) | |
562 LOAD_FAST 19 (attn_weights) | |
564 BUILD_TUPLE 1 | |
566 BINARY_ADD | |
568 STORE_FAST 22 (outputs) | |
549 >> 570 LOAD_FAST 22 (outputs) | |
572 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ShapeVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ShapeVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ShapeVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST real_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 92 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 104 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 112 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.shape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF shape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.unshape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST unshape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.project [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST project [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR q [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST query_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR k [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 194 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR v [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 226 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST value_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR has_relative_attention_bias [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_TRUE 324 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR compute_bias [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [UserMethodVariable(<function T5Attention.compute_bias at 0x7f3ef771c8b0>, NNModuleVariable())] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_length [UserMethodVariable(<function T5Attention.compute_bias at 0x7f3ef771c8b0>, NNModuleVariable()), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [UserMethodVariable(<function T5Attention.compute_bias at 0x7f3ef771c8b0>, NNModuleVariable()), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR device [UserMethodVariable(<function T5Attention.compute_bias at 0x7f3ef771c8b0>, NNModuleVariable()), ConstantVariable(int), ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('device',) [UserMethodVariable(<function T5Attention.compute_bias at 0x7f3ef771c8b0>, NNModuleVariable()), ConstantVariable(int), ConstantVariable(int), TorchVariable(cuda:0)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [UserMethodVariable(<function T5Attention.compute_bias at 0x7f3ef771c8b0>, NNModuleVariable()), ConstantVariable(int), ConstantVariable(int), TorchVariable(cuda:0), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object compute_bias at 0x7f3f100aeb30, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 419> | |
421 0 LOAD_FAST 3 (device) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 18 | |
422 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (relative_attention_bias) | |
12 LOAD_ATTR 1 (weight) | |
14 LOAD_ATTR 2 (device) | |
16 STORE_FAST 3 (device) | |
423 >> 18 LOAD_GLOBAL 3 (torch) | |
20 LOAD_ATTR 4 (arange) | |
22 LOAD_FAST 1 (query_length) | |
24 LOAD_GLOBAL 3 (torch) | |
26 LOAD_ATTR 5 (long) | |
28 LOAD_FAST 3 (device) | |
30 LOAD_CONST 2 (('dtype', 'device')) | |
32 CALL_FUNCTION_KW 3 | |
34 LOAD_CONST 1 (None) | |
36 LOAD_CONST 1 (None) | |
38 BUILD_SLICE 2 | |
40 LOAD_CONST 1 (None) | |
42 BUILD_TUPLE 2 | |
44 BINARY_SUBSCR | |
46 STORE_FAST 4 (context_position) | |
424 48 LOAD_GLOBAL 3 (torch) | |
50 LOAD_ATTR 4 (arange) | |
52 LOAD_FAST 2 (key_length) | |
54 LOAD_GLOBAL 3 (torch) | |
56 LOAD_ATTR 5 (long) | |
58 LOAD_FAST 3 (device) | |
60 LOAD_CONST 2 (('dtype', 'device')) | |
62 CALL_FUNCTION_KW 3 | |
64 LOAD_CONST 1 (None) | |
66 LOAD_CONST 1 (None) | |
68 LOAD_CONST 1 (None) | |
70 BUILD_SLICE 2 | |
72 BUILD_TUPLE 2 | |
74 BINARY_SUBSCR | |
76 STORE_FAST 5 (memory_position) | |
425 78 LOAD_FAST 5 (memory_position) | |
80 LOAD_FAST 4 (context_position) | |
82 BINARY_SUBTRACT | |
84 STORE_FAST 6 (relative_position) | |
426 86 LOAD_FAST 0 (self) | |
88 LOAD_ATTR 6 (_relative_position_bucket) | |
427 90 LOAD_FAST 6 (relative_position) | |
428 92 LOAD_FAST 0 (self) | |
94 LOAD_ATTR 7 (is_decoder) | |
96 UNARY_NOT | |
429 98 LOAD_FAST 0 (self) | |
100 LOAD_ATTR 8 (relative_attention_num_buckets) | |
430 102 LOAD_FAST 0 (self) | |
104 LOAD_ATTR 9 (relative_attention_max_distance) | |
426 106 LOAD_CONST 3 (('bidirectional', 'num_buckets', 'max_distance')) | |
108 CALL_FUNCTION_KW 4 | |
110 STORE_FAST 7 (relative_position_bucket) | |
432 112 LOAD_FAST 0 (self) | |
114 LOAD_METHOD 0 (relative_attention_bias) | |
116 LOAD_FAST 7 (relative_position_bucket) | |
118 CALL_METHOD 1 | |
120 STORE_FAST 8 (values) | |
433 122 LOAD_FAST 8 (values) | |
124 LOAD_METHOD 10 (permute) | |
126 LOAD_CONST 4 (2) | |
128 LOAD_CONST 5 (0) | |
130 LOAD_CONST 6 (1) | |
132 BUILD_LIST 3 | |
134 CALL_METHOD 1 | |
136 LOAD_METHOD 11 (unsqueeze) | |
138 LOAD_CONST 5 (0) | |
140 CALL_METHOD 1 | |
142 STORE_FAST 8 (values) | |
434 144 LOAD_FAST 8 (values) | |
146 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST device [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TorchVariable(cuda:0)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [TorchVariable(cuda:0), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 18 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR arange [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_length [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR long [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST device [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dtype', 'device') [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(dtype), TorchVariable(cuda:0)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(dtype), TorchVariable(cuda:0), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TensorVariable(), SliceVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST context_position [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR arange [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_length [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR long [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST device [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dtype', 'device') [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(dtype), TorchVariable(cuda:0)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<built-in method arange of type object at 0x7f40b0102e20>), ConstantVariable(int), ConstantVariable(dtype), TorchVariable(cuda:0), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TensorVariable(), ConstantVariable(NoneType), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST memory_position [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST memory_position [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST context_position [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBTRACT None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_position [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR _relative_position_bucket [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position [UserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [UserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [UserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNARY_NOT None [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR relative_attention_num_buckets [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR relative_attention_max_distance [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('bidirectional', 'num_buckets', 'max_distance') [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 4 [UserFunctionVariable(), TensorVariable(), ConstantVariable(bool), ConstantVariable(int), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object _relative_position_bucket at 0x7f3f100aea80, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 371> | |
393 0 LOAD_CONST 1 (0) | |
2 STORE_FAST 4 (relative_buckets) | |
394 4 LOAD_FAST 1 (bidirectional) | |
6 POP_JUMP_IF_FALSE 52 | |
395 8 LOAD_FAST 2 (num_buckets) | |
10 LOAD_CONST 2 (2) | |
12 INPLACE_FLOOR_DIVIDE | |
14 STORE_FAST 2 (num_buckets) | |
396 16 LOAD_FAST 4 (relative_buckets) | |
18 LOAD_FAST 0 (relative_position) | |
20 LOAD_CONST 1 (0) | |
22 COMPARE_OP 4 (>) | |
24 LOAD_METHOD 0 (to) | |
26 LOAD_GLOBAL 1 (torch) | |
28 LOAD_ATTR 2 (long) | |
30 CALL_METHOD 1 | |
32 LOAD_FAST 2 (num_buckets) | |
34 BINARY_MULTIPLY | |
36 INPLACE_ADD | |
38 STORE_FAST 4 (relative_buckets) | |
397 40 LOAD_GLOBAL 1 (torch) | |
42 LOAD_METHOD 3 (abs) | |
44 LOAD_FAST 0 (relative_position) | |
46 CALL_METHOD 1 | |
48 STORE_FAST 0 (relative_position) | |
50 JUMP_FORWARD 20 (to 72) | |
399 >> 52 LOAD_GLOBAL 1 (torch) | |
54 LOAD_METHOD 4 (min) | |
56 LOAD_FAST 0 (relative_position) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_METHOD 5 (zeros_like) | |
62 LOAD_FAST 0 (relative_position) | |
64 CALL_METHOD 1 | |
66 CALL_METHOD 2 | |
68 UNARY_NEGATIVE | |
70 STORE_FAST 0 (relative_position) | |
403 >> 72 LOAD_FAST 2 (num_buckets) | |
74 LOAD_CONST 2 (2) | |
76 BINARY_FLOOR_DIVIDE | |
78 STORE_FAST 5 (max_exact) | |
404 80 LOAD_FAST 0 (relative_position) | |
82 LOAD_FAST 5 (max_exact) | |
84 COMPARE_OP 0 (<) | |
86 STORE_FAST 6 (is_small) | |
407 88 LOAD_FAST 5 (max_exact) | |
408 90 LOAD_GLOBAL 1 (torch) | |
92 LOAD_METHOD 6 (log) | |
94 LOAD_FAST 0 (relative_position) | |
96 LOAD_METHOD 7 (float) | |
98 CALL_METHOD 0 | |
100 LOAD_FAST 5 (max_exact) | |
102 BINARY_TRUE_DIVIDE | |
104 CALL_METHOD 1 | |
409 106 LOAD_GLOBAL 8 (math) | |
108 LOAD_METHOD 6 (log) | |
110 LOAD_FAST 3 (max_distance) | |
112 LOAD_FAST 5 (max_exact) | |
114 BINARY_TRUE_DIVIDE | |
116 CALL_METHOD 1 | |
408 118 BINARY_TRUE_DIVIDE | |
410 120 LOAD_FAST 2 (num_buckets) | |
122 LOAD_FAST 5 (max_exact) | |
124 BINARY_SUBTRACT | |
408 126 BINARY_MULTIPLY | |
407 128 LOAD_METHOD 0 (to) | |
411 130 LOAD_GLOBAL 1 (torch) | |
132 LOAD_ATTR 2 (long) | |
407 134 CALL_METHOD 1 | |
136 BINARY_ADD | |
138 STORE_FAST 7 (relative_position_if_large) | |
412 140 LOAD_GLOBAL 1 (torch) | |
142 LOAD_METHOD 4 (min) | |
413 144 LOAD_FAST 7 (relative_position_if_large) | |
146 LOAD_GLOBAL 1 (torch) | |
148 LOAD_METHOD 9 (full_like) | |
150 LOAD_FAST 7 (relative_position_if_large) | |
152 LOAD_FAST 2 (num_buckets) | |
154 LOAD_CONST 3 (1) | |
156 BINARY_SUBTRACT | |
158 CALL_METHOD 2 | |
412 160 CALL_METHOD 2 | |
162 STORE_FAST 7 (relative_position_if_large) | |
416 164 LOAD_FAST 4 (relative_buckets) | |
166 LOAD_GLOBAL 1 (torch) | |
168 LOAD_METHOD 10 (where) | |
170 LOAD_FAST 6 (is_small) | |
172 LOAD_FAST 0 (relative_position) | |
174 LOAD_FAST 7 (relative_position_if_large) | |
176 CALL_METHOD 3 | |
178 INPLACE_ADD | |
180 STORE_FAST 4 (relative_buckets) | |
417 182 LOAD_FAST 4 (relative_buckets) | |
184 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_buckets [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST bidirectional [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 52 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_buckets [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_FLOOR_DIVIDE None [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST num_buckets [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_buckets [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP > [ConstantVariable(int), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [ConstantVariable(int), GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR long [ConstantVariable(int), GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [ConstantVariable(int), GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_buckets [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [ConstantVariable(int), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_buckets [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR abs [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position [TorchVariable(<built-in method abs of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TorchVariable(<built-in method abs of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_position [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 72 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_buckets [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_FLOOR_DIVIDE None [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST max_exact [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST max_exact [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP < [TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST is_small [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST max_exact [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR log [ConstantVariable(int), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position [ConstantVariable(int), TorchVariable(<built-in method log of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [ConstantVariable(int), TorchVariable(<built-in method log of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [ConstantVariable(int), TorchVariable(<built-in method log of type object at 0x7f40b0102e20>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST max_exact [ConstantVariable(int), TorchVariable(<built-in method log of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_TRUE_DIVIDE None [ConstantVariable(int), TorchVariable(<built-in method log of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [ConstantVariable(int), TorchVariable(<built-in method log of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL math [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR log [ConstantVariable(int), TensorVariable(), TorchVariable(<module 'math' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/lib-dynload/math.cpython-38-x86_64-linux-gnu.so'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST max_distance [ConstantVariable(int), TensorVariable(), TorchVariable(<built-in function log>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST max_exact [ConstantVariable(int), TensorVariable(), TorchVariable(<built-in function log>), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_TRUE_DIVIDE None [ConstantVariable(int), TensorVariable(), TorchVariable(<built-in function log>), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [ConstantVariable(int), TensorVariable(), TorchVariable(<built-in function log>), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_TRUE_DIVIDE None [ConstantVariable(int), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_buckets [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST max_exact [ConstantVariable(int), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBTRACT None [ConstantVariable(int), TensorVariable(), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [ConstantVariable(int), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [ConstantVariable(int), GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR long [ConstantVariable(int), GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [ConstantVariable(int), GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_position_if_large [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR min [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position_if_large [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR full_like [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position_if_large [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TorchVariable(<built-in method full_like of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST num_buckets [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TorchVariable(<built-in method full_like of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TorchVariable(<built-in method full_like of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBTRACT None [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TorchVariable(<built-in method full_like of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TorchVariable(<built-in method full_like of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method min of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_position_if_large [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_buckets [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR where [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST is_small [TensorVariable(), TorchVariable(<built-in method where of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position [TensorVariable(), TorchVariable(<built-in method where of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position_if_large [TensorVariable(), TorchVariable(<built-in method where of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [TensorVariable(), TorchVariable(<built-in method where of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_buckets [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_buckets [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object _relative_position_bucket at 0x7f3f100aea80, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 371> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST relative_position_bucket [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR relative_attention_bias [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST relative_position_bucket [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST values [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST values [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR permute [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), permute)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [GetAttrVariable(TensorVariable(), permute), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), permute), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_LIST 3 [GetAttrVariable(TensorVariable(), permute), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), permute), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR unsqueeze [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [GetAttrVariable(TensorVariable(), unsqueeze)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), unsqueeze), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST values [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST values [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object compute_bias at 0x7f3f100aeb30, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 419> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 392 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST mask [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR softmax [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [TorchVariable(<function softmax at 0x7f3f1bf53e50>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dim',) [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR type_as [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [GetAttrVariable(TensorVariable(), type_as)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), type_as), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [TorchVariable(<function dropout at 0x7f3f1bf53310>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR training [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('p', 'training') [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 484 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST unshape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [NestedUserFunctionVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST value_states [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
472 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (transpose) | |
4 LOAD_CONST 1 (1) | |
6 LOAD_CONST 2 (2) | |
8 CALL_METHOD 2 | |
10 LOAD_METHOD 1 (contiguous) | |
12 CALL_METHOD 0 | |
14 LOAD_METHOD 2 (view) | |
16 LOAD_DEREF 0 (batch_size) | |
18 LOAD_CONST 3 (-1) | |
20 LOAD_DEREF 1 (self) | |
22 LOAD_ATTR 3 (inner_dim) | |
24 CALL_METHOD 3 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR contiguous [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), contiguous)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR inner_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR o [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 532 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_state [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 570 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TensorVariable(), NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), NNModuleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 248 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_IF_FALSE_OR_POP 262 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST do_cross_attention [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST do_cross_attention [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 444 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
325 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 2 (forwarded_states) | |
326 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (DenseReluDense) | |
14 LOAD_FAST 2 (forwarded_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 2 (forwarded_states) | |
327 20 LOAD_FAST 1 (hidden_states) | |
22 LOAD_FAST 0 (self) | |
24 LOAD_METHOD 2 (dropout) | |
26 LOAD_FAST 2 (forwarded_states) | |
28 CALL_METHOD 1 | |
30 BINARY_ADD | |
32 STORE_FAST 1 (hidden_states) | |
328 34 LOAD_FAST 1 (hidden_states) | |
36 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR DenseReluDense [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
288 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (wi) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 1 (hidden_states) | |
289 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (act) | |
14 LOAD_FAST 1 (hidden_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 1 (hidden_states) | |
290 20 LOAD_FAST 0 (self) | |
22 LOAD_METHOD 2 (dropout) | |
24 LOAD_FAST 1 (hidden_states) | |
26 CALL_METHOD 1 | |
28 STORE_FAST 1 (hidden_states) | |
291 30 LOAD_FAST 0 (self) | |
32 LOAD_METHOD 3 (wo) | |
34 LOAD_FAST 1 (hidden_states) | |
36 CALL_METHOD 1 | |
38 STORE_FAST 1 (hidden_states) | |
292 40 LOAD_FAST 1 (hidden_states) | |
42 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wi [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR act [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wo [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 524 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 552 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST False [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ListIteratorVariable(), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1096 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None,) [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ListIteratorVariable(), ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1156 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1172 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1214 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 724 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST i [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_module [ListIteratorVariable(), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_value [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 930 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 946 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR gradient_checkpointing [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1028 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_module [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_hidden_states [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_decoder_position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 11 [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
647 0 LOAD_FAST 9 (past_key_value) | |
2 LOAD_CONST 0 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 122 | |
648 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (is_decoder) | |
12 POP_JUMP_IF_TRUE 24 | |
649 14 LOAD_GLOBAL 1 (logger) | |
16 LOAD_METHOD 2 (warning) | |
18 LOAD_CONST 1 ('`past_key_values` is passed to the encoder. Please make sure this is intended.') | |
20 CALL_METHOD 1 | |
22 POP_TOP | |
650 >> 24 LOAD_FAST 4 (encoder_hidden_states) | |
26 LOAD_CONST 0 (None) | |
28 COMPARE_OP 8 (is) | |
30 POP_JUMP_IF_FALSE 36 | |
32 LOAD_CONST 2 (2) | |
34 JUMP_FORWARD 2 (to 38) | |
>> 36 LOAD_CONST 3 (4) | |
>> 38 STORE_FAST 13 (expected_num_past_key_values) | |
652 40 LOAD_GLOBAL 3 (len) | |
42 LOAD_FAST 9 (past_key_value) | |
44 CALL_FUNCTION 1 | |
46 LOAD_FAST 13 (expected_num_past_key_values) | |
48 COMPARE_OP 3 (!=) | |
50 POP_JUMP_IF_FALSE 96 | |
653 52 LOAD_GLOBAL 4 (ValueError) | |
654 54 LOAD_CONST 4 ('There should be ') | |
56 LOAD_FAST 13 (expected_num_past_key_values) | |
58 FORMAT_VALUE 0 | |
60 LOAD_CONST 5 (' past states. ') | |
62 LOAD_FAST 13 (expected_num_past_key_values) | |
64 LOAD_CONST 3 (4) | |
66 COMPARE_OP 2 (==) | |
68 POP_JUMP_IF_FALSE 74 | |
70 LOAD_CONST 6 ('2 (past / key) for cross attention. ') | |
72 JUMP_FORWARD 2 (to 76) | |
>> 74 LOAD_CONST 7 ('') | |
>> 76 FORMAT_VALUE 0 | |
78 LOAD_CONST 8 ('Got ') | |
80 LOAD_GLOBAL 3 (len) | |
82 LOAD_FAST 9 (past_key_value) | |
84 CALL_FUNCTION 1 | |
86 FORMAT_VALUE 0 | |
88 LOAD_CONST 9 (' past key / value states') | |
90 BUILD_STRING 7 | |
653 92 CALL_FUNCTION 1 | |
94 RAISE_VARARGS 1 | |
659 >> 96 LOAD_FAST 9 (past_key_value) | |
98 LOAD_CONST 0 (None) | |
100 LOAD_CONST 2 (2) | |
102 BUILD_SLICE 2 | |
104 BINARY_SUBSCR | |
106 STORE_FAST 14 (self_attn_past_key_value) | |
660 108 LOAD_FAST 9 (past_key_value) | |
110 LOAD_CONST 2 (2) | |
112 LOAD_CONST 0 (None) | |
114 BUILD_SLICE 2 | |
116 BINARY_SUBSCR | |
118 STORE_FAST 15 (cross_attn_past_key_value) | |
120 JUMP_FORWARD 8 (to 130) | |
662 >> 122 LOAD_CONST 10 ((None, None)) | |
124 UNPACK_SEQUENCE 2 | |
126 STORE_FAST 14 (self_attn_past_key_value) | |
128 STORE_FAST 15 (cross_attn_past_key_value) | |
664 >> 130 LOAD_FAST 0 (self) | |
132 LOAD_ATTR 5 (layer) | |
134 LOAD_CONST 11 (0) | |
136 BINARY_SUBSCR | |
665 138 LOAD_FAST 1 (hidden_states) | |
666 140 LOAD_FAST 2 (attention_mask) | |
667 142 LOAD_FAST 3 (position_bias) | |
668 144 LOAD_FAST 7 (layer_head_mask) | |
669 146 LOAD_FAST 14 (self_attn_past_key_value) | |
670 148 LOAD_FAST 10 (use_cache) | |
671 150 LOAD_FAST 11 (output_attentions) | |
664 152 LOAD_CONST 12 (('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
154 CALL_FUNCTION_KW 7 | |
156 STORE_FAST 16 (self_attention_outputs) | |
673 158 LOAD_FAST 16 (self_attention_outputs) | |
160 LOAD_CONST 0 (None) | |
162 LOAD_CONST 2 (2) | |
164 BUILD_SLICE 2 | |
166 BINARY_SUBSCR | |
168 UNPACK_SEQUENCE 2 | |
170 STORE_FAST 1 (hidden_states) | |
172 STORE_FAST 17 (present_key_value_state) | |
674 174 LOAD_FAST 16 (self_attention_outputs) | |
176 LOAD_CONST 2 (2) | |
178 LOAD_CONST 0 (None) | |
180 BUILD_SLICE 2 | |
182 BINARY_SUBSCR | |
184 STORE_FAST 18 (attention_outputs) | |
677 186 LOAD_FAST 1 (hidden_states) | |
188 LOAD_ATTR 6 (dtype) | |
190 LOAD_GLOBAL 7 (torch) | |
192 LOAD_ATTR 8 (float16) | |
194 COMPARE_OP 2 (==) | |
196 POP_JUMP_IF_FALSE 248 | |
198 LOAD_GLOBAL 7 (torch) | |
200 LOAD_METHOD 9 (isinf) | |
202 LOAD_FAST 1 (hidden_states) | |
204 CALL_METHOD 1 | |
206 LOAD_METHOD 10 (any) | |
208 CALL_METHOD 0 | |
210 POP_JUMP_IF_FALSE 248 | |
678 212 LOAD_GLOBAL 7 (torch) | |
214 LOAD_METHOD 11 (finfo) | |
216 LOAD_FAST 1 (hidden_states) | |
218 LOAD_ATTR 6 (dtype) | |
220 CALL_METHOD 1 | |
222 LOAD_ATTR 12 (max) | |
224 LOAD_CONST 13 (1000) | |
226 BINARY_SUBTRACT | |
228 STORE_FAST 19 (clamp_value) | |
679 230 LOAD_GLOBAL 7 (torch) | |
232 LOAD_ATTR 13 (clamp) | |
234 LOAD_FAST 1 (hidden_states) | |
236 LOAD_FAST 19 (clamp_value) | |
238 UNARY_NEGATIVE | |
240 LOAD_FAST 19 (clamp_value) | |
242 LOAD_CONST 14 (('min', 'max')) | |
244 CALL_FUNCTION_KW 3 | |
246 STORE_FAST 1 (hidden_states) | |
681 >> 248 LOAD_FAST 0 (self) | |
250 LOAD_ATTR 0 (is_decoder) | |
252 EXTENDED_ARG 1 | |
254 JUMP_IF_FALSE_OR_POP 262 | |
256 LOAD_FAST 4 (encoder_hidden_states) | |
258 LOAD_CONST 0 (None) | |
260 COMPARE_OP 9 (is not) | |
>> 262 STORE_FAST 20 (do_cross_attention) | |
682 264 LOAD_FAST 20 (do_cross_attention) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_FALSE 444 | |
685 270 LOAD_FAST 17 (present_key_value_state) | |
272 LOAD_CONST 0 (None) | |
274 COMPARE_OP 9 (is not) | |
276 EXTENDED_ARG 1 | |
278 POP_JUMP_IF_FALSE 296 | |
686 280 LOAD_FAST 17 (present_key_value_state) | |
282 LOAD_CONST 11 (0) | |
284 BINARY_SUBSCR | |
286 LOAD_ATTR 14 (shape) | |
288 LOAD_CONST 2 (2) | |
290 BINARY_SUBSCR | |
292 STORE_FAST 21 (query_length) | |
294 JUMP_FORWARD 4 (to 300) | |
688 >> 296 LOAD_CONST 0 (None) | |
298 STORE_FAST 21 (query_length) | |
690 >> 300 LOAD_FAST 0 (self) | |
302 LOAD_ATTR 5 (layer) | |
304 LOAD_CONST 15 (1) | |
306 BINARY_SUBSCR | |
691 308 LOAD_FAST 1 (hidden_states) | |
692 310 LOAD_FAST 4 (encoder_hidden_states) | |
693 312 LOAD_FAST 5 (encoder_attention_mask) | |
694 314 LOAD_FAST 6 (encoder_decoder_position_bias) | |
695 316 LOAD_FAST 8 (cross_attn_layer_head_mask) | |
696 318 LOAD_FAST 15 (cross_attn_past_key_value) | |
697 320 LOAD_FAST 21 (query_length) | |
698 322 LOAD_FAST 10 (use_cache) | |
699 324 LOAD_FAST 11 (output_attentions) | |
690 326 LOAD_CONST 16 (('key_value_states', 'attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'query_length', 'use_cache', 'output_attentions')) | |
328 CALL_FUNCTION_KW 9 | |
330 STORE_FAST 22 (cross_attention_outputs) | |
701 332 LOAD_FAST 22 (cross_attention_outputs) | |
334 LOAD_CONST 11 (0) | |
336 BINARY_SUBSCR | |
338 STORE_FAST 1 (hidden_states) | |
704 340 LOAD_FAST 1 (hidden_states) | |
342 LOAD_ATTR 6 (dtype) | |
344 LOAD_GLOBAL 7 (torch) | |
346 LOAD_ATTR 8 (float16) | |
348 COMPARE_OP 2 (==) | |
350 EXTENDED_ARG 1 | |
352 POP_JUMP_IF_FALSE 406 | |
354 LOAD_GLOBAL 7 (torch) | |
356 LOAD_METHOD 9 (isinf) | |
358 LOAD_FAST 1 (hidden_states) | |
360 CALL_METHOD 1 | |
362 LOAD_METHOD 10 (any) | |
364 CALL_METHOD 0 | |
366 EXTENDED_ARG 1 | |
368 POP_JUMP_IF_FALSE 406 | |
705 370 LOAD_GLOBAL 7 (torch) | |
372 LOAD_METHOD 11 (finfo) | |
374 LOAD_FAST 1 (hidden_states) | |
376 LOAD_ATTR 6 (dtype) | |
378 CALL_METHOD 1 | |
380 LOAD_ATTR 12 (max) | |
382 LOAD_CONST 13 (1000) | |
384 BINARY_SUBTRACT | |
386 STORE_FAST 19 (clamp_value) | |
706 388 LOAD_GLOBAL 7 (torch) | |
390 LOAD_ATTR 13 (clamp) | |
392 LOAD_FAST 1 (hidden_states) | |
394 LOAD_FAST 19 (clamp_value) | |
396 UNARY_NEGATIVE | |
398 LOAD_FAST 19 (clamp_value) | |
400 LOAD_CONST 14 (('min', 'max')) | |
402 CALL_FUNCTION_KW 3 | |
404 STORE_FAST 1 (hidden_states) | |
709 >> 406 LOAD_FAST 17 (present_key_value_state) | |
408 LOAD_CONST 0 (None) | |
410 COMPARE_OP 9 (is not) | |
412 EXTENDED_ARG 1 | |
414 POP_JUMP_IF_FALSE 428 | |
710 416 LOAD_FAST 17 (present_key_value_state) | |
418 LOAD_FAST 22 (cross_attention_outputs) | |
420 LOAD_CONST 15 (1) | |
422 BINARY_SUBSCR | |
424 BINARY_ADD | |
426 STORE_FAST 17 (present_key_value_state) | |
713 >> 428 LOAD_FAST 18 (attention_outputs) | |
430 LOAD_FAST 22 (cross_attention_outputs) | |
432 LOAD_CONST 2 (2) | |
434 LOAD_CONST 0 (None) | |
436 BUILD_SLICE 2 | |
438 BINARY_SUBSCR | |
440 BINARY_ADD | |
442 STORE_FAST 18 (attention_outputs) | |
716 >> 444 LOAD_FAST 0 (self) | |
446 LOAD_ATTR 5 (layer) | |
448 LOAD_CONST 17 (-1) | |
450 BINARY_SUBSCR | |
452 LOAD_FAST 1 (hidden_states) | |
454 CALL_FUNCTION 1 | |
456 STORE_FAST 1 (hidden_states) | |
719 458 LOAD_FAST 1 (hidden_states) | |
460 LOAD_ATTR 6 (dtype) | |
462 LOAD_GLOBAL 7 (torch) | |
464 LOAD_ATTR 8 (float16) | |
466 COMPARE_OP 2 (==) | |
468 EXTENDED_ARG 2 | |
470 POP_JUMP_IF_FALSE 524 | |
472 LOAD_GLOBAL 7 (torch) | |
474 LOAD_METHOD 9 (isinf) | |
476 LOAD_FAST 1 (hidden_states) | |
478 CALL_METHOD 1 | |
480 LOAD_METHOD 10 (any) | |
482 CALL_METHOD 0 | |
484 EXTENDED_ARG 2 | |
486 POP_JUMP_IF_FALSE 524 | |
720 488 LOAD_GLOBAL 7 (torch) | |
490 LOAD_METHOD 11 (finfo) | |
492 LOAD_FAST 1 (hidden_states) | |
494 LOAD_ATTR 6 (dtype) | |
496 CALL_METHOD 1 | |
498 LOAD_ATTR 12 (max) | |
500 LOAD_CONST 13 (1000) | |
502 BINARY_SUBTRACT | |
504 STORE_FAST 19 (clamp_value) | |
721 506 LOAD_GLOBAL 7 (torch) | |
508 LOAD_ATTR 13 (clamp) | |
510 LOAD_FAST 1 (hidden_states) | |
512 LOAD_FAST 19 (clamp_value) | |
514 UNARY_NEGATIVE | |
516 LOAD_FAST 19 (clamp_value) | |
518 LOAD_CONST 14 (('min', 'max')) | |
520 CALL_FUNCTION_KW 3 | |
522 STORE_FAST 1 (hidden_states) | |
723 >> 524 LOAD_FAST 1 (hidden_states) | |
526 BUILD_TUPLE 1 | |
528 STORE_FAST 23 (outputs) | |
725 530 LOAD_FAST 10 (use_cache) | |
532 EXTENDED_ARG 2 | |
534 POP_JUMP_IF_FALSE 552 | |
726 536 LOAD_FAST 23 (outputs) | |
538 LOAD_FAST 17 (present_key_value_state) | |
540 BUILD_TUPLE 1 | |
542 BINARY_ADD | |
544 LOAD_FAST 18 (attention_outputs) | |
546 BINARY_ADD | |
548 STORE_FAST 23 (outputs) | |
550 JUMP_FORWARD 8 (to 560) | |
728 >> 552 LOAD_FAST 23 (outputs) | |
554 LOAD_FAST 18 (attention_outputs) | |
556 BINARY_ADD | |
558 STORE_FAST 23 (outputs) | |
730 >> 560 LOAD_FAST 23 (outputs) | |
562 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None, None) [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attn_past_key_value [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_past_key_value [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attn_past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
569 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 8 (normed_hidden_states) | |
570 10 LOAD_FAST 0 (self) | |
12 LOAD_ATTR 1 (SelfAttention) | |
571 14 LOAD_FAST 8 (normed_hidden_states) | |
572 16 LOAD_FAST 2 (attention_mask) | |
573 18 LOAD_FAST 3 (position_bias) | |
574 20 LOAD_FAST 4 (layer_head_mask) | |
575 22 LOAD_FAST 5 (past_key_value) | |
576 24 LOAD_FAST 6 (use_cache) | |
577 26 LOAD_FAST 7 (output_attentions) | |
570 28 LOAD_CONST 1 (('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
30 CALL_FUNCTION_KW 7 | |
32 STORE_FAST 9 (attention_output) | |
579 34 LOAD_FAST 1 (hidden_states) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_METHOD 2 (dropout) | |
40 LOAD_FAST 9 (attention_output) | |
42 LOAD_CONST 2 (0) | |
44 BINARY_SUBSCR | |
46 CALL_METHOD 1 | |
48 BINARY_ADD | |
50 STORE_FAST 1 (hidden_states) | |
580 52 LOAD_FAST 1 (hidden_states) | |
54 BUILD_TUPLE 1 | |
56 LOAD_FAST 9 (attention_output) | |
58 LOAD_CONST 3 (1) | |
60 LOAD_CONST 0 (None) | |
62 BUILD_SLICE 2 | |
64 BINARY_SUBSCR | |
66 BINARY_ADD | |
68 STORE_FAST 10 (outputs) | |
581 70 LOAD_FAST 10 (outputs) | |
72 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST normed_hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR SelfAttention [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST normed_hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
454 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_ATTR 0 (shape) | |
4 LOAD_CONST 1 (None) | |
6 LOAD_CONST 2 (2) | |
8 BUILD_SLICE 2 | |
10 BINARY_SUBSCR | |
12 UNPACK_SEQUENCE 2 | |
14 STORE_DEREF 0 (batch_size) | |
16 STORE_FAST 10 (seq_length) | |
456 18 LOAD_FAST 10 (seq_length) | |
20 STORE_FAST 11 (real_seq_length) | |
458 22 LOAD_FAST 5 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 9 (is not) | |
28 POP_JUMP_IF_FALSE 92 | |
460 30 LOAD_GLOBAL 1 (len) | |
32 LOAD_FAST 5 (past_key_value) | |
34 CALL_FUNCTION 1 | |
36 LOAD_CONST 2 (2) | |
38 COMPARE_OP 2 (==) | |
459 40 POP_JUMP_IF_TRUE 62 | |
42 LOAD_GLOBAL 2 (AssertionError) | |
461 44 LOAD_CONST 3 ('past_key_value should have 2 past states: keys and values. Got ') | |
46 LOAD_GLOBAL 1 (len) | |
48 LOAD_FAST 5 (past_key_value) | |
50 CALL_FUNCTION 1 | |
52 FORMAT_VALUE 0 | |
54 LOAD_CONST 4 (' past states') | |
56 BUILD_STRING 3 | |
459 58 CALL_FUNCTION 1 | |
60 RAISE_VARARGS 1 | |
462 >> 62 LOAD_FAST 11 (real_seq_length) | |
64 LOAD_FAST 7 (query_length) | |
66 LOAD_CONST 1 (None) | |
68 COMPARE_OP 8 (is) | |
70 POP_JUMP_IF_FALSE 86 | |
72 LOAD_FAST 5 (past_key_value) | |
74 LOAD_CONST 5 (0) | |
76 BINARY_SUBSCR | |
78 LOAD_ATTR 0 (shape) | |
80 LOAD_CONST 2 (2) | |
82 BINARY_SUBSCR | |
84 JUMP_FORWARD 2 (to 88) | |
>> 86 LOAD_FAST 7 (query_length) | |
>> 88 INPLACE_ADD | |
90 STORE_FAST 11 (real_seq_length) | |
464 >> 92 LOAD_FAST 3 (key_value_states) | |
94 LOAD_CONST 1 (None) | |
96 COMPARE_OP 8 (is) | |
98 POP_JUMP_IF_FALSE 104 | |
100 LOAD_FAST 11 (real_seq_length) | |
102 JUMP_FORWARD 8 (to 112) | |
>> 104 LOAD_FAST 3 (key_value_states) | |
106 LOAD_ATTR 0 (shape) | |
108 LOAD_CONST 6 (1) | |
110 BINARY_SUBSCR | |
>> 112 STORE_FAST 12 (key_length) | |
466 114 LOAD_CLOSURE 0 (batch_size) | |
116 LOAD_CLOSURE 1 (self) | |
118 BUILD_TUPLE 2 | |
120 LOAD_CONST 7 (<code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466>) | |
122 LOAD_CONST 8 ('T5Attention.forward.<locals>.shape') | |
124 MAKE_FUNCTION 8 (closure) | |
126 STORE_DEREF 2 (shape) | |
470 128 LOAD_CLOSURE 0 (batch_size) | |
130 LOAD_CLOSURE 1 (self) | |
132 BUILD_TUPLE 2 | |
134 LOAD_CONST 9 (<code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470>) | |
136 LOAD_CONST 10 ('T5Attention.forward.<locals>.unshape') | |
138 MAKE_FUNCTION 8 (closure) | |
140 STORE_FAST 13 (unshape) | |
474 142 LOAD_CLOSURE 2 (shape) | |
144 BUILD_TUPLE 1 | |
146 LOAD_CONST 11 (<code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474>) | |
148 LOAD_CONST 12 ('T5Attention.forward.<locals>.project') | |
150 MAKE_FUNCTION 8 (closure) | |
152 STORE_FAST 14 (project) | |
496 154 LOAD_DEREF 2 (shape) | |
156 LOAD_DEREF 1 (self) | |
158 LOAD_METHOD 3 (q) | |
160 LOAD_FAST 1 (hidden_states) | |
162 CALL_METHOD 1 | |
164 CALL_FUNCTION 1 | |
166 STORE_FAST 15 (query_states) | |
499 168 LOAD_FAST 14 (project) | |
500 170 LOAD_FAST 1 (hidden_states) | |
172 LOAD_DEREF 1 (self) | |
174 LOAD_ATTR 4 (k) | |
176 LOAD_FAST 3 (key_value_states) | |
178 LOAD_FAST 5 (past_key_value) | |
180 LOAD_CONST 1 (None) | |
182 COMPARE_OP 9 (is not) | |
184 POP_JUMP_IF_FALSE 194 | |
186 LOAD_FAST 5 (past_key_value) | |
188 LOAD_CONST 5 (0) | |
190 BINARY_SUBSCR | |
192 JUMP_FORWARD 2 (to 196) | |
>> 194 LOAD_CONST 1 (None) | |
499 >> 196 CALL_FUNCTION 4 | |
198 STORE_FAST 16 (key_states) | |
502 200 LOAD_FAST 14 (project) | |
503 202 LOAD_FAST 1 (hidden_states) | |
204 LOAD_DEREF 1 (self) | |
206 LOAD_ATTR 5 (v) | |
208 LOAD_FAST 3 (key_value_states) | |
210 LOAD_FAST 5 (past_key_value) | |
212 LOAD_CONST 1 (None) | |
214 COMPARE_OP 9 (is not) | |
216 POP_JUMP_IF_FALSE 226 | |
218 LOAD_FAST 5 (past_key_value) | |
220 LOAD_CONST 6 (1) | |
222 BINARY_SUBSCR | |
224 JUMP_FORWARD 2 (to 228) | |
>> 226 LOAD_CONST 1 (None) | |
502 >> 228 CALL_FUNCTION 4 | |
230 STORE_FAST 17 (value_states) | |
507 232 LOAD_GLOBAL 6 (torch) | |
234 LOAD_METHOD 7 (matmul) | |
508 236 LOAD_FAST 15 (query_states) | |
238 LOAD_FAST 16 (key_states) | |
240 LOAD_METHOD 8 (transpose) | |
242 LOAD_CONST 13 (3) | |
244 LOAD_CONST 2 (2) | |
246 CALL_METHOD 2 | |
507 248 CALL_METHOD 2 | |
250 STORE_FAST 18 (scores) | |
511 252 LOAD_FAST 4 (position_bias) | |
254 LOAD_CONST 1 (None) | |
256 COMPARE_OP 8 (is) | |
258 EXTENDED_ARG 1 | |
260 POP_JUMP_IF_FALSE 410 | |
512 262 LOAD_DEREF 1 (self) | |
264 LOAD_ATTR 9 (has_relative_attention_bias) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_TRUE 324 | |
513 270 LOAD_GLOBAL 6 (torch) | |
272 LOAD_ATTR 10 (zeros) | |
514 274 LOAD_CONST 6 (1) | |
276 LOAD_DEREF 1 (self) | |
278 LOAD_ATTR 11 (n_heads) | |
280 LOAD_FAST 11 (real_seq_length) | |
282 LOAD_FAST 12 (key_length) | |
284 BUILD_TUPLE 4 | |
286 LOAD_FAST 18 (scores) | |
288 LOAD_ATTR 12 (device) | |
290 LOAD_FAST 18 (scores) | |
292 LOAD_ATTR 13 (dtype) | |
513 294 LOAD_CONST 14 (('device', 'dtype')) | |
296 CALL_FUNCTION_KW 3 | |
298 STORE_FAST 4 (position_bias) | |
516 300 LOAD_DEREF 1 (self) | |
302 LOAD_ATTR 14 (gradient_checkpointing) | |
304 EXTENDED_ARG 1 | |
306 POP_JUMP_IF_FALSE 342 | |
308 LOAD_DEREF 1 (self) | |
310 LOAD_ATTR 15 (training) | |
312 EXTENDED_ARG 1 | |
314 POP_JUMP_IF_FALSE 342 | |
517 316 LOAD_CONST 15 (True) | |
318 LOAD_FAST 4 (position_bias) | |
320 STORE_ATTR 16 (requires_grad) | |
322 JUMP_FORWARD 18 (to 342) | |
519 >> 324 LOAD_DEREF 1 (self) | |
326 LOAD_ATTR 17 (compute_bias) | |
328 LOAD_FAST 11 (real_seq_length) | |
330 LOAD_FAST 12 (key_length) | |
332 LOAD_FAST 18 (scores) | |
334 LOAD_ATTR 12 (device) | |
336 LOAD_CONST 16 (('device',)) | |
338 CALL_FUNCTION_KW 3 | |
340 STORE_FAST 4 (position_bias) | |
523 >> 342 LOAD_FAST 5 (past_key_value) | |
344 LOAD_CONST 1 (None) | |
346 COMPARE_OP 9 (is not) | |
348 EXTENDED_ARG 1 | |
350 POP_JUMP_IF_FALSE 392 | |
524 352 LOAD_FAST 4 (position_bias) | |
354 LOAD_CONST 1 (None) | |
356 LOAD_CONST 1 (None) | |
358 BUILD_SLICE 2 | |
360 LOAD_CONST 1 (None) | |
362 LOAD_CONST 1 (None) | |
364 BUILD_SLICE 2 | |
366 LOAD_FAST 1 (hidden_states) | |
368 LOAD_METHOD 18 (size) | |
370 LOAD_CONST 6 (1) | |
372 CALL_METHOD 1 | |
374 UNARY_NEGATIVE | |
376 LOAD_CONST 1 (None) | |
378 BUILD_SLICE 2 | |
380 LOAD_CONST 1 (None) | |
382 LOAD_CONST 1 (None) | |
384 BUILD_SLICE 2 | |
386 BUILD_TUPLE 4 | |
388 BINARY_SUBSCR | |
390 STORE_FAST 4 (position_bias) | |
526 >> 392 LOAD_FAST 2 (mask) | |
394 LOAD_CONST 1 (None) | |
396 COMPARE_OP 9 (is not) | |
398 EXTENDED_ARG 1 | |
400 POP_JUMP_IF_FALSE 410 | |
527 402 LOAD_FAST 4 (position_bias) | |
404 LOAD_FAST 2 (mask) | |
406 BINARY_ADD | |
408 STORE_FAST 4 (position_bias) | |
529 >> 410 LOAD_FAST 18 (scores) | |
412 LOAD_FAST 4 (position_bias) | |
414 INPLACE_ADD | |
416 STORE_FAST 18 (scores) | |
530 418 LOAD_GLOBAL 19 (nn) | |
420 LOAD_ATTR 20 (functional) | |
422 LOAD_ATTR 21 (softmax) | |
424 LOAD_FAST 18 (scores) | |
426 LOAD_METHOD 22 (float) | |
428 CALL_METHOD 0 | |
430 LOAD_CONST 17 (-1) | |
432 LOAD_CONST 18 (('dim',)) | |
434 CALL_FUNCTION_KW 2 | |
436 LOAD_METHOD 23 (type_as) | |
531 438 LOAD_FAST 18 (scores) | |
530 440 CALL_METHOD 1 | |
442 STORE_FAST 19 (attn_weights) | |
533 444 LOAD_GLOBAL 19 (nn) | |
446 LOAD_ATTR 20 (functional) | |
448 LOAD_ATTR 24 (dropout) | |
534 450 LOAD_FAST 19 (attn_weights) | |
452 LOAD_DEREF 1 (self) | |
454 LOAD_ATTR 24 (dropout) | |
456 LOAD_DEREF 1 (self) | |
458 LOAD_ATTR 15 (training) | |
533 460 LOAD_CONST 19 (('p', 'training')) | |
462 CALL_FUNCTION_KW 3 | |
464 STORE_FAST 19 (attn_weights) | |
538 466 LOAD_FAST 6 (layer_head_mask) | |
468 LOAD_CONST 1 (None) | |
470 COMPARE_OP 9 (is not) | |
472 EXTENDED_ARG 1 | |
474 POP_JUMP_IF_FALSE 484 | |
539 476 LOAD_FAST 19 (attn_weights) | |
478 LOAD_FAST 6 (layer_head_mask) | |
480 BINARY_MULTIPLY | |
482 STORE_FAST 19 (attn_weights) | |
541 >> 484 LOAD_FAST 13 (unshape) | |
486 LOAD_GLOBAL 6 (torch) | |
488 LOAD_METHOD 7 (matmul) | |
490 LOAD_FAST 19 (attn_weights) | |
492 LOAD_FAST 17 (value_states) | |
494 CALL_METHOD 2 | |
496 CALL_FUNCTION 1 | |
498 STORE_FAST 20 (attn_output) | |
542 500 LOAD_DEREF 1 (self) | |
502 LOAD_METHOD 25 (o) | |
504 LOAD_FAST 20 (attn_output) | |
506 CALL_METHOD 1 | |
508 STORE_FAST 20 (attn_output) | |
544 510 LOAD_DEREF 1 (self) | |
512 LOAD_ATTR 26 (is_decoder) | |
514 EXTENDED_ARG 2 | |
516 POP_JUMP_IF_FALSE 532 | |
518 LOAD_FAST 8 (use_cache) | |
520 EXTENDED_ARG 2 | |
522 POP_JUMP_IF_FALSE 532 | |
524 LOAD_FAST 16 (key_states) | |
526 LOAD_FAST 17 (value_states) | |
528 BUILD_TUPLE 2 | |
530 JUMP_FORWARD 2 (to 534) | |
>> 532 LOAD_CONST 1 (None) | |
>> 534 STORE_FAST 21 (present_key_value_state) | |
545 536 LOAD_FAST 20 (attn_output) | |
538 BUILD_TUPLE 1 | |
540 LOAD_FAST 21 (present_key_value_state) | |
542 BUILD_TUPLE 1 | |
544 BINARY_ADD | |
546 LOAD_FAST 4 (position_bias) | |
548 BUILD_TUPLE 1 | |
550 BINARY_ADD | |
552 STORE_FAST 22 (outputs) | |
547 554 LOAD_FAST 9 (output_attentions) | |
556 EXTENDED_ARG 2 | |
558 POP_JUMP_IF_FALSE 570 | |
548 560 LOAD_FAST 22 (outputs) | |
562 LOAD_FAST 19 (attn_weights) | |
564 BUILD_TUPLE 1 | |
566 BINARY_ADD | |
568 STORE_FAST 22 (outputs) | |
549 >> 570 LOAD_FAST 22 (outputs) | |
572 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ShapeVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ShapeVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ShapeVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST real_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 92 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 104 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 112 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.shape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF shape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.unshape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST unshape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.project [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST project [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR q [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST query_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR k [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 194 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR v [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 226 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST value_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR softmax [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [TorchVariable(<function softmax at 0x7f3f1bf53e50>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dim',) [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR type_as [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [GetAttrVariable(TensorVariable(), type_as)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), type_as), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [TorchVariable(<function dropout at 0x7f3f1bf53310>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR training [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('p', 'training') [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 484 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST unshape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [NestedUserFunctionVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST value_states [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
472 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (transpose) | |
4 LOAD_CONST 1 (1) | |
6 LOAD_CONST 2 (2) | |
8 CALL_METHOD 2 | |
10 LOAD_METHOD 1 (contiguous) | |
12 CALL_METHOD 0 | |
14 LOAD_METHOD 2 (view) | |
16 LOAD_DEREF 0 (batch_size) | |
18 LOAD_CONST 3 (-1) | |
20 LOAD_DEREF 1 (self) | |
22 LOAD_ATTR 3 (inner_dim) | |
24 CALL_METHOD 3 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR contiguous [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), contiguous)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR inner_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR o [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 532 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_state [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 570 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TensorVariable(), NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), NNModuleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 248 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_IF_FALSE_OR_POP 262 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST do_cross_attention [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST do_cross_attention [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 444 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
325 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 2 (forwarded_states) | |
326 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (DenseReluDense) | |
14 LOAD_FAST 2 (forwarded_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 2 (forwarded_states) | |
327 20 LOAD_FAST 1 (hidden_states) | |
22 LOAD_FAST 0 (self) | |
24 LOAD_METHOD 2 (dropout) | |
26 LOAD_FAST 2 (forwarded_states) | |
28 CALL_METHOD 1 | |
30 BINARY_ADD | |
32 STORE_FAST 1 (hidden_states) | |
328 34 LOAD_FAST 1 (hidden_states) | |
36 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR DenseReluDense [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
288 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (wi) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 1 (hidden_states) | |
289 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (act) | |
14 LOAD_FAST 1 (hidden_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 1 (hidden_states) | |
290 20 LOAD_FAST 0 (self) | |
22 LOAD_METHOD 2 (dropout) | |
24 LOAD_FAST 1 (hidden_states) | |
26 CALL_METHOD 1 | |
28 STORE_FAST 1 (hidden_states) | |
291 30 LOAD_FAST 0 (self) | |
32 LOAD_METHOD 3 (wo) | |
34 LOAD_FAST 1 (hidden_states) | |
36 CALL_METHOD 1 | |
38 STORE_FAST 1 (hidden_states) | |
292 40 LOAD_FAST 1 (hidden_states) | |
42 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wi [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR act [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wo [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 524 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 552 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST False [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ListIteratorVariable(), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1096 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None,) [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ListIteratorVariable(), ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1156 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1172 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1214 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 724 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST i [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_module [ListIteratorVariable(), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_value [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 930 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 946 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR gradient_checkpointing [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1028 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_module [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_hidden_states [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_decoder_position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 11 [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
647 0 LOAD_FAST 9 (past_key_value) | |
2 LOAD_CONST 0 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 122 | |
648 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (is_decoder) | |
12 POP_JUMP_IF_TRUE 24 | |
649 14 LOAD_GLOBAL 1 (logger) | |
16 LOAD_METHOD 2 (warning) | |
18 LOAD_CONST 1 ('`past_key_values` is passed to the encoder. Please make sure this is intended.') | |
20 CALL_METHOD 1 | |
22 POP_TOP | |
650 >> 24 LOAD_FAST 4 (encoder_hidden_states) | |
26 LOAD_CONST 0 (None) | |
28 COMPARE_OP 8 (is) | |
30 POP_JUMP_IF_FALSE 36 | |
32 LOAD_CONST 2 (2) | |
34 JUMP_FORWARD 2 (to 38) | |
>> 36 LOAD_CONST 3 (4) | |
>> 38 STORE_FAST 13 (expected_num_past_key_values) | |
652 40 LOAD_GLOBAL 3 (len) | |
42 LOAD_FAST 9 (past_key_value) | |
44 CALL_FUNCTION 1 | |
46 LOAD_FAST 13 (expected_num_past_key_values) | |
48 COMPARE_OP 3 (!=) | |
50 POP_JUMP_IF_FALSE 96 | |
653 52 LOAD_GLOBAL 4 (ValueError) | |
654 54 LOAD_CONST 4 ('There should be ') | |
56 LOAD_FAST 13 (expected_num_past_key_values) | |
58 FORMAT_VALUE 0 | |
60 LOAD_CONST 5 (' past states. ') | |
62 LOAD_FAST 13 (expected_num_past_key_values) | |
64 LOAD_CONST 3 (4) | |
66 COMPARE_OP 2 (==) | |
68 POP_JUMP_IF_FALSE 74 | |
70 LOAD_CONST 6 ('2 (past / key) for cross attention. ') | |
72 JUMP_FORWARD 2 (to 76) | |
>> 74 LOAD_CONST 7 ('') | |
>> 76 FORMAT_VALUE 0 | |
78 LOAD_CONST 8 ('Got ') | |
80 LOAD_GLOBAL 3 (len) | |
82 LOAD_FAST 9 (past_key_value) | |
84 CALL_FUNCTION 1 | |
86 FORMAT_VALUE 0 | |
88 LOAD_CONST 9 (' past key / value states') | |
90 BUILD_STRING 7 | |
653 92 CALL_FUNCTION 1 | |
94 RAISE_VARARGS 1 | |
659 >> 96 LOAD_FAST 9 (past_key_value) | |
98 LOAD_CONST 0 (None) | |
100 LOAD_CONST 2 (2) | |
102 BUILD_SLICE 2 | |
104 BINARY_SUBSCR | |
106 STORE_FAST 14 (self_attn_past_key_value) | |
660 108 LOAD_FAST 9 (past_key_value) | |
110 LOAD_CONST 2 (2) | |
112 LOAD_CONST 0 (None) | |
114 BUILD_SLICE 2 | |
116 BINARY_SUBSCR | |
118 STORE_FAST 15 (cross_attn_past_key_value) | |
120 JUMP_FORWARD 8 (to 130) | |
662 >> 122 LOAD_CONST 10 ((None, None)) | |
124 UNPACK_SEQUENCE 2 | |
126 STORE_FAST 14 (self_attn_past_key_value) | |
128 STORE_FAST 15 (cross_attn_past_key_value) | |
664 >> 130 LOAD_FAST 0 (self) | |
132 LOAD_ATTR 5 (layer) | |
134 LOAD_CONST 11 (0) | |
136 BINARY_SUBSCR | |
665 138 LOAD_FAST 1 (hidden_states) | |
666 140 LOAD_FAST 2 (attention_mask) | |
667 142 LOAD_FAST 3 (position_bias) | |
668 144 LOAD_FAST 7 (layer_head_mask) | |
669 146 LOAD_FAST 14 (self_attn_past_key_value) | |
670 148 LOAD_FAST 10 (use_cache) | |
671 150 LOAD_FAST 11 (output_attentions) | |
664 152 LOAD_CONST 12 (('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
154 CALL_FUNCTION_KW 7 | |
156 STORE_FAST 16 (self_attention_outputs) | |
673 158 LOAD_FAST 16 (self_attention_outputs) | |
160 LOAD_CONST 0 (None) | |
162 LOAD_CONST 2 (2) | |
164 BUILD_SLICE 2 | |
166 BINARY_SUBSCR | |
168 UNPACK_SEQUENCE 2 | |
170 STORE_FAST 1 (hidden_states) | |
172 STORE_FAST 17 (present_key_value_state) | |
674 174 LOAD_FAST 16 (self_attention_outputs) | |
176 LOAD_CONST 2 (2) | |
178 LOAD_CONST 0 (None) | |
180 BUILD_SLICE 2 | |
182 BINARY_SUBSCR | |
184 STORE_FAST 18 (attention_outputs) | |
677 186 LOAD_FAST 1 (hidden_states) | |
188 LOAD_ATTR 6 (dtype) | |
190 LOAD_GLOBAL 7 (torch) | |
192 LOAD_ATTR 8 (float16) | |
194 COMPARE_OP 2 (==) | |
196 POP_JUMP_IF_FALSE 248 | |
198 LOAD_GLOBAL 7 (torch) | |
200 LOAD_METHOD 9 (isinf) | |
202 LOAD_FAST 1 (hidden_states) | |
204 CALL_METHOD 1 | |
206 LOAD_METHOD 10 (any) | |
208 CALL_METHOD 0 | |
210 POP_JUMP_IF_FALSE 248 | |
678 212 LOAD_GLOBAL 7 (torch) | |
214 LOAD_METHOD 11 (finfo) | |
216 LOAD_FAST 1 (hidden_states) | |
218 LOAD_ATTR 6 (dtype) | |
220 CALL_METHOD 1 | |
222 LOAD_ATTR 12 (max) | |
224 LOAD_CONST 13 (1000) | |
226 BINARY_SUBTRACT | |
228 STORE_FAST 19 (clamp_value) | |
679 230 LOAD_GLOBAL 7 (torch) | |
232 LOAD_ATTR 13 (clamp) | |
234 LOAD_FAST 1 (hidden_states) | |
236 LOAD_FAST 19 (clamp_value) | |
238 UNARY_NEGATIVE | |
240 LOAD_FAST 19 (clamp_value) | |
242 LOAD_CONST 14 (('min', 'max')) | |
244 CALL_FUNCTION_KW 3 | |
246 STORE_FAST 1 (hidden_states) | |
681 >> 248 LOAD_FAST 0 (self) | |
250 LOAD_ATTR 0 (is_decoder) | |
252 EXTENDED_ARG 1 | |
254 JUMP_IF_FALSE_OR_POP 262 | |
256 LOAD_FAST 4 (encoder_hidden_states) | |
258 LOAD_CONST 0 (None) | |
260 COMPARE_OP 9 (is not) | |
>> 262 STORE_FAST 20 (do_cross_attention) | |
682 264 LOAD_FAST 20 (do_cross_attention) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_FALSE 444 | |
685 270 LOAD_FAST 17 (present_key_value_state) | |
272 LOAD_CONST 0 (None) | |
274 COMPARE_OP 9 (is not) | |
276 EXTENDED_ARG 1 | |
278 POP_JUMP_IF_FALSE 296 | |
686 280 LOAD_FAST 17 (present_key_value_state) | |
282 LOAD_CONST 11 (0) | |
284 BINARY_SUBSCR | |
286 LOAD_ATTR 14 (shape) | |
288 LOAD_CONST 2 (2) | |
290 BINARY_SUBSCR | |
292 STORE_FAST 21 (query_length) | |
294 JUMP_FORWARD 4 (to 300) | |
688 >> 296 LOAD_CONST 0 (None) | |
298 STORE_FAST 21 (query_length) | |
690 >> 300 LOAD_FAST 0 (self) | |
302 LOAD_ATTR 5 (layer) | |
304 LOAD_CONST 15 (1) | |
306 BINARY_SUBSCR | |
691 308 LOAD_FAST 1 (hidden_states) | |
692 310 LOAD_FAST 4 (encoder_hidden_states) | |
693 312 LOAD_FAST 5 (encoder_attention_mask) | |
694 314 LOAD_FAST 6 (encoder_decoder_position_bias) | |
695 316 LOAD_FAST 8 (cross_attn_layer_head_mask) | |
696 318 LOAD_FAST 15 (cross_attn_past_key_value) | |
697 320 LOAD_FAST 21 (query_length) | |
698 322 LOAD_FAST 10 (use_cache) | |
699 324 LOAD_FAST 11 (output_attentions) | |
690 326 LOAD_CONST 16 (('key_value_states', 'attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'query_length', 'use_cache', 'output_attentions')) | |
328 CALL_FUNCTION_KW 9 | |
330 STORE_FAST 22 (cross_attention_outputs) | |
701 332 LOAD_FAST 22 (cross_attention_outputs) | |
334 LOAD_CONST 11 (0) | |
336 BINARY_SUBSCR | |
338 STORE_FAST 1 (hidden_states) | |
704 340 LOAD_FAST 1 (hidden_states) | |
342 LOAD_ATTR 6 (dtype) | |
344 LOAD_GLOBAL 7 (torch) | |
346 LOAD_ATTR 8 (float16) | |
348 COMPARE_OP 2 (==) | |
350 EXTENDED_ARG 1 | |
352 POP_JUMP_IF_FALSE 406 | |
354 LOAD_GLOBAL 7 (torch) | |
356 LOAD_METHOD 9 (isinf) | |
358 LOAD_FAST 1 (hidden_states) | |
360 CALL_METHOD 1 | |
362 LOAD_METHOD 10 (any) | |
364 CALL_METHOD 0 | |
366 EXTENDED_ARG 1 | |
368 POP_JUMP_IF_FALSE 406 | |
705 370 LOAD_GLOBAL 7 (torch) | |
372 LOAD_METHOD 11 (finfo) | |
374 LOAD_FAST 1 (hidden_states) | |
376 LOAD_ATTR 6 (dtype) | |
378 CALL_METHOD 1 | |
380 LOAD_ATTR 12 (max) | |
382 LOAD_CONST 13 (1000) | |
384 BINARY_SUBTRACT | |
386 STORE_FAST 19 (clamp_value) | |
706 388 LOAD_GLOBAL 7 (torch) | |
390 LOAD_ATTR 13 (clamp) | |
392 LOAD_FAST 1 (hidden_states) | |
394 LOAD_FAST 19 (clamp_value) | |
396 UNARY_NEGATIVE | |
398 LOAD_FAST 19 (clamp_value) | |
400 LOAD_CONST 14 (('min', 'max')) | |
402 CALL_FUNCTION_KW 3 | |
404 STORE_FAST 1 (hidden_states) | |
709 >> 406 LOAD_FAST 17 (present_key_value_state) | |
408 LOAD_CONST 0 (None) | |
410 COMPARE_OP 9 (is not) | |
412 EXTENDED_ARG 1 | |
414 POP_JUMP_IF_FALSE 428 | |
710 416 LOAD_FAST 17 (present_key_value_state) | |
418 LOAD_FAST 22 (cross_attention_outputs) | |
420 LOAD_CONST 15 (1) | |
422 BINARY_SUBSCR | |
424 BINARY_ADD | |
426 STORE_FAST 17 (present_key_value_state) | |
713 >> 428 LOAD_FAST 18 (attention_outputs) | |
430 LOAD_FAST 22 (cross_attention_outputs) | |
432 LOAD_CONST 2 (2) | |
434 LOAD_CONST 0 (None) | |
436 BUILD_SLICE 2 | |
438 BINARY_SUBSCR | |
440 BINARY_ADD | |
442 STORE_FAST 18 (attention_outputs) | |
716 >> 444 LOAD_FAST 0 (self) | |
446 LOAD_ATTR 5 (layer) | |
448 LOAD_CONST 17 (-1) | |
450 BINARY_SUBSCR | |
452 LOAD_FAST 1 (hidden_states) | |
454 CALL_FUNCTION 1 | |
456 STORE_FAST 1 (hidden_states) | |
719 458 LOAD_FAST 1 (hidden_states) | |
460 LOAD_ATTR 6 (dtype) | |
462 LOAD_GLOBAL 7 (torch) | |
464 LOAD_ATTR 8 (float16) | |
466 COMPARE_OP 2 (==) | |
468 EXTENDED_ARG 2 | |
470 POP_JUMP_IF_FALSE 524 | |
472 LOAD_GLOBAL 7 (torch) | |
474 LOAD_METHOD 9 (isinf) | |
476 LOAD_FAST 1 (hidden_states) | |
478 CALL_METHOD 1 | |
480 LOAD_METHOD 10 (any) | |
482 CALL_METHOD 0 | |
484 EXTENDED_ARG 2 | |
486 POP_JUMP_IF_FALSE 524 | |
720 488 LOAD_GLOBAL 7 (torch) | |
490 LOAD_METHOD 11 (finfo) | |
492 LOAD_FAST 1 (hidden_states) | |
494 LOAD_ATTR 6 (dtype) | |
496 CALL_METHOD 1 | |
498 LOAD_ATTR 12 (max) | |
500 LOAD_CONST 13 (1000) | |
502 BINARY_SUBTRACT | |
504 STORE_FAST 19 (clamp_value) | |
721 506 LOAD_GLOBAL 7 (torch) | |
508 LOAD_ATTR 13 (clamp) | |
510 LOAD_FAST 1 (hidden_states) | |
512 LOAD_FAST 19 (clamp_value) | |
514 UNARY_NEGATIVE | |
516 LOAD_FAST 19 (clamp_value) | |
518 LOAD_CONST 14 (('min', 'max')) | |
520 CALL_FUNCTION_KW 3 | |
522 STORE_FAST 1 (hidden_states) | |
723 >> 524 LOAD_FAST 1 (hidden_states) | |
526 BUILD_TUPLE 1 | |
528 STORE_FAST 23 (outputs) | |
725 530 LOAD_FAST 10 (use_cache) | |
532 EXTENDED_ARG 2 | |
534 POP_JUMP_IF_FALSE 552 | |
726 536 LOAD_FAST 23 (outputs) | |
538 LOAD_FAST 17 (present_key_value_state) | |
540 BUILD_TUPLE 1 | |
542 BINARY_ADD | |
544 LOAD_FAST 18 (attention_outputs) | |
546 BINARY_ADD | |
548 STORE_FAST 23 (outputs) | |
550 JUMP_FORWARD 8 (to 560) | |
728 >> 552 LOAD_FAST 23 (outputs) | |
554 LOAD_FAST 18 (attention_outputs) | |
556 BINARY_ADD | |
558 STORE_FAST 23 (outputs) | |
730 >> 560 LOAD_FAST 23 (outputs) | |
562 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None, None) [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attn_past_key_value [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_past_key_value [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attn_past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
569 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 8 (normed_hidden_states) | |
570 10 LOAD_FAST 0 (self) | |
12 LOAD_ATTR 1 (SelfAttention) | |
571 14 LOAD_FAST 8 (normed_hidden_states) | |
572 16 LOAD_FAST 2 (attention_mask) | |
573 18 LOAD_FAST 3 (position_bias) | |
574 20 LOAD_FAST 4 (layer_head_mask) | |
575 22 LOAD_FAST 5 (past_key_value) | |
576 24 LOAD_FAST 6 (use_cache) | |
577 26 LOAD_FAST 7 (output_attentions) | |
570 28 LOAD_CONST 1 (('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
30 CALL_FUNCTION_KW 7 | |
32 STORE_FAST 9 (attention_output) | |
579 34 LOAD_FAST 1 (hidden_states) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_METHOD 2 (dropout) | |
40 LOAD_FAST 9 (attention_output) | |
42 LOAD_CONST 2 (0) | |
44 BINARY_SUBSCR | |
46 CALL_METHOD 1 | |
48 BINARY_ADD | |
50 STORE_FAST 1 (hidden_states) | |
580 52 LOAD_FAST 1 (hidden_states) | |
54 BUILD_TUPLE 1 | |
56 LOAD_FAST 9 (attention_output) | |
58 LOAD_CONST 3 (1) | |
60 LOAD_CONST 0 (None) | |
62 BUILD_SLICE 2 | |
64 BINARY_SUBSCR | |
66 BINARY_ADD | |
68 STORE_FAST 10 (outputs) | |
581 70 LOAD_FAST 10 (outputs) | |
72 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST normed_hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR SelfAttention [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST normed_hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
454 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_ATTR 0 (shape) | |
4 LOAD_CONST 1 (None) | |
6 LOAD_CONST 2 (2) | |
8 BUILD_SLICE 2 | |
10 BINARY_SUBSCR | |
12 UNPACK_SEQUENCE 2 | |
14 STORE_DEREF 0 (batch_size) | |
16 STORE_FAST 10 (seq_length) | |
456 18 LOAD_FAST 10 (seq_length) | |
20 STORE_FAST 11 (real_seq_length) | |
458 22 LOAD_FAST 5 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 9 (is not) | |
28 POP_JUMP_IF_FALSE 92 | |
460 30 LOAD_GLOBAL 1 (len) | |
32 LOAD_FAST 5 (past_key_value) | |
34 CALL_FUNCTION 1 | |
36 LOAD_CONST 2 (2) | |
38 COMPARE_OP 2 (==) | |
459 40 POP_JUMP_IF_TRUE 62 | |
42 LOAD_GLOBAL 2 (AssertionError) | |
461 44 LOAD_CONST 3 ('past_key_value should have 2 past states: keys and values. Got ') | |
46 LOAD_GLOBAL 1 (len) | |
48 LOAD_FAST 5 (past_key_value) | |
50 CALL_FUNCTION 1 | |
52 FORMAT_VALUE 0 | |
54 LOAD_CONST 4 (' past states') | |
56 BUILD_STRING 3 | |
459 58 CALL_FUNCTION 1 | |
60 RAISE_VARARGS 1 | |
462 >> 62 LOAD_FAST 11 (real_seq_length) | |
64 LOAD_FAST 7 (query_length) | |
66 LOAD_CONST 1 (None) | |
68 COMPARE_OP 8 (is) | |
70 POP_JUMP_IF_FALSE 86 | |
72 LOAD_FAST 5 (past_key_value) | |
74 LOAD_CONST 5 (0) | |
76 BINARY_SUBSCR | |
78 LOAD_ATTR 0 (shape) | |
80 LOAD_CONST 2 (2) | |
82 BINARY_SUBSCR | |
84 JUMP_FORWARD 2 (to 88) | |
>> 86 LOAD_FAST 7 (query_length) | |
>> 88 INPLACE_ADD | |
90 STORE_FAST 11 (real_seq_length) | |
464 >> 92 LOAD_FAST 3 (key_value_states) | |
94 LOAD_CONST 1 (None) | |
96 COMPARE_OP 8 (is) | |
98 POP_JUMP_IF_FALSE 104 | |
100 LOAD_FAST 11 (real_seq_length) | |
102 JUMP_FORWARD 8 (to 112) | |
>> 104 LOAD_FAST 3 (key_value_states) | |
106 LOAD_ATTR 0 (shape) | |
108 LOAD_CONST 6 (1) | |
110 BINARY_SUBSCR | |
>> 112 STORE_FAST 12 (key_length) | |
466 114 LOAD_CLOSURE 0 (batch_size) | |
116 LOAD_CLOSURE 1 (self) | |
118 BUILD_TUPLE 2 | |
120 LOAD_CONST 7 (<code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466>) | |
122 LOAD_CONST 8 ('T5Attention.forward.<locals>.shape') | |
124 MAKE_FUNCTION 8 (closure) | |
126 STORE_DEREF 2 (shape) | |
470 128 LOAD_CLOSURE 0 (batch_size) | |
130 LOAD_CLOSURE 1 (self) | |
132 BUILD_TUPLE 2 | |
134 LOAD_CONST 9 (<code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470>) | |
136 LOAD_CONST 10 ('T5Attention.forward.<locals>.unshape') | |
138 MAKE_FUNCTION 8 (closure) | |
140 STORE_FAST 13 (unshape) | |
474 142 LOAD_CLOSURE 2 (shape) | |
144 BUILD_TUPLE 1 | |
146 LOAD_CONST 11 (<code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474>) | |
148 LOAD_CONST 12 ('T5Attention.forward.<locals>.project') | |
150 MAKE_FUNCTION 8 (closure) | |
152 STORE_FAST 14 (project) | |
496 154 LOAD_DEREF 2 (shape) | |
156 LOAD_DEREF 1 (self) | |
158 LOAD_METHOD 3 (q) | |
160 LOAD_FAST 1 (hidden_states) | |
162 CALL_METHOD 1 | |
164 CALL_FUNCTION 1 | |
166 STORE_FAST 15 (query_states) | |
499 168 LOAD_FAST 14 (project) | |
500 170 LOAD_FAST 1 (hidden_states) | |
172 LOAD_DEREF 1 (self) | |
174 LOAD_ATTR 4 (k) | |
176 LOAD_FAST 3 (key_value_states) | |
178 LOAD_FAST 5 (past_key_value) | |
180 LOAD_CONST 1 (None) | |
182 COMPARE_OP 9 (is not) | |
184 POP_JUMP_IF_FALSE 194 | |
186 LOAD_FAST 5 (past_key_value) | |
188 LOAD_CONST 5 (0) | |
190 BINARY_SUBSCR | |
192 JUMP_FORWARD 2 (to 196) | |
>> 194 LOAD_CONST 1 (None) | |
499 >> 196 CALL_FUNCTION 4 | |
198 STORE_FAST 16 (key_states) | |
502 200 LOAD_FAST 14 (project) | |
503 202 LOAD_FAST 1 (hidden_states) | |
204 LOAD_DEREF 1 (self) | |
206 LOAD_ATTR 5 (v) | |
208 LOAD_FAST 3 (key_value_states) | |
210 LOAD_FAST 5 (past_key_value) | |
212 LOAD_CONST 1 (None) | |
214 COMPARE_OP 9 (is not) | |
216 POP_JUMP_IF_FALSE 226 | |
218 LOAD_FAST 5 (past_key_value) | |
220 LOAD_CONST 6 (1) | |
222 BINARY_SUBSCR | |
224 JUMP_FORWARD 2 (to 228) | |
>> 226 LOAD_CONST 1 (None) | |
502 >> 228 CALL_FUNCTION 4 | |
230 STORE_FAST 17 (value_states) | |
507 232 LOAD_GLOBAL 6 (torch) | |
234 LOAD_METHOD 7 (matmul) | |
508 236 LOAD_FAST 15 (query_states) | |
238 LOAD_FAST 16 (key_states) | |
240 LOAD_METHOD 8 (transpose) | |
242 LOAD_CONST 13 (3) | |
244 LOAD_CONST 2 (2) | |
246 CALL_METHOD 2 | |
507 248 CALL_METHOD 2 | |
250 STORE_FAST 18 (scores) | |
511 252 LOAD_FAST 4 (position_bias) | |
254 LOAD_CONST 1 (None) | |
256 COMPARE_OP 8 (is) | |
258 EXTENDED_ARG 1 | |
260 POP_JUMP_IF_FALSE 410 | |
512 262 LOAD_DEREF 1 (self) | |
264 LOAD_ATTR 9 (has_relative_attention_bias) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_TRUE 324 | |
513 270 LOAD_GLOBAL 6 (torch) | |
272 LOAD_ATTR 10 (zeros) | |
514 274 LOAD_CONST 6 (1) | |
276 LOAD_DEREF 1 (self) | |
278 LOAD_ATTR 11 (n_heads) | |
280 LOAD_FAST 11 (real_seq_length) | |
282 LOAD_FAST 12 (key_length) | |
284 BUILD_TUPLE 4 | |
286 LOAD_FAST 18 (scores) | |
288 LOAD_ATTR 12 (device) | |
290 LOAD_FAST 18 (scores) | |
292 LOAD_ATTR 13 (dtype) | |
513 294 LOAD_CONST 14 (('device', 'dtype')) | |
296 CALL_FUNCTION_KW 3 | |
298 STORE_FAST 4 (position_bias) | |
516 300 LOAD_DEREF 1 (self) | |
302 LOAD_ATTR 14 (gradient_checkpointing) | |
304 EXTENDED_ARG 1 | |
306 POP_JUMP_IF_FALSE 342 | |
308 LOAD_DEREF 1 (self) | |
310 LOAD_ATTR 15 (training) | |
312 EXTENDED_ARG 1 | |
314 POP_JUMP_IF_FALSE 342 | |
517 316 LOAD_CONST 15 (True) | |
318 LOAD_FAST 4 (position_bias) | |
320 STORE_ATTR 16 (requires_grad) | |
322 JUMP_FORWARD 18 (to 342) | |
519 >> 324 LOAD_DEREF 1 (self) | |
326 LOAD_ATTR 17 (compute_bias) | |
328 LOAD_FAST 11 (real_seq_length) | |
330 LOAD_FAST 12 (key_length) | |
332 LOAD_FAST 18 (scores) | |
334 LOAD_ATTR 12 (device) | |
336 LOAD_CONST 16 (('device',)) | |
338 CALL_FUNCTION_KW 3 | |
340 STORE_FAST 4 (position_bias) | |
523 >> 342 LOAD_FAST 5 (past_key_value) | |
344 LOAD_CONST 1 (None) | |
346 COMPARE_OP 9 (is not) | |
348 EXTENDED_ARG 1 | |
350 POP_JUMP_IF_FALSE 392 | |
524 352 LOAD_FAST 4 (position_bias) | |
354 LOAD_CONST 1 (None) | |
356 LOAD_CONST 1 (None) | |
358 BUILD_SLICE 2 | |
360 LOAD_CONST 1 (None) | |
362 LOAD_CONST 1 (None) | |
364 BUILD_SLICE 2 | |
366 LOAD_FAST 1 (hidden_states) | |
368 LOAD_METHOD 18 (size) | |
370 LOAD_CONST 6 (1) | |
372 CALL_METHOD 1 | |
374 UNARY_NEGATIVE | |
376 LOAD_CONST 1 (None) | |
378 BUILD_SLICE 2 | |
380 LOAD_CONST 1 (None) | |
382 LOAD_CONST 1 (None) | |
384 BUILD_SLICE 2 | |
386 BUILD_TUPLE 4 | |
388 BINARY_SUBSCR | |
390 STORE_FAST 4 (position_bias) | |
526 >> 392 LOAD_FAST 2 (mask) | |
394 LOAD_CONST 1 (None) | |
396 COMPARE_OP 9 (is not) | |
398 EXTENDED_ARG 1 | |
400 POP_JUMP_IF_FALSE 410 | |
527 402 LOAD_FAST 4 (position_bias) | |
404 LOAD_FAST 2 (mask) | |
406 BINARY_ADD | |
408 STORE_FAST 4 (position_bias) | |
529 >> 410 LOAD_FAST 18 (scores) | |
412 LOAD_FAST 4 (position_bias) | |
414 INPLACE_ADD | |
416 STORE_FAST 18 (scores) | |
530 418 LOAD_GLOBAL 19 (nn) | |
420 LOAD_ATTR 20 (functional) | |
422 LOAD_ATTR 21 (softmax) | |
424 LOAD_FAST 18 (scores) | |
426 LOAD_METHOD 22 (float) | |
428 CALL_METHOD 0 | |
430 LOAD_CONST 17 (-1) | |
432 LOAD_CONST 18 (('dim',)) | |
434 CALL_FUNCTION_KW 2 | |
436 LOAD_METHOD 23 (type_as) | |
531 438 LOAD_FAST 18 (scores) | |
530 440 CALL_METHOD 1 | |
442 STORE_FAST 19 (attn_weights) | |
533 444 LOAD_GLOBAL 19 (nn) | |
446 LOAD_ATTR 20 (functional) | |
448 LOAD_ATTR 24 (dropout) | |
534 450 LOAD_FAST 19 (attn_weights) | |
452 LOAD_DEREF 1 (self) | |
454 LOAD_ATTR 24 (dropout) | |
456 LOAD_DEREF 1 (self) | |
458 LOAD_ATTR 15 (training) | |
533 460 LOAD_CONST 19 (('p', 'training')) | |
462 CALL_FUNCTION_KW 3 | |
464 STORE_FAST 19 (attn_weights) | |
538 466 LOAD_FAST 6 (layer_head_mask) | |
468 LOAD_CONST 1 (None) | |
470 COMPARE_OP 9 (is not) | |
472 EXTENDED_ARG 1 | |
474 POP_JUMP_IF_FALSE 484 | |
539 476 LOAD_FAST 19 (attn_weights) | |
478 LOAD_FAST 6 (layer_head_mask) | |
480 BINARY_MULTIPLY | |
482 STORE_FAST 19 (attn_weights) | |
541 >> 484 LOAD_FAST 13 (unshape) | |
486 LOAD_GLOBAL 6 (torch) | |
488 LOAD_METHOD 7 (matmul) | |
490 LOAD_FAST 19 (attn_weights) | |
492 LOAD_FAST 17 (value_states) | |
494 CALL_METHOD 2 | |
496 CALL_FUNCTION 1 | |
498 STORE_FAST 20 (attn_output) | |
542 500 LOAD_DEREF 1 (self) | |
502 LOAD_METHOD 25 (o) | |
504 LOAD_FAST 20 (attn_output) | |
506 CALL_METHOD 1 | |
508 STORE_FAST 20 (attn_output) | |
544 510 LOAD_DEREF 1 (self) | |
512 LOAD_ATTR 26 (is_decoder) | |
514 EXTENDED_ARG 2 | |
516 POP_JUMP_IF_FALSE 532 | |
518 LOAD_FAST 8 (use_cache) | |
520 EXTENDED_ARG 2 | |
522 POP_JUMP_IF_FALSE 532 | |
524 LOAD_FAST 16 (key_states) | |
526 LOAD_FAST 17 (value_states) | |
528 BUILD_TUPLE 2 | |
530 JUMP_FORWARD 2 (to 534) | |
>> 532 LOAD_CONST 1 (None) | |
>> 534 STORE_FAST 21 (present_key_value_state) | |
545 536 LOAD_FAST 20 (attn_output) | |
538 BUILD_TUPLE 1 | |
540 LOAD_FAST 21 (present_key_value_state) | |
542 BUILD_TUPLE 1 | |
544 BINARY_ADD | |
546 LOAD_FAST 4 (position_bias) | |
548 BUILD_TUPLE 1 | |
550 BINARY_ADD | |
552 STORE_FAST 22 (outputs) | |
547 554 LOAD_FAST 9 (output_attentions) | |
556 EXTENDED_ARG 2 | |
558 POP_JUMP_IF_FALSE 570 | |
548 560 LOAD_FAST 22 (outputs) | |
562 LOAD_FAST 19 (attn_weights) | |
564 BUILD_TUPLE 1 | |
566 BINARY_ADD | |
568 STORE_FAST 22 (outputs) | |
549 >> 570 LOAD_FAST 22 (outputs) | |
572 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ShapeVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ShapeVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ShapeVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST real_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 92 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 104 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 112 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.shape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF shape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.unshape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST unshape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.project [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST project [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR q [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST query_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR k [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 194 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR v [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 226 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST value_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR softmax [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [TorchVariable(<function softmax at 0x7f3f1bf53e50>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dim',) [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR type_as [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [GetAttrVariable(TensorVariable(), type_as)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), type_as), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [TorchVariable(<function dropout at 0x7f3f1bf53310>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR training [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('p', 'training') [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 484 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST unshape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [NestedUserFunctionVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST value_states [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
472 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (transpose) | |
4 LOAD_CONST 1 (1) | |
6 LOAD_CONST 2 (2) | |
8 CALL_METHOD 2 | |
10 LOAD_METHOD 1 (contiguous) | |
12 CALL_METHOD 0 | |
14 LOAD_METHOD 2 (view) | |
16 LOAD_DEREF 0 (batch_size) | |
18 LOAD_CONST 3 (-1) | |
20 LOAD_DEREF 1 (self) | |
22 LOAD_ATTR 3 (inner_dim) | |
24 CALL_METHOD 3 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR contiguous [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), contiguous)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR inner_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR o [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 532 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_state [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 570 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TensorVariable(), NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), NNModuleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 248 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_IF_FALSE_OR_POP 262 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST do_cross_attention [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST do_cross_attention [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 444 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
325 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 2 (forwarded_states) | |
326 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (DenseReluDense) | |
14 LOAD_FAST 2 (forwarded_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 2 (forwarded_states) | |
327 20 LOAD_FAST 1 (hidden_states) | |
22 LOAD_FAST 0 (self) | |
24 LOAD_METHOD 2 (dropout) | |
26 LOAD_FAST 2 (forwarded_states) | |
28 CALL_METHOD 1 | |
30 BINARY_ADD | |
32 STORE_FAST 1 (hidden_states) | |
328 34 LOAD_FAST 1 (hidden_states) | |
36 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR DenseReluDense [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
288 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (wi) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 1 (hidden_states) | |
289 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (act) | |
14 LOAD_FAST 1 (hidden_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 1 (hidden_states) | |
290 20 LOAD_FAST 0 (self) | |
22 LOAD_METHOD 2 (dropout) | |
24 LOAD_FAST 1 (hidden_states) | |
26 CALL_METHOD 1 | |
28 STORE_FAST 1 (hidden_states) | |
291 30 LOAD_FAST 0 (self) | |
32 LOAD_METHOD 3 (wo) | |
34 LOAD_FAST 1 (hidden_states) | |
36 CALL_METHOD 1 | |
38 STORE_FAST 1 (hidden_states) | |
292 40 LOAD_FAST 1 (hidden_states) | |
42 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wi [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR act [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wo [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 524 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 552 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST False [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ListIteratorVariable(), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1096 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None,) [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ListIteratorVariable(), ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1156 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1172 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1214 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 724 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST i [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_module [ListIteratorVariable(), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_value [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 930 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 946 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR gradient_checkpointing [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1028 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_module [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_hidden_states [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_decoder_position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 11 [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
647 0 LOAD_FAST 9 (past_key_value) | |
2 LOAD_CONST 0 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 122 | |
648 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (is_decoder) | |
12 POP_JUMP_IF_TRUE 24 | |
649 14 LOAD_GLOBAL 1 (logger) | |
16 LOAD_METHOD 2 (warning) | |
18 LOAD_CONST 1 ('`past_key_values` is passed to the encoder. Please make sure this is intended.') | |
20 CALL_METHOD 1 | |
22 POP_TOP | |
650 >> 24 LOAD_FAST 4 (encoder_hidden_states) | |
26 LOAD_CONST 0 (None) | |
28 COMPARE_OP 8 (is) | |
30 POP_JUMP_IF_FALSE 36 | |
32 LOAD_CONST 2 (2) | |
34 JUMP_FORWARD 2 (to 38) | |
>> 36 LOAD_CONST 3 (4) | |
>> 38 STORE_FAST 13 (expected_num_past_key_values) | |
652 40 LOAD_GLOBAL 3 (len) | |
42 LOAD_FAST 9 (past_key_value) | |
44 CALL_FUNCTION 1 | |
46 LOAD_FAST 13 (expected_num_past_key_values) | |
48 COMPARE_OP 3 (!=) | |
50 POP_JUMP_IF_FALSE 96 | |
653 52 LOAD_GLOBAL 4 (ValueError) | |
654 54 LOAD_CONST 4 ('There should be ') | |
56 LOAD_FAST 13 (expected_num_past_key_values) | |
58 FORMAT_VALUE 0 | |
60 LOAD_CONST 5 (' past states. ') | |
62 LOAD_FAST 13 (expected_num_past_key_values) | |
64 LOAD_CONST 3 (4) | |
66 COMPARE_OP 2 (==) | |
68 POP_JUMP_IF_FALSE 74 | |
70 LOAD_CONST 6 ('2 (past / key) for cross attention. ') | |
72 JUMP_FORWARD 2 (to 76) | |
>> 74 LOAD_CONST 7 ('') | |
>> 76 FORMAT_VALUE 0 | |
78 LOAD_CONST 8 ('Got ') | |
80 LOAD_GLOBAL 3 (len) | |
82 LOAD_FAST 9 (past_key_value) | |
84 CALL_FUNCTION 1 | |
86 FORMAT_VALUE 0 | |
88 LOAD_CONST 9 (' past key / value states') | |
90 BUILD_STRING 7 | |
653 92 CALL_FUNCTION 1 | |
94 RAISE_VARARGS 1 | |
659 >> 96 LOAD_FAST 9 (past_key_value) | |
98 LOAD_CONST 0 (None) | |
100 LOAD_CONST 2 (2) | |
102 BUILD_SLICE 2 | |
104 BINARY_SUBSCR | |
106 STORE_FAST 14 (self_attn_past_key_value) | |
660 108 LOAD_FAST 9 (past_key_value) | |
110 LOAD_CONST 2 (2) | |
112 LOAD_CONST 0 (None) | |
114 BUILD_SLICE 2 | |
116 BINARY_SUBSCR | |
118 STORE_FAST 15 (cross_attn_past_key_value) | |
120 JUMP_FORWARD 8 (to 130) | |
662 >> 122 LOAD_CONST 10 ((None, None)) | |
124 UNPACK_SEQUENCE 2 | |
126 STORE_FAST 14 (self_attn_past_key_value) | |
128 STORE_FAST 15 (cross_attn_past_key_value) | |
664 >> 130 LOAD_FAST 0 (self) | |
132 LOAD_ATTR 5 (layer) | |
134 LOAD_CONST 11 (0) | |
136 BINARY_SUBSCR | |
665 138 LOAD_FAST 1 (hidden_states) | |
666 140 LOAD_FAST 2 (attention_mask) | |
667 142 LOAD_FAST 3 (position_bias) | |
668 144 LOAD_FAST 7 (layer_head_mask) | |
669 146 LOAD_FAST 14 (self_attn_past_key_value) | |
670 148 LOAD_FAST 10 (use_cache) | |
671 150 LOAD_FAST 11 (output_attentions) | |
664 152 LOAD_CONST 12 (('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
154 CALL_FUNCTION_KW 7 | |
156 STORE_FAST 16 (self_attention_outputs) | |
673 158 LOAD_FAST 16 (self_attention_outputs) | |
160 LOAD_CONST 0 (None) | |
162 LOAD_CONST 2 (2) | |
164 BUILD_SLICE 2 | |
166 BINARY_SUBSCR | |
168 UNPACK_SEQUENCE 2 | |
170 STORE_FAST 1 (hidden_states) | |
172 STORE_FAST 17 (present_key_value_state) | |
674 174 LOAD_FAST 16 (self_attention_outputs) | |
176 LOAD_CONST 2 (2) | |
178 LOAD_CONST 0 (None) | |
180 BUILD_SLICE 2 | |
182 BINARY_SUBSCR | |
184 STORE_FAST 18 (attention_outputs) | |
677 186 LOAD_FAST 1 (hidden_states) | |
188 LOAD_ATTR 6 (dtype) | |
190 LOAD_GLOBAL 7 (torch) | |
192 LOAD_ATTR 8 (float16) | |
194 COMPARE_OP 2 (==) | |
196 POP_JUMP_IF_FALSE 248 | |
198 LOAD_GLOBAL 7 (torch) | |
200 LOAD_METHOD 9 (isinf) | |
202 LOAD_FAST 1 (hidden_states) | |
204 CALL_METHOD 1 | |
206 LOAD_METHOD 10 (any) | |
208 CALL_METHOD 0 | |
210 POP_JUMP_IF_FALSE 248 | |
678 212 LOAD_GLOBAL 7 (torch) | |
214 LOAD_METHOD 11 (finfo) | |
216 LOAD_FAST 1 (hidden_states) | |
218 LOAD_ATTR 6 (dtype) | |
220 CALL_METHOD 1 | |
222 LOAD_ATTR 12 (max) | |
224 LOAD_CONST 13 (1000) | |
226 BINARY_SUBTRACT | |
228 STORE_FAST 19 (clamp_value) | |
679 230 LOAD_GLOBAL 7 (torch) | |
232 LOAD_ATTR 13 (clamp) | |
234 LOAD_FAST 1 (hidden_states) | |
236 LOAD_FAST 19 (clamp_value) | |
238 UNARY_NEGATIVE | |
240 LOAD_FAST 19 (clamp_value) | |
242 LOAD_CONST 14 (('min', 'max')) | |
244 CALL_FUNCTION_KW 3 | |
246 STORE_FAST 1 (hidden_states) | |
681 >> 248 LOAD_FAST 0 (self) | |
250 LOAD_ATTR 0 (is_decoder) | |
252 EXTENDED_ARG 1 | |
254 JUMP_IF_FALSE_OR_POP 262 | |
256 LOAD_FAST 4 (encoder_hidden_states) | |
258 LOAD_CONST 0 (None) | |
260 COMPARE_OP 9 (is not) | |
>> 262 STORE_FAST 20 (do_cross_attention) | |
682 264 LOAD_FAST 20 (do_cross_attention) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_FALSE 444 | |
685 270 LOAD_FAST 17 (present_key_value_state) | |
272 LOAD_CONST 0 (None) | |
274 COMPARE_OP 9 (is not) | |
276 EXTENDED_ARG 1 | |
278 POP_JUMP_IF_FALSE 296 | |
686 280 LOAD_FAST 17 (present_key_value_state) | |
282 LOAD_CONST 11 (0) | |
284 BINARY_SUBSCR | |
286 LOAD_ATTR 14 (shape) | |
288 LOAD_CONST 2 (2) | |
290 BINARY_SUBSCR | |
292 STORE_FAST 21 (query_length) | |
294 JUMP_FORWARD 4 (to 300) | |
688 >> 296 LOAD_CONST 0 (None) | |
298 STORE_FAST 21 (query_length) | |
690 >> 300 LOAD_FAST 0 (self) | |
302 LOAD_ATTR 5 (layer) | |
304 LOAD_CONST 15 (1) | |
306 BINARY_SUBSCR | |
691 308 LOAD_FAST 1 (hidden_states) | |
692 310 LOAD_FAST 4 (encoder_hidden_states) | |
693 312 LOAD_FAST 5 (encoder_attention_mask) | |
694 314 LOAD_FAST 6 (encoder_decoder_position_bias) | |
695 316 LOAD_FAST 8 (cross_attn_layer_head_mask) | |
696 318 LOAD_FAST 15 (cross_attn_past_key_value) | |
697 320 LOAD_FAST 21 (query_length) | |
698 322 LOAD_FAST 10 (use_cache) | |
699 324 LOAD_FAST 11 (output_attentions) | |
690 326 LOAD_CONST 16 (('key_value_states', 'attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'query_length', 'use_cache', 'output_attentions')) | |
328 CALL_FUNCTION_KW 9 | |
330 STORE_FAST 22 (cross_attention_outputs) | |
701 332 LOAD_FAST 22 (cross_attention_outputs) | |
334 LOAD_CONST 11 (0) | |
336 BINARY_SUBSCR | |
338 STORE_FAST 1 (hidden_states) | |
704 340 LOAD_FAST 1 (hidden_states) | |
342 LOAD_ATTR 6 (dtype) | |
344 LOAD_GLOBAL 7 (torch) | |
346 LOAD_ATTR 8 (float16) | |
348 COMPARE_OP 2 (==) | |
350 EXTENDED_ARG 1 | |
352 POP_JUMP_IF_FALSE 406 | |
354 LOAD_GLOBAL 7 (torch) | |
356 LOAD_METHOD 9 (isinf) | |
358 LOAD_FAST 1 (hidden_states) | |
360 CALL_METHOD 1 | |
362 LOAD_METHOD 10 (any) | |
364 CALL_METHOD 0 | |
366 EXTENDED_ARG 1 | |
368 POP_JUMP_IF_FALSE 406 | |
705 370 LOAD_GLOBAL 7 (torch) | |
372 LOAD_METHOD 11 (finfo) | |
374 LOAD_FAST 1 (hidden_states) | |
376 LOAD_ATTR 6 (dtype) | |
378 CALL_METHOD 1 | |
380 LOAD_ATTR 12 (max) | |
382 LOAD_CONST 13 (1000) | |
384 BINARY_SUBTRACT | |
386 STORE_FAST 19 (clamp_value) | |
706 388 LOAD_GLOBAL 7 (torch) | |
390 LOAD_ATTR 13 (clamp) | |
392 LOAD_FAST 1 (hidden_states) | |
394 LOAD_FAST 19 (clamp_value) | |
396 UNARY_NEGATIVE | |
398 LOAD_FAST 19 (clamp_value) | |
400 LOAD_CONST 14 (('min', 'max')) | |
402 CALL_FUNCTION_KW 3 | |
404 STORE_FAST 1 (hidden_states) | |
709 >> 406 LOAD_FAST 17 (present_key_value_state) | |
408 LOAD_CONST 0 (None) | |
410 COMPARE_OP 9 (is not) | |
412 EXTENDED_ARG 1 | |
414 POP_JUMP_IF_FALSE 428 | |
710 416 LOAD_FAST 17 (present_key_value_state) | |
418 LOAD_FAST 22 (cross_attention_outputs) | |
420 LOAD_CONST 15 (1) | |
422 BINARY_SUBSCR | |
424 BINARY_ADD | |
426 STORE_FAST 17 (present_key_value_state) | |
713 >> 428 LOAD_FAST 18 (attention_outputs) | |
430 LOAD_FAST 22 (cross_attention_outputs) | |
432 LOAD_CONST 2 (2) | |
434 LOAD_CONST 0 (None) | |
436 BUILD_SLICE 2 | |
438 BINARY_SUBSCR | |
440 BINARY_ADD | |
442 STORE_FAST 18 (attention_outputs) | |
716 >> 444 LOAD_FAST 0 (self) | |
446 LOAD_ATTR 5 (layer) | |
448 LOAD_CONST 17 (-1) | |
450 BINARY_SUBSCR | |
452 LOAD_FAST 1 (hidden_states) | |
454 CALL_FUNCTION 1 | |
456 STORE_FAST 1 (hidden_states) | |
719 458 LOAD_FAST 1 (hidden_states) | |
460 LOAD_ATTR 6 (dtype) | |
462 LOAD_GLOBAL 7 (torch) | |
464 LOAD_ATTR 8 (float16) | |
466 COMPARE_OP 2 (==) | |
468 EXTENDED_ARG 2 | |
470 POP_JUMP_IF_FALSE 524 | |
472 LOAD_GLOBAL 7 (torch) | |
474 LOAD_METHOD 9 (isinf) | |
476 LOAD_FAST 1 (hidden_states) | |
478 CALL_METHOD 1 | |
480 LOAD_METHOD 10 (any) | |
482 CALL_METHOD 0 | |
484 EXTENDED_ARG 2 | |
486 POP_JUMP_IF_FALSE 524 | |
720 488 LOAD_GLOBAL 7 (torch) | |
490 LOAD_METHOD 11 (finfo) | |
492 LOAD_FAST 1 (hidden_states) | |
494 LOAD_ATTR 6 (dtype) | |
496 CALL_METHOD 1 | |
498 LOAD_ATTR 12 (max) | |
500 LOAD_CONST 13 (1000) | |
502 BINARY_SUBTRACT | |
504 STORE_FAST 19 (clamp_value) | |
721 506 LOAD_GLOBAL 7 (torch) | |
508 LOAD_ATTR 13 (clamp) | |
510 LOAD_FAST 1 (hidden_states) | |
512 LOAD_FAST 19 (clamp_value) | |
514 UNARY_NEGATIVE | |
516 LOAD_FAST 19 (clamp_value) | |
518 LOAD_CONST 14 (('min', 'max')) | |
520 CALL_FUNCTION_KW 3 | |
522 STORE_FAST 1 (hidden_states) | |
723 >> 524 LOAD_FAST 1 (hidden_states) | |
526 BUILD_TUPLE 1 | |
528 STORE_FAST 23 (outputs) | |
725 530 LOAD_FAST 10 (use_cache) | |
532 EXTENDED_ARG 2 | |
534 POP_JUMP_IF_FALSE 552 | |
726 536 LOAD_FAST 23 (outputs) | |
538 LOAD_FAST 17 (present_key_value_state) | |
540 BUILD_TUPLE 1 | |
542 BINARY_ADD | |
544 LOAD_FAST 18 (attention_outputs) | |
546 BINARY_ADD | |
548 STORE_FAST 23 (outputs) | |
550 JUMP_FORWARD 8 (to 560) | |
728 >> 552 LOAD_FAST 23 (outputs) | |
554 LOAD_FAST 18 (attention_outputs) | |
556 BINARY_ADD | |
558 STORE_FAST 23 (outputs) | |
730 >> 560 LOAD_FAST 23 (outputs) | |
562 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None, None) [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attn_past_key_value [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_past_key_value [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attn_past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
569 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 8 (normed_hidden_states) | |
570 10 LOAD_FAST 0 (self) | |
12 LOAD_ATTR 1 (SelfAttention) | |
571 14 LOAD_FAST 8 (normed_hidden_states) | |
572 16 LOAD_FAST 2 (attention_mask) | |
573 18 LOAD_FAST 3 (position_bias) | |
574 20 LOAD_FAST 4 (layer_head_mask) | |
575 22 LOAD_FAST 5 (past_key_value) | |
576 24 LOAD_FAST 6 (use_cache) | |
577 26 LOAD_FAST 7 (output_attentions) | |
570 28 LOAD_CONST 1 (('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
30 CALL_FUNCTION_KW 7 | |
32 STORE_FAST 9 (attention_output) | |
579 34 LOAD_FAST 1 (hidden_states) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_METHOD 2 (dropout) | |
40 LOAD_FAST 9 (attention_output) | |
42 LOAD_CONST 2 (0) | |
44 BINARY_SUBSCR | |
46 CALL_METHOD 1 | |
48 BINARY_ADD | |
50 STORE_FAST 1 (hidden_states) | |
580 52 LOAD_FAST 1 (hidden_states) | |
54 BUILD_TUPLE 1 | |
56 LOAD_FAST 9 (attention_output) | |
58 LOAD_CONST 3 (1) | |
60 LOAD_CONST 0 (None) | |
62 BUILD_SLICE 2 | |
64 BINARY_SUBSCR | |
66 BINARY_ADD | |
68 STORE_FAST 10 (outputs) | |
581 70 LOAD_FAST 10 (outputs) | |
72 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST normed_hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR SelfAttention [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST normed_hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
454 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_ATTR 0 (shape) | |
4 LOAD_CONST 1 (None) | |
6 LOAD_CONST 2 (2) | |
8 BUILD_SLICE 2 | |
10 BINARY_SUBSCR | |
12 UNPACK_SEQUENCE 2 | |
14 STORE_DEREF 0 (batch_size) | |
16 STORE_FAST 10 (seq_length) | |
456 18 LOAD_FAST 10 (seq_length) | |
20 STORE_FAST 11 (real_seq_length) | |
458 22 LOAD_FAST 5 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 9 (is not) | |
28 POP_JUMP_IF_FALSE 92 | |
460 30 LOAD_GLOBAL 1 (len) | |
32 LOAD_FAST 5 (past_key_value) | |
34 CALL_FUNCTION 1 | |
36 LOAD_CONST 2 (2) | |
38 COMPARE_OP 2 (==) | |
459 40 POP_JUMP_IF_TRUE 62 | |
42 LOAD_GLOBAL 2 (AssertionError) | |
461 44 LOAD_CONST 3 ('past_key_value should have 2 past states: keys and values. Got ') | |
46 LOAD_GLOBAL 1 (len) | |
48 LOAD_FAST 5 (past_key_value) | |
50 CALL_FUNCTION 1 | |
52 FORMAT_VALUE 0 | |
54 LOAD_CONST 4 (' past states') | |
56 BUILD_STRING 3 | |
459 58 CALL_FUNCTION 1 | |
60 RAISE_VARARGS 1 | |
462 >> 62 LOAD_FAST 11 (real_seq_length) | |
64 LOAD_FAST 7 (query_length) | |
66 LOAD_CONST 1 (None) | |
68 COMPARE_OP 8 (is) | |
70 POP_JUMP_IF_FALSE 86 | |
72 LOAD_FAST 5 (past_key_value) | |
74 LOAD_CONST 5 (0) | |
76 BINARY_SUBSCR | |
78 LOAD_ATTR 0 (shape) | |
80 LOAD_CONST 2 (2) | |
82 BINARY_SUBSCR | |
84 JUMP_FORWARD 2 (to 88) | |
>> 86 LOAD_FAST 7 (query_length) | |
>> 88 INPLACE_ADD | |
90 STORE_FAST 11 (real_seq_length) | |
464 >> 92 LOAD_FAST 3 (key_value_states) | |
94 LOAD_CONST 1 (None) | |
96 COMPARE_OP 8 (is) | |
98 POP_JUMP_IF_FALSE 104 | |
100 LOAD_FAST 11 (real_seq_length) | |
102 JUMP_FORWARD 8 (to 112) | |
>> 104 LOAD_FAST 3 (key_value_states) | |
106 LOAD_ATTR 0 (shape) | |
108 LOAD_CONST 6 (1) | |
110 BINARY_SUBSCR | |
>> 112 STORE_FAST 12 (key_length) | |
466 114 LOAD_CLOSURE 0 (batch_size) | |
116 LOAD_CLOSURE 1 (self) | |
118 BUILD_TUPLE 2 | |
120 LOAD_CONST 7 (<code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466>) | |
122 LOAD_CONST 8 ('T5Attention.forward.<locals>.shape') | |
124 MAKE_FUNCTION 8 (closure) | |
126 STORE_DEREF 2 (shape) | |
470 128 LOAD_CLOSURE 0 (batch_size) | |
130 LOAD_CLOSURE 1 (self) | |
132 BUILD_TUPLE 2 | |
134 LOAD_CONST 9 (<code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470>) | |
136 LOAD_CONST 10 ('T5Attention.forward.<locals>.unshape') | |
138 MAKE_FUNCTION 8 (closure) | |
140 STORE_FAST 13 (unshape) | |
474 142 LOAD_CLOSURE 2 (shape) | |
144 BUILD_TUPLE 1 | |
146 LOAD_CONST 11 (<code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474>) | |
148 LOAD_CONST 12 ('T5Attention.forward.<locals>.project') | |
150 MAKE_FUNCTION 8 (closure) | |
152 STORE_FAST 14 (project) | |
496 154 LOAD_DEREF 2 (shape) | |
156 LOAD_DEREF 1 (self) | |
158 LOAD_METHOD 3 (q) | |
160 LOAD_FAST 1 (hidden_states) | |
162 CALL_METHOD 1 | |
164 CALL_FUNCTION 1 | |
166 STORE_FAST 15 (query_states) | |
499 168 LOAD_FAST 14 (project) | |
500 170 LOAD_FAST 1 (hidden_states) | |
172 LOAD_DEREF 1 (self) | |
174 LOAD_ATTR 4 (k) | |
176 LOAD_FAST 3 (key_value_states) | |
178 LOAD_FAST 5 (past_key_value) | |
180 LOAD_CONST 1 (None) | |
182 COMPARE_OP 9 (is not) | |
184 POP_JUMP_IF_FALSE 194 | |
186 LOAD_FAST 5 (past_key_value) | |
188 LOAD_CONST 5 (0) | |
190 BINARY_SUBSCR | |
192 JUMP_FORWARD 2 (to 196) | |
>> 194 LOAD_CONST 1 (None) | |
499 >> 196 CALL_FUNCTION 4 | |
198 STORE_FAST 16 (key_states) | |
502 200 LOAD_FAST 14 (project) | |
503 202 LOAD_FAST 1 (hidden_states) | |
204 LOAD_DEREF 1 (self) | |
206 LOAD_ATTR 5 (v) | |
208 LOAD_FAST 3 (key_value_states) | |
210 LOAD_FAST 5 (past_key_value) | |
212 LOAD_CONST 1 (None) | |
214 COMPARE_OP 9 (is not) | |
216 POP_JUMP_IF_FALSE 226 | |
218 LOAD_FAST 5 (past_key_value) | |
220 LOAD_CONST 6 (1) | |
222 BINARY_SUBSCR | |
224 JUMP_FORWARD 2 (to 228) | |
>> 226 LOAD_CONST 1 (None) | |
502 >> 228 CALL_FUNCTION 4 | |
230 STORE_FAST 17 (value_states) | |
507 232 LOAD_GLOBAL 6 (torch) | |
234 LOAD_METHOD 7 (matmul) | |
508 236 LOAD_FAST 15 (query_states) | |
238 LOAD_FAST 16 (key_states) | |
240 LOAD_METHOD 8 (transpose) | |
242 LOAD_CONST 13 (3) | |
244 LOAD_CONST 2 (2) | |
246 CALL_METHOD 2 | |
507 248 CALL_METHOD 2 | |
250 STORE_FAST 18 (scores) | |
511 252 LOAD_FAST 4 (position_bias) | |
254 LOAD_CONST 1 (None) | |
256 COMPARE_OP 8 (is) | |
258 EXTENDED_ARG 1 | |
260 POP_JUMP_IF_FALSE 410 | |
512 262 LOAD_DEREF 1 (self) | |
264 LOAD_ATTR 9 (has_relative_attention_bias) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_TRUE 324 | |
513 270 LOAD_GLOBAL 6 (torch) | |
272 LOAD_ATTR 10 (zeros) | |
514 274 LOAD_CONST 6 (1) | |
276 LOAD_DEREF 1 (self) | |
278 LOAD_ATTR 11 (n_heads) | |
280 LOAD_FAST 11 (real_seq_length) | |
282 LOAD_FAST 12 (key_length) | |
284 BUILD_TUPLE 4 | |
286 LOAD_FAST 18 (scores) | |
288 LOAD_ATTR 12 (device) | |
290 LOAD_FAST 18 (scores) | |
292 LOAD_ATTR 13 (dtype) | |
513 294 LOAD_CONST 14 (('device', 'dtype')) | |
296 CALL_FUNCTION_KW 3 | |
298 STORE_FAST 4 (position_bias) | |
516 300 LOAD_DEREF 1 (self) | |
302 LOAD_ATTR 14 (gradient_checkpointing) | |
304 EXTENDED_ARG 1 | |
306 POP_JUMP_IF_FALSE 342 | |
308 LOAD_DEREF 1 (self) | |
310 LOAD_ATTR 15 (training) | |
312 EXTENDED_ARG 1 | |
314 POP_JUMP_IF_FALSE 342 | |
517 316 LOAD_CONST 15 (True) | |
318 LOAD_FAST 4 (position_bias) | |
320 STORE_ATTR 16 (requires_grad) | |
322 JUMP_FORWARD 18 (to 342) | |
519 >> 324 LOAD_DEREF 1 (self) | |
326 LOAD_ATTR 17 (compute_bias) | |
328 LOAD_FAST 11 (real_seq_length) | |
330 LOAD_FAST 12 (key_length) | |
332 LOAD_FAST 18 (scores) | |
334 LOAD_ATTR 12 (device) | |
336 LOAD_CONST 16 (('device',)) | |
338 CALL_FUNCTION_KW 3 | |
340 STORE_FAST 4 (position_bias) | |
523 >> 342 LOAD_FAST 5 (past_key_value) | |
344 LOAD_CONST 1 (None) | |
346 COMPARE_OP 9 (is not) | |
348 EXTENDED_ARG 1 | |
350 POP_JUMP_IF_FALSE 392 | |
524 352 LOAD_FAST 4 (position_bias) | |
354 LOAD_CONST 1 (None) | |
356 LOAD_CONST 1 (None) | |
358 BUILD_SLICE 2 | |
360 LOAD_CONST 1 (None) | |
362 LOAD_CONST 1 (None) | |
364 BUILD_SLICE 2 | |
366 LOAD_FAST 1 (hidden_states) | |
368 LOAD_METHOD 18 (size) | |
370 LOAD_CONST 6 (1) | |
372 CALL_METHOD 1 | |
374 UNARY_NEGATIVE | |
376 LOAD_CONST 1 (None) | |
378 BUILD_SLICE 2 | |
380 LOAD_CONST 1 (None) | |
382 LOAD_CONST 1 (None) | |
384 BUILD_SLICE 2 | |
386 BUILD_TUPLE 4 | |
388 BINARY_SUBSCR | |
390 STORE_FAST 4 (position_bias) | |
526 >> 392 LOAD_FAST 2 (mask) | |
394 LOAD_CONST 1 (None) | |
396 COMPARE_OP 9 (is not) | |
398 EXTENDED_ARG 1 | |
400 POP_JUMP_IF_FALSE 410 | |
527 402 LOAD_FAST 4 (position_bias) | |
404 LOAD_FAST 2 (mask) | |
406 BINARY_ADD | |
408 STORE_FAST 4 (position_bias) | |
529 >> 410 LOAD_FAST 18 (scores) | |
412 LOAD_FAST 4 (position_bias) | |
414 INPLACE_ADD | |
416 STORE_FAST 18 (scores) | |
530 418 LOAD_GLOBAL 19 (nn) | |
420 LOAD_ATTR 20 (functional) | |
422 LOAD_ATTR 21 (softmax) | |
424 LOAD_FAST 18 (scores) | |
426 LOAD_METHOD 22 (float) | |
428 CALL_METHOD 0 | |
430 LOAD_CONST 17 (-1) | |
432 LOAD_CONST 18 (('dim',)) | |
434 CALL_FUNCTION_KW 2 | |
436 LOAD_METHOD 23 (type_as) | |
531 438 LOAD_FAST 18 (scores) | |
530 440 CALL_METHOD 1 | |
442 STORE_FAST 19 (attn_weights) | |
533 444 LOAD_GLOBAL 19 (nn) | |
446 LOAD_ATTR 20 (functional) | |
448 LOAD_ATTR 24 (dropout) | |
534 450 LOAD_FAST 19 (attn_weights) | |
452 LOAD_DEREF 1 (self) | |
454 LOAD_ATTR 24 (dropout) | |
456 LOAD_DEREF 1 (self) | |
458 LOAD_ATTR 15 (training) | |
533 460 LOAD_CONST 19 (('p', 'training')) | |
462 CALL_FUNCTION_KW 3 | |
464 STORE_FAST 19 (attn_weights) | |
538 466 LOAD_FAST 6 (layer_head_mask) | |
468 LOAD_CONST 1 (None) | |
470 COMPARE_OP 9 (is not) | |
472 EXTENDED_ARG 1 | |
474 POP_JUMP_IF_FALSE 484 | |
539 476 LOAD_FAST 19 (attn_weights) | |
478 LOAD_FAST 6 (layer_head_mask) | |
480 BINARY_MULTIPLY | |
482 STORE_FAST 19 (attn_weights) | |
541 >> 484 LOAD_FAST 13 (unshape) | |
486 LOAD_GLOBAL 6 (torch) | |
488 LOAD_METHOD 7 (matmul) | |
490 LOAD_FAST 19 (attn_weights) | |
492 LOAD_FAST 17 (value_states) | |
494 CALL_METHOD 2 | |
496 CALL_FUNCTION 1 | |
498 STORE_FAST 20 (attn_output) | |
542 500 LOAD_DEREF 1 (self) | |
502 LOAD_METHOD 25 (o) | |
504 LOAD_FAST 20 (attn_output) | |
506 CALL_METHOD 1 | |
508 STORE_FAST 20 (attn_output) | |
544 510 LOAD_DEREF 1 (self) | |
512 LOAD_ATTR 26 (is_decoder) | |
514 EXTENDED_ARG 2 | |
516 POP_JUMP_IF_FALSE 532 | |
518 LOAD_FAST 8 (use_cache) | |
520 EXTENDED_ARG 2 | |
522 POP_JUMP_IF_FALSE 532 | |
524 LOAD_FAST 16 (key_states) | |
526 LOAD_FAST 17 (value_states) | |
528 BUILD_TUPLE 2 | |
530 JUMP_FORWARD 2 (to 534) | |
>> 532 LOAD_CONST 1 (None) | |
>> 534 STORE_FAST 21 (present_key_value_state) | |
545 536 LOAD_FAST 20 (attn_output) | |
538 BUILD_TUPLE 1 | |
540 LOAD_FAST 21 (present_key_value_state) | |
542 BUILD_TUPLE 1 | |
544 BINARY_ADD | |
546 LOAD_FAST 4 (position_bias) | |
548 BUILD_TUPLE 1 | |
550 BINARY_ADD | |
552 STORE_FAST 22 (outputs) | |
547 554 LOAD_FAST 9 (output_attentions) | |
556 EXTENDED_ARG 2 | |
558 POP_JUMP_IF_FALSE 570 | |
548 560 LOAD_FAST 22 (outputs) | |
562 LOAD_FAST 19 (attn_weights) | |
564 BUILD_TUPLE 1 | |
566 BINARY_ADD | |
568 STORE_FAST 22 (outputs) | |
549 >> 570 LOAD_FAST 22 (outputs) | |
572 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ShapeVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ShapeVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ShapeVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST real_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 92 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 104 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 112 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.shape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF shape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.unshape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST unshape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.project [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST project [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR q [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST query_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR k [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 194 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR v [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 226 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST value_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR softmax [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [TorchVariable(<function softmax at 0x7f3f1bf53e50>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dim',) [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR type_as [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [GetAttrVariable(TensorVariable(), type_as)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), type_as), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [TorchVariable(<function dropout at 0x7f3f1bf53310>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR training [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('p', 'training') [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 484 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST unshape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [NestedUserFunctionVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST value_states [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
472 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (transpose) | |
4 LOAD_CONST 1 (1) | |
6 LOAD_CONST 2 (2) | |
8 CALL_METHOD 2 | |
10 LOAD_METHOD 1 (contiguous) | |
12 CALL_METHOD 0 | |
14 LOAD_METHOD 2 (view) | |
16 LOAD_DEREF 0 (batch_size) | |
18 LOAD_CONST 3 (-1) | |
20 LOAD_DEREF 1 (self) | |
22 LOAD_ATTR 3 (inner_dim) | |
24 CALL_METHOD 3 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR contiguous [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), contiguous)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR inner_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR o [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 532 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_state [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 570 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TensorVariable(), NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), NNModuleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 248 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_IF_FALSE_OR_POP 262 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST do_cross_attention [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST do_cross_attention [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 444 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
325 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 2 (forwarded_states) | |
326 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (DenseReluDense) | |
14 LOAD_FAST 2 (forwarded_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 2 (forwarded_states) | |
327 20 LOAD_FAST 1 (hidden_states) | |
22 LOAD_FAST 0 (self) | |
24 LOAD_METHOD 2 (dropout) | |
26 LOAD_FAST 2 (forwarded_states) | |
28 CALL_METHOD 1 | |
30 BINARY_ADD | |
32 STORE_FAST 1 (hidden_states) | |
328 34 LOAD_FAST 1 (hidden_states) | |
36 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR DenseReluDense [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
288 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (wi) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 1 (hidden_states) | |
289 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (act) | |
14 LOAD_FAST 1 (hidden_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 1 (hidden_states) | |
290 20 LOAD_FAST 0 (self) | |
22 LOAD_METHOD 2 (dropout) | |
24 LOAD_FAST 1 (hidden_states) | |
26 CALL_METHOD 1 | |
28 STORE_FAST 1 (hidden_states) | |
291 30 LOAD_FAST 0 (self) | |
32 LOAD_METHOD 3 (wo) | |
34 LOAD_FAST 1 (hidden_states) | |
36 CALL_METHOD 1 | |
38 STORE_FAST 1 (hidden_states) | |
292 40 LOAD_FAST 1 (hidden_states) | |
42 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wi [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR act [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wo [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 524 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 552 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST False [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ListIteratorVariable(), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1096 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None,) [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ListIteratorVariable(), ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1156 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1172 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1214 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 724 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST i [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_module [ListIteratorVariable(), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_value [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 930 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 946 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR gradient_checkpointing [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1028 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_module [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_hidden_states [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_decoder_position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 11 [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
647 0 LOAD_FAST 9 (past_key_value) | |
2 LOAD_CONST 0 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 122 | |
648 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (is_decoder) | |
12 POP_JUMP_IF_TRUE 24 | |
649 14 LOAD_GLOBAL 1 (logger) | |
16 LOAD_METHOD 2 (warning) | |
18 LOAD_CONST 1 ('`past_key_values` is passed to the encoder. Please make sure this is intended.') | |
20 CALL_METHOD 1 | |
22 POP_TOP | |
650 >> 24 LOAD_FAST 4 (encoder_hidden_states) | |
26 LOAD_CONST 0 (None) | |
28 COMPARE_OP 8 (is) | |
30 POP_JUMP_IF_FALSE 36 | |
32 LOAD_CONST 2 (2) | |
34 JUMP_FORWARD 2 (to 38) | |
>> 36 LOAD_CONST 3 (4) | |
>> 38 STORE_FAST 13 (expected_num_past_key_values) | |
652 40 LOAD_GLOBAL 3 (len) | |
42 LOAD_FAST 9 (past_key_value) | |
44 CALL_FUNCTION 1 | |
46 LOAD_FAST 13 (expected_num_past_key_values) | |
48 COMPARE_OP 3 (!=) | |
50 POP_JUMP_IF_FALSE 96 | |
653 52 LOAD_GLOBAL 4 (ValueError) | |
654 54 LOAD_CONST 4 ('There should be ') | |
56 LOAD_FAST 13 (expected_num_past_key_values) | |
58 FORMAT_VALUE 0 | |
60 LOAD_CONST 5 (' past states. ') | |
62 LOAD_FAST 13 (expected_num_past_key_values) | |
64 LOAD_CONST 3 (4) | |
66 COMPARE_OP 2 (==) | |
68 POP_JUMP_IF_FALSE 74 | |
70 LOAD_CONST 6 ('2 (past / key) for cross attention. ') | |
72 JUMP_FORWARD 2 (to 76) | |
>> 74 LOAD_CONST 7 ('') | |
>> 76 FORMAT_VALUE 0 | |
78 LOAD_CONST 8 ('Got ') | |
80 LOAD_GLOBAL 3 (len) | |
82 LOAD_FAST 9 (past_key_value) | |
84 CALL_FUNCTION 1 | |
86 FORMAT_VALUE 0 | |
88 LOAD_CONST 9 (' past key / value states') | |
90 BUILD_STRING 7 | |
653 92 CALL_FUNCTION 1 | |
94 RAISE_VARARGS 1 | |
659 >> 96 LOAD_FAST 9 (past_key_value) | |
98 LOAD_CONST 0 (None) | |
100 LOAD_CONST 2 (2) | |
102 BUILD_SLICE 2 | |
104 BINARY_SUBSCR | |
106 STORE_FAST 14 (self_attn_past_key_value) | |
660 108 LOAD_FAST 9 (past_key_value) | |
110 LOAD_CONST 2 (2) | |
112 LOAD_CONST 0 (None) | |
114 BUILD_SLICE 2 | |
116 BINARY_SUBSCR | |
118 STORE_FAST 15 (cross_attn_past_key_value) | |
120 JUMP_FORWARD 8 (to 130) | |
662 >> 122 LOAD_CONST 10 ((None, None)) | |
124 UNPACK_SEQUENCE 2 | |
126 STORE_FAST 14 (self_attn_past_key_value) | |
128 STORE_FAST 15 (cross_attn_past_key_value) | |
664 >> 130 LOAD_FAST 0 (self) | |
132 LOAD_ATTR 5 (layer) | |
134 LOAD_CONST 11 (0) | |
136 BINARY_SUBSCR | |
665 138 LOAD_FAST 1 (hidden_states) | |
666 140 LOAD_FAST 2 (attention_mask) | |
667 142 LOAD_FAST 3 (position_bias) | |
668 144 LOAD_FAST 7 (layer_head_mask) | |
669 146 LOAD_FAST 14 (self_attn_past_key_value) | |
670 148 LOAD_FAST 10 (use_cache) | |
671 150 LOAD_FAST 11 (output_attentions) | |
664 152 LOAD_CONST 12 (('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
154 CALL_FUNCTION_KW 7 | |
156 STORE_FAST 16 (self_attention_outputs) | |
673 158 LOAD_FAST 16 (self_attention_outputs) | |
160 LOAD_CONST 0 (None) | |
162 LOAD_CONST 2 (2) | |
164 BUILD_SLICE 2 | |
166 BINARY_SUBSCR | |
168 UNPACK_SEQUENCE 2 | |
170 STORE_FAST 1 (hidden_states) | |
172 STORE_FAST 17 (present_key_value_state) | |
674 174 LOAD_FAST 16 (self_attention_outputs) | |
176 LOAD_CONST 2 (2) | |
178 LOAD_CONST 0 (None) | |
180 BUILD_SLICE 2 | |
182 BINARY_SUBSCR | |
184 STORE_FAST 18 (attention_outputs) | |
677 186 LOAD_FAST 1 (hidden_states) | |
188 LOAD_ATTR 6 (dtype) | |
190 LOAD_GLOBAL 7 (torch) | |
192 LOAD_ATTR 8 (float16) | |
194 COMPARE_OP 2 (==) | |
196 POP_JUMP_IF_FALSE 248 | |
198 LOAD_GLOBAL 7 (torch) | |
200 LOAD_METHOD 9 (isinf) | |
202 LOAD_FAST 1 (hidden_states) | |
204 CALL_METHOD 1 | |
206 LOAD_METHOD 10 (any) | |
208 CALL_METHOD 0 | |
210 POP_JUMP_IF_FALSE 248 | |
678 212 LOAD_GLOBAL 7 (torch) | |
214 LOAD_METHOD 11 (finfo) | |
216 LOAD_FAST 1 (hidden_states) | |
218 LOAD_ATTR 6 (dtype) | |
220 CALL_METHOD 1 | |
222 LOAD_ATTR 12 (max) | |
224 LOAD_CONST 13 (1000) | |
226 BINARY_SUBTRACT | |
228 STORE_FAST 19 (clamp_value) | |
679 230 LOAD_GLOBAL 7 (torch) | |
232 LOAD_ATTR 13 (clamp) | |
234 LOAD_FAST 1 (hidden_states) | |
236 LOAD_FAST 19 (clamp_value) | |
238 UNARY_NEGATIVE | |
240 LOAD_FAST 19 (clamp_value) | |
242 LOAD_CONST 14 (('min', 'max')) | |
244 CALL_FUNCTION_KW 3 | |
246 STORE_FAST 1 (hidden_states) | |
681 >> 248 LOAD_FAST 0 (self) | |
250 LOAD_ATTR 0 (is_decoder) | |
252 EXTENDED_ARG 1 | |
254 JUMP_IF_FALSE_OR_POP 262 | |
256 LOAD_FAST 4 (encoder_hidden_states) | |
258 LOAD_CONST 0 (None) | |
260 COMPARE_OP 9 (is not) | |
>> 262 STORE_FAST 20 (do_cross_attention) | |
682 264 LOAD_FAST 20 (do_cross_attention) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_FALSE 444 | |
685 270 LOAD_FAST 17 (present_key_value_state) | |
272 LOAD_CONST 0 (None) | |
274 COMPARE_OP 9 (is not) | |
276 EXTENDED_ARG 1 | |
278 POP_JUMP_IF_FALSE 296 | |
686 280 LOAD_FAST 17 (present_key_value_state) | |
282 LOAD_CONST 11 (0) | |
284 BINARY_SUBSCR | |
286 LOAD_ATTR 14 (shape) | |
288 LOAD_CONST 2 (2) | |
290 BINARY_SUBSCR | |
292 STORE_FAST 21 (query_length) | |
294 JUMP_FORWARD 4 (to 300) | |
688 >> 296 LOAD_CONST 0 (None) | |
298 STORE_FAST 21 (query_length) | |
690 >> 300 LOAD_FAST 0 (self) | |
302 LOAD_ATTR 5 (layer) | |
304 LOAD_CONST 15 (1) | |
306 BINARY_SUBSCR | |
691 308 LOAD_FAST 1 (hidden_states) | |
692 310 LOAD_FAST 4 (encoder_hidden_states) | |
693 312 LOAD_FAST 5 (encoder_attention_mask) | |
694 314 LOAD_FAST 6 (encoder_decoder_position_bias) | |
695 316 LOAD_FAST 8 (cross_attn_layer_head_mask) | |
696 318 LOAD_FAST 15 (cross_attn_past_key_value) | |
697 320 LOAD_FAST 21 (query_length) | |
698 322 LOAD_FAST 10 (use_cache) | |
699 324 LOAD_FAST 11 (output_attentions) | |
690 326 LOAD_CONST 16 (('key_value_states', 'attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'query_length', 'use_cache', 'output_attentions')) | |
328 CALL_FUNCTION_KW 9 | |
330 STORE_FAST 22 (cross_attention_outputs) | |
701 332 LOAD_FAST 22 (cross_attention_outputs) | |
334 LOAD_CONST 11 (0) | |
336 BINARY_SUBSCR | |
338 STORE_FAST 1 (hidden_states) | |
704 340 LOAD_FAST 1 (hidden_states) | |
342 LOAD_ATTR 6 (dtype) | |
344 LOAD_GLOBAL 7 (torch) | |
346 LOAD_ATTR 8 (float16) | |
348 COMPARE_OP 2 (==) | |
350 EXTENDED_ARG 1 | |
352 POP_JUMP_IF_FALSE 406 | |
354 LOAD_GLOBAL 7 (torch) | |
356 LOAD_METHOD 9 (isinf) | |
358 LOAD_FAST 1 (hidden_states) | |
360 CALL_METHOD 1 | |
362 LOAD_METHOD 10 (any) | |
364 CALL_METHOD 0 | |
366 EXTENDED_ARG 1 | |
368 POP_JUMP_IF_FALSE 406 | |
705 370 LOAD_GLOBAL 7 (torch) | |
372 LOAD_METHOD 11 (finfo) | |
374 LOAD_FAST 1 (hidden_states) | |
376 LOAD_ATTR 6 (dtype) | |
378 CALL_METHOD 1 | |
380 LOAD_ATTR 12 (max) | |
382 LOAD_CONST 13 (1000) | |
384 BINARY_SUBTRACT | |
386 STORE_FAST 19 (clamp_value) | |
706 388 LOAD_GLOBAL 7 (torch) | |
390 LOAD_ATTR 13 (clamp) | |
392 LOAD_FAST 1 (hidden_states) | |
394 LOAD_FAST 19 (clamp_value) | |
396 UNARY_NEGATIVE | |
398 LOAD_FAST 19 (clamp_value) | |
400 LOAD_CONST 14 (('min', 'max')) | |
402 CALL_FUNCTION_KW 3 | |
404 STORE_FAST 1 (hidden_states) | |
709 >> 406 LOAD_FAST 17 (present_key_value_state) | |
408 LOAD_CONST 0 (None) | |
410 COMPARE_OP 9 (is not) | |
412 EXTENDED_ARG 1 | |
414 POP_JUMP_IF_FALSE 428 | |
710 416 LOAD_FAST 17 (present_key_value_state) | |
418 LOAD_FAST 22 (cross_attention_outputs) | |
420 LOAD_CONST 15 (1) | |
422 BINARY_SUBSCR | |
424 BINARY_ADD | |
426 STORE_FAST 17 (present_key_value_state) | |
713 >> 428 LOAD_FAST 18 (attention_outputs) | |
430 LOAD_FAST 22 (cross_attention_outputs) | |
432 LOAD_CONST 2 (2) | |
434 LOAD_CONST 0 (None) | |
436 BUILD_SLICE 2 | |
438 BINARY_SUBSCR | |
440 BINARY_ADD | |
442 STORE_FAST 18 (attention_outputs) | |
716 >> 444 LOAD_FAST 0 (self) | |
446 LOAD_ATTR 5 (layer) | |
448 LOAD_CONST 17 (-1) | |
450 BINARY_SUBSCR | |
452 LOAD_FAST 1 (hidden_states) | |
454 CALL_FUNCTION 1 | |
456 STORE_FAST 1 (hidden_states) | |
719 458 LOAD_FAST 1 (hidden_states) | |
460 LOAD_ATTR 6 (dtype) | |
462 LOAD_GLOBAL 7 (torch) | |
464 LOAD_ATTR 8 (float16) | |
466 COMPARE_OP 2 (==) | |
468 EXTENDED_ARG 2 | |
470 POP_JUMP_IF_FALSE 524 | |
472 LOAD_GLOBAL 7 (torch) | |
474 LOAD_METHOD 9 (isinf) | |
476 LOAD_FAST 1 (hidden_states) | |
478 CALL_METHOD 1 | |
480 LOAD_METHOD 10 (any) | |
482 CALL_METHOD 0 | |
484 EXTENDED_ARG 2 | |
486 POP_JUMP_IF_FALSE 524 | |
720 488 LOAD_GLOBAL 7 (torch) | |
490 LOAD_METHOD 11 (finfo) | |
492 LOAD_FAST 1 (hidden_states) | |
494 LOAD_ATTR 6 (dtype) | |
496 CALL_METHOD 1 | |
498 LOAD_ATTR 12 (max) | |
500 LOAD_CONST 13 (1000) | |
502 BINARY_SUBTRACT | |
504 STORE_FAST 19 (clamp_value) | |
721 506 LOAD_GLOBAL 7 (torch) | |
508 LOAD_ATTR 13 (clamp) | |
510 LOAD_FAST 1 (hidden_states) | |
512 LOAD_FAST 19 (clamp_value) | |
514 UNARY_NEGATIVE | |
516 LOAD_FAST 19 (clamp_value) | |
518 LOAD_CONST 14 (('min', 'max')) | |
520 CALL_FUNCTION_KW 3 | |
522 STORE_FAST 1 (hidden_states) | |
723 >> 524 LOAD_FAST 1 (hidden_states) | |
526 BUILD_TUPLE 1 | |
528 STORE_FAST 23 (outputs) | |
725 530 LOAD_FAST 10 (use_cache) | |
532 EXTENDED_ARG 2 | |
534 POP_JUMP_IF_FALSE 552 | |
726 536 LOAD_FAST 23 (outputs) | |
538 LOAD_FAST 17 (present_key_value_state) | |
540 BUILD_TUPLE 1 | |
542 BINARY_ADD | |
544 LOAD_FAST 18 (attention_outputs) | |
546 BINARY_ADD | |
548 STORE_FAST 23 (outputs) | |
550 JUMP_FORWARD 8 (to 560) | |
728 >> 552 LOAD_FAST 23 (outputs) | |
554 LOAD_FAST 18 (attention_outputs) | |
556 BINARY_ADD | |
558 STORE_FAST 23 (outputs) | |
730 >> 560 LOAD_FAST 23 (outputs) | |
562 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None, None) [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attn_past_key_value [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_past_key_value [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attn_past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
569 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 8 (normed_hidden_states) | |
570 10 LOAD_FAST 0 (self) | |
12 LOAD_ATTR 1 (SelfAttention) | |
571 14 LOAD_FAST 8 (normed_hidden_states) | |
572 16 LOAD_FAST 2 (attention_mask) | |
573 18 LOAD_FAST 3 (position_bias) | |
574 20 LOAD_FAST 4 (layer_head_mask) | |
575 22 LOAD_FAST 5 (past_key_value) | |
576 24 LOAD_FAST 6 (use_cache) | |
577 26 LOAD_FAST 7 (output_attentions) | |
570 28 LOAD_CONST 1 (('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
30 CALL_FUNCTION_KW 7 | |
32 STORE_FAST 9 (attention_output) | |
579 34 LOAD_FAST 1 (hidden_states) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_METHOD 2 (dropout) | |
40 LOAD_FAST 9 (attention_output) | |
42 LOAD_CONST 2 (0) | |
44 BINARY_SUBSCR | |
46 CALL_METHOD 1 | |
48 BINARY_ADD | |
50 STORE_FAST 1 (hidden_states) | |
580 52 LOAD_FAST 1 (hidden_states) | |
54 BUILD_TUPLE 1 | |
56 LOAD_FAST 9 (attention_output) | |
58 LOAD_CONST 3 (1) | |
60 LOAD_CONST 0 (None) | |
62 BUILD_SLICE 2 | |
64 BINARY_SUBSCR | |
66 BINARY_ADD | |
68 STORE_FAST 10 (outputs) | |
581 70 LOAD_FAST 10 (outputs) | |
72 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST normed_hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR SelfAttention [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST normed_hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
454 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_ATTR 0 (shape) | |
4 LOAD_CONST 1 (None) | |
6 LOAD_CONST 2 (2) | |
8 BUILD_SLICE 2 | |
10 BINARY_SUBSCR | |
12 UNPACK_SEQUENCE 2 | |
14 STORE_DEREF 0 (batch_size) | |
16 STORE_FAST 10 (seq_length) | |
456 18 LOAD_FAST 10 (seq_length) | |
20 STORE_FAST 11 (real_seq_length) | |
458 22 LOAD_FAST 5 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 9 (is not) | |
28 POP_JUMP_IF_FALSE 92 | |
460 30 LOAD_GLOBAL 1 (len) | |
32 LOAD_FAST 5 (past_key_value) | |
34 CALL_FUNCTION 1 | |
36 LOAD_CONST 2 (2) | |
38 COMPARE_OP 2 (==) | |
459 40 POP_JUMP_IF_TRUE 62 | |
42 LOAD_GLOBAL 2 (AssertionError) | |
461 44 LOAD_CONST 3 ('past_key_value should have 2 past states: keys and values. Got ') | |
46 LOAD_GLOBAL 1 (len) | |
48 LOAD_FAST 5 (past_key_value) | |
50 CALL_FUNCTION 1 | |
52 FORMAT_VALUE 0 | |
54 LOAD_CONST 4 (' past states') | |
56 BUILD_STRING 3 | |
459 58 CALL_FUNCTION 1 | |
60 RAISE_VARARGS 1 | |
462 >> 62 LOAD_FAST 11 (real_seq_length) | |
64 LOAD_FAST 7 (query_length) | |
66 LOAD_CONST 1 (None) | |
68 COMPARE_OP 8 (is) | |
70 POP_JUMP_IF_FALSE 86 | |
72 LOAD_FAST 5 (past_key_value) | |
74 LOAD_CONST 5 (0) | |
76 BINARY_SUBSCR | |
78 LOAD_ATTR 0 (shape) | |
80 LOAD_CONST 2 (2) | |
82 BINARY_SUBSCR | |
84 JUMP_FORWARD 2 (to 88) | |
>> 86 LOAD_FAST 7 (query_length) | |
>> 88 INPLACE_ADD | |
90 STORE_FAST 11 (real_seq_length) | |
464 >> 92 LOAD_FAST 3 (key_value_states) | |
94 LOAD_CONST 1 (None) | |
96 COMPARE_OP 8 (is) | |
98 POP_JUMP_IF_FALSE 104 | |
100 LOAD_FAST 11 (real_seq_length) | |
102 JUMP_FORWARD 8 (to 112) | |
>> 104 LOAD_FAST 3 (key_value_states) | |
106 LOAD_ATTR 0 (shape) | |
108 LOAD_CONST 6 (1) | |
110 BINARY_SUBSCR | |
>> 112 STORE_FAST 12 (key_length) | |
466 114 LOAD_CLOSURE 0 (batch_size) | |
116 LOAD_CLOSURE 1 (self) | |
118 BUILD_TUPLE 2 | |
120 LOAD_CONST 7 (<code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466>) | |
122 LOAD_CONST 8 ('T5Attention.forward.<locals>.shape') | |
124 MAKE_FUNCTION 8 (closure) | |
126 STORE_DEREF 2 (shape) | |
470 128 LOAD_CLOSURE 0 (batch_size) | |
130 LOAD_CLOSURE 1 (self) | |
132 BUILD_TUPLE 2 | |
134 LOAD_CONST 9 (<code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470>) | |
136 LOAD_CONST 10 ('T5Attention.forward.<locals>.unshape') | |
138 MAKE_FUNCTION 8 (closure) | |
140 STORE_FAST 13 (unshape) | |
474 142 LOAD_CLOSURE 2 (shape) | |
144 BUILD_TUPLE 1 | |
146 LOAD_CONST 11 (<code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474>) | |
148 LOAD_CONST 12 ('T5Attention.forward.<locals>.project') | |
150 MAKE_FUNCTION 8 (closure) | |
152 STORE_FAST 14 (project) | |
496 154 LOAD_DEREF 2 (shape) | |
156 LOAD_DEREF 1 (self) | |
158 LOAD_METHOD 3 (q) | |
160 LOAD_FAST 1 (hidden_states) | |
162 CALL_METHOD 1 | |
164 CALL_FUNCTION 1 | |
166 STORE_FAST 15 (query_states) | |
499 168 LOAD_FAST 14 (project) | |
500 170 LOAD_FAST 1 (hidden_states) | |
172 LOAD_DEREF 1 (self) | |
174 LOAD_ATTR 4 (k) | |
176 LOAD_FAST 3 (key_value_states) | |
178 LOAD_FAST 5 (past_key_value) | |
180 LOAD_CONST 1 (None) | |
182 COMPARE_OP 9 (is not) | |
184 POP_JUMP_IF_FALSE 194 | |
186 LOAD_FAST 5 (past_key_value) | |
188 LOAD_CONST 5 (0) | |
190 BINARY_SUBSCR | |
192 JUMP_FORWARD 2 (to 196) | |
>> 194 LOAD_CONST 1 (None) | |
499 >> 196 CALL_FUNCTION 4 | |
198 STORE_FAST 16 (key_states) | |
502 200 LOAD_FAST 14 (project) | |
503 202 LOAD_FAST 1 (hidden_states) | |
204 LOAD_DEREF 1 (self) | |
206 LOAD_ATTR 5 (v) | |
208 LOAD_FAST 3 (key_value_states) | |
210 LOAD_FAST 5 (past_key_value) | |
212 LOAD_CONST 1 (None) | |
214 COMPARE_OP 9 (is not) | |
216 POP_JUMP_IF_FALSE 226 | |
218 LOAD_FAST 5 (past_key_value) | |
220 LOAD_CONST 6 (1) | |
222 BINARY_SUBSCR | |
224 JUMP_FORWARD 2 (to 228) | |
>> 226 LOAD_CONST 1 (None) | |
502 >> 228 CALL_FUNCTION 4 | |
230 STORE_FAST 17 (value_states) | |
507 232 LOAD_GLOBAL 6 (torch) | |
234 LOAD_METHOD 7 (matmul) | |
508 236 LOAD_FAST 15 (query_states) | |
238 LOAD_FAST 16 (key_states) | |
240 LOAD_METHOD 8 (transpose) | |
242 LOAD_CONST 13 (3) | |
244 LOAD_CONST 2 (2) | |
246 CALL_METHOD 2 | |
507 248 CALL_METHOD 2 | |
250 STORE_FAST 18 (scores) | |
511 252 LOAD_FAST 4 (position_bias) | |
254 LOAD_CONST 1 (None) | |
256 COMPARE_OP 8 (is) | |
258 EXTENDED_ARG 1 | |
260 POP_JUMP_IF_FALSE 410 | |
512 262 LOAD_DEREF 1 (self) | |
264 LOAD_ATTR 9 (has_relative_attention_bias) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_TRUE 324 | |
513 270 LOAD_GLOBAL 6 (torch) | |
272 LOAD_ATTR 10 (zeros) | |
514 274 LOAD_CONST 6 (1) | |
276 LOAD_DEREF 1 (self) | |
278 LOAD_ATTR 11 (n_heads) | |
280 LOAD_FAST 11 (real_seq_length) | |
282 LOAD_FAST 12 (key_length) | |
284 BUILD_TUPLE 4 | |
286 LOAD_FAST 18 (scores) | |
288 LOAD_ATTR 12 (device) | |
290 LOAD_FAST 18 (scores) | |
292 LOAD_ATTR 13 (dtype) | |
513 294 LOAD_CONST 14 (('device', 'dtype')) | |
296 CALL_FUNCTION_KW 3 | |
298 STORE_FAST 4 (position_bias) | |
516 300 LOAD_DEREF 1 (self) | |
302 LOAD_ATTR 14 (gradient_checkpointing) | |
304 EXTENDED_ARG 1 | |
306 POP_JUMP_IF_FALSE 342 | |
308 LOAD_DEREF 1 (self) | |
310 LOAD_ATTR 15 (training) | |
312 EXTENDED_ARG 1 | |
314 POP_JUMP_IF_FALSE 342 | |
517 316 LOAD_CONST 15 (True) | |
318 LOAD_FAST 4 (position_bias) | |
320 STORE_ATTR 16 (requires_grad) | |
322 JUMP_FORWARD 18 (to 342) | |
519 >> 324 LOAD_DEREF 1 (self) | |
326 LOAD_ATTR 17 (compute_bias) | |
328 LOAD_FAST 11 (real_seq_length) | |
330 LOAD_FAST 12 (key_length) | |
332 LOAD_FAST 18 (scores) | |
334 LOAD_ATTR 12 (device) | |
336 LOAD_CONST 16 (('device',)) | |
338 CALL_FUNCTION_KW 3 | |
340 STORE_FAST 4 (position_bias) | |
523 >> 342 LOAD_FAST 5 (past_key_value) | |
344 LOAD_CONST 1 (None) | |
346 COMPARE_OP 9 (is not) | |
348 EXTENDED_ARG 1 | |
350 POP_JUMP_IF_FALSE 392 | |
524 352 LOAD_FAST 4 (position_bias) | |
354 LOAD_CONST 1 (None) | |
356 LOAD_CONST 1 (None) | |
358 BUILD_SLICE 2 | |
360 LOAD_CONST 1 (None) | |
362 LOAD_CONST 1 (None) | |
364 BUILD_SLICE 2 | |
366 LOAD_FAST 1 (hidden_states) | |
368 LOAD_METHOD 18 (size) | |
370 LOAD_CONST 6 (1) | |
372 CALL_METHOD 1 | |
374 UNARY_NEGATIVE | |
376 LOAD_CONST 1 (None) | |
378 BUILD_SLICE 2 | |
380 LOAD_CONST 1 (None) | |
382 LOAD_CONST 1 (None) | |
384 BUILD_SLICE 2 | |
386 BUILD_TUPLE 4 | |
388 BINARY_SUBSCR | |
390 STORE_FAST 4 (position_bias) | |
526 >> 392 LOAD_FAST 2 (mask) | |
394 LOAD_CONST 1 (None) | |
396 COMPARE_OP 9 (is not) | |
398 EXTENDED_ARG 1 | |
400 POP_JUMP_IF_FALSE 410 | |
527 402 LOAD_FAST 4 (position_bias) | |
404 LOAD_FAST 2 (mask) | |
406 BINARY_ADD | |
408 STORE_FAST 4 (position_bias) | |
529 >> 410 LOAD_FAST 18 (scores) | |
412 LOAD_FAST 4 (position_bias) | |
414 INPLACE_ADD | |
416 STORE_FAST 18 (scores) | |
530 418 LOAD_GLOBAL 19 (nn) | |
420 LOAD_ATTR 20 (functional) | |
422 LOAD_ATTR 21 (softmax) | |
424 LOAD_FAST 18 (scores) | |
426 LOAD_METHOD 22 (float) | |
428 CALL_METHOD 0 | |
430 LOAD_CONST 17 (-1) | |
432 LOAD_CONST 18 (('dim',)) | |
434 CALL_FUNCTION_KW 2 | |
436 LOAD_METHOD 23 (type_as) | |
531 438 LOAD_FAST 18 (scores) | |
530 440 CALL_METHOD 1 | |
442 STORE_FAST 19 (attn_weights) | |
533 444 LOAD_GLOBAL 19 (nn) | |
446 LOAD_ATTR 20 (functional) | |
448 LOAD_ATTR 24 (dropout) | |
534 450 LOAD_FAST 19 (attn_weights) | |
452 LOAD_DEREF 1 (self) | |
454 LOAD_ATTR 24 (dropout) | |
456 LOAD_DEREF 1 (self) | |
458 LOAD_ATTR 15 (training) | |
533 460 LOAD_CONST 19 (('p', 'training')) | |
462 CALL_FUNCTION_KW 3 | |
464 STORE_FAST 19 (attn_weights) | |
538 466 LOAD_FAST 6 (layer_head_mask) | |
468 LOAD_CONST 1 (None) | |
470 COMPARE_OP 9 (is not) | |
472 EXTENDED_ARG 1 | |
474 POP_JUMP_IF_FALSE 484 | |
539 476 LOAD_FAST 19 (attn_weights) | |
478 LOAD_FAST 6 (layer_head_mask) | |
480 BINARY_MULTIPLY | |
482 STORE_FAST 19 (attn_weights) | |
541 >> 484 LOAD_FAST 13 (unshape) | |
486 LOAD_GLOBAL 6 (torch) | |
488 LOAD_METHOD 7 (matmul) | |
490 LOAD_FAST 19 (attn_weights) | |
492 LOAD_FAST 17 (value_states) | |
494 CALL_METHOD 2 | |
496 CALL_FUNCTION 1 | |
498 STORE_FAST 20 (attn_output) | |
542 500 LOAD_DEREF 1 (self) | |
502 LOAD_METHOD 25 (o) | |
504 LOAD_FAST 20 (attn_output) | |
506 CALL_METHOD 1 | |
508 STORE_FAST 20 (attn_output) | |
544 510 LOAD_DEREF 1 (self) | |
512 LOAD_ATTR 26 (is_decoder) | |
514 EXTENDED_ARG 2 | |
516 POP_JUMP_IF_FALSE 532 | |
518 LOAD_FAST 8 (use_cache) | |
520 EXTENDED_ARG 2 | |
522 POP_JUMP_IF_FALSE 532 | |
524 LOAD_FAST 16 (key_states) | |
526 LOAD_FAST 17 (value_states) | |
528 BUILD_TUPLE 2 | |
530 JUMP_FORWARD 2 (to 534) | |
>> 532 LOAD_CONST 1 (None) | |
>> 534 STORE_FAST 21 (present_key_value_state) | |
545 536 LOAD_FAST 20 (attn_output) | |
538 BUILD_TUPLE 1 | |
540 LOAD_FAST 21 (present_key_value_state) | |
542 BUILD_TUPLE 1 | |
544 BINARY_ADD | |
546 LOAD_FAST 4 (position_bias) | |
548 BUILD_TUPLE 1 | |
550 BINARY_ADD | |
552 STORE_FAST 22 (outputs) | |
547 554 LOAD_FAST 9 (output_attentions) | |
556 EXTENDED_ARG 2 | |
558 POP_JUMP_IF_FALSE 570 | |
548 560 LOAD_FAST 22 (outputs) | |
562 LOAD_FAST 19 (attn_weights) | |
564 BUILD_TUPLE 1 | |
566 BINARY_ADD | |
568 STORE_FAST 22 (outputs) | |
549 >> 570 LOAD_FAST 22 (outputs) | |
572 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ShapeVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ShapeVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ShapeVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST real_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 92 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 104 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 112 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.shape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF shape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.unshape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST unshape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.project [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST project [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR q [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST query_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR k [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 194 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR v [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 226 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST value_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR softmax [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [TorchVariable(<function softmax at 0x7f3f1bf53e50>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dim',) [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR type_as [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [GetAttrVariable(TensorVariable(), type_as)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), type_as), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [TorchVariable(<function dropout at 0x7f3f1bf53310>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR training [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('p', 'training') [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 484 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST unshape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [NestedUserFunctionVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST value_states [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
472 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (transpose) | |
4 LOAD_CONST 1 (1) | |
6 LOAD_CONST 2 (2) | |
8 CALL_METHOD 2 | |
10 LOAD_METHOD 1 (contiguous) | |
12 CALL_METHOD 0 | |
14 LOAD_METHOD 2 (view) | |
16 LOAD_DEREF 0 (batch_size) | |
18 LOAD_CONST 3 (-1) | |
20 LOAD_DEREF 1 (self) | |
22 LOAD_ATTR 3 (inner_dim) | |
24 CALL_METHOD 3 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR contiguous [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), contiguous)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR inner_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR o [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 532 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_state [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 570 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TensorVariable(), NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), NNModuleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 248 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_IF_FALSE_OR_POP 262 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST do_cross_attention [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST do_cross_attention [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 444 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
325 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 2 (forwarded_states) | |
326 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (DenseReluDense) | |
14 LOAD_FAST 2 (forwarded_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 2 (forwarded_states) | |
327 20 LOAD_FAST 1 (hidden_states) | |
22 LOAD_FAST 0 (self) | |
24 LOAD_METHOD 2 (dropout) | |
26 LOAD_FAST 2 (forwarded_states) | |
28 CALL_METHOD 1 | |
30 BINARY_ADD | |
32 STORE_FAST 1 (hidden_states) | |
328 34 LOAD_FAST 1 (hidden_states) | |
36 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR DenseReluDense [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
288 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (wi) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 1 (hidden_states) | |
289 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (act) | |
14 LOAD_FAST 1 (hidden_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 1 (hidden_states) | |
290 20 LOAD_FAST 0 (self) | |
22 LOAD_METHOD 2 (dropout) | |
24 LOAD_FAST 1 (hidden_states) | |
26 CALL_METHOD 1 | |
28 STORE_FAST 1 (hidden_states) | |
291 30 LOAD_FAST 0 (self) | |
32 LOAD_METHOD 3 (wo) | |
34 LOAD_FAST 1 (hidden_states) | |
36 CALL_METHOD 1 | |
38 STORE_FAST 1 (hidden_states) | |
292 40 LOAD_FAST 1 (hidden_states) | |
42 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wi [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR act [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wo [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 524 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 552 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST False [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ListIteratorVariable(), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1096 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None,) [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ListIteratorVariable(), ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1156 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1172 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1214 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 724 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST i [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_module [ListIteratorVariable(), ConstantVariable(NoneType), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST past_key_value [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_head_mask [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST i [ListIteratorVariable(), ListVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), ListVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_layer_head_mask [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 930 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 946 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR gradient_checkpointing [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1028 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_module [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_hidden_states [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_extended_attention_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_decoder_position_bias [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST cross_attn_layer_head_mask [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'encoder_hidden_states', 'encoder_attention_mask', 'encoder_decoder_position_bias', 'layer_head_mask', 'cross_attn_layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 11 [ListIteratorVariable(), NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
647 0 LOAD_FAST 9 (past_key_value) | |
2 LOAD_CONST 0 (None) | |
4 COMPARE_OP 9 (is not) | |
6 POP_JUMP_IF_FALSE 122 | |
648 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (is_decoder) | |
12 POP_JUMP_IF_TRUE 24 | |
649 14 LOAD_GLOBAL 1 (logger) | |
16 LOAD_METHOD 2 (warning) | |
18 LOAD_CONST 1 ('`past_key_values` is passed to the encoder. Please make sure this is intended.') | |
20 CALL_METHOD 1 | |
22 POP_TOP | |
650 >> 24 LOAD_FAST 4 (encoder_hidden_states) | |
26 LOAD_CONST 0 (None) | |
28 COMPARE_OP 8 (is) | |
30 POP_JUMP_IF_FALSE 36 | |
32 LOAD_CONST 2 (2) | |
34 JUMP_FORWARD 2 (to 38) | |
>> 36 LOAD_CONST 3 (4) | |
>> 38 STORE_FAST 13 (expected_num_past_key_values) | |
652 40 LOAD_GLOBAL 3 (len) | |
42 LOAD_FAST 9 (past_key_value) | |
44 CALL_FUNCTION 1 | |
46 LOAD_FAST 13 (expected_num_past_key_values) | |
48 COMPARE_OP 3 (!=) | |
50 POP_JUMP_IF_FALSE 96 | |
653 52 LOAD_GLOBAL 4 (ValueError) | |
654 54 LOAD_CONST 4 ('There should be ') | |
56 LOAD_FAST 13 (expected_num_past_key_values) | |
58 FORMAT_VALUE 0 | |
60 LOAD_CONST 5 (' past states. ') | |
62 LOAD_FAST 13 (expected_num_past_key_values) | |
64 LOAD_CONST 3 (4) | |
66 COMPARE_OP 2 (==) | |
68 POP_JUMP_IF_FALSE 74 | |
70 LOAD_CONST 6 ('2 (past / key) for cross attention. ') | |
72 JUMP_FORWARD 2 (to 76) | |
>> 74 LOAD_CONST 7 ('') | |
>> 76 FORMAT_VALUE 0 | |
78 LOAD_CONST 8 ('Got ') | |
80 LOAD_GLOBAL 3 (len) | |
82 LOAD_FAST 9 (past_key_value) | |
84 CALL_FUNCTION 1 | |
86 FORMAT_VALUE 0 | |
88 LOAD_CONST 9 (' past key / value states') | |
90 BUILD_STRING 7 | |
653 92 CALL_FUNCTION 1 | |
94 RAISE_VARARGS 1 | |
659 >> 96 LOAD_FAST 9 (past_key_value) | |
98 LOAD_CONST 0 (None) | |
100 LOAD_CONST 2 (2) | |
102 BUILD_SLICE 2 | |
104 BINARY_SUBSCR | |
106 STORE_FAST 14 (self_attn_past_key_value) | |
660 108 LOAD_FAST 9 (past_key_value) | |
110 LOAD_CONST 2 (2) | |
112 LOAD_CONST 0 (None) | |
114 BUILD_SLICE 2 | |
116 BINARY_SUBSCR | |
118 STORE_FAST 15 (cross_attn_past_key_value) | |
120 JUMP_FORWARD 8 (to 130) | |
662 >> 122 LOAD_CONST 10 ((None, None)) | |
124 UNPACK_SEQUENCE 2 | |
126 STORE_FAST 14 (self_attn_past_key_value) | |
128 STORE_FAST 15 (cross_attn_past_key_value) | |
664 >> 130 LOAD_FAST 0 (self) | |
132 LOAD_ATTR 5 (layer) | |
134 LOAD_CONST 11 (0) | |
136 BINARY_SUBSCR | |
665 138 LOAD_FAST 1 (hidden_states) | |
666 140 LOAD_FAST 2 (attention_mask) | |
667 142 LOAD_FAST 3 (position_bias) | |
668 144 LOAD_FAST 7 (layer_head_mask) | |
669 146 LOAD_FAST 14 (self_attn_past_key_value) | |
670 148 LOAD_FAST 10 (use_cache) | |
671 150 LOAD_FAST 11 (output_attentions) | |
664 152 LOAD_CONST 12 (('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
154 CALL_FUNCTION_KW 7 | |
156 STORE_FAST 16 (self_attention_outputs) | |
673 158 LOAD_FAST 16 (self_attention_outputs) | |
160 LOAD_CONST 0 (None) | |
162 LOAD_CONST 2 (2) | |
164 BUILD_SLICE 2 | |
166 BINARY_SUBSCR | |
168 UNPACK_SEQUENCE 2 | |
170 STORE_FAST 1 (hidden_states) | |
172 STORE_FAST 17 (present_key_value_state) | |
674 174 LOAD_FAST 16 (self_attention_outputs) | |
176 LOAD_CONST 2 (2) | |
178 LOAD_CONST 0 (None) | |
180 BUILD_SLICE 2 | |
182 BINARY_SUBSCR | |
184 STORE_FAST 18 (attention_outputs) | |
677 186 LOAD_FAST 1 (hidden_states) | |
188 LOAD_ATTR 6 (dtype) | |
190 LOAD_GLOBAL 7 (torch) | |
192 LOAD_ATTR 8 (float16) | |
194 COMPARE_OP 2 (==) | |
196 POP_JUMP_IF_FALSE 248 | |
198 LOAD_GLOBAL 7 (torch) | |
200 LOAD_METHOD 9 (isinf) | |
202 LOAD_FAST 1 (hidden_states) | |
204 CALL_METHOD 1 | |
206 LOAD_METHOD 10 (any) | |
208 CALL_METHOD 0 | |
210 POP_JUMP_IF_FALSE 248 | |
678 212 LOAD_GLOBAL 7 (torch) | |
214 LOAD_METHOD 11 (finfo) | |
216 LOAD_FAST 1 (hidden_states) | |
218 LOAD_ATTR 6 (dtype) | |
220 CALL_METHOD 1 | |
222 LOAD_ATTR 12 (max) | |
224 LOAD_CONST 13 (1000) | |
226 BINARY_SUBTRACT | |
228 STORE_FAST 19 (clamp_value) | |
679 230 LOAD_GLOBAL 7 (torch) | |
232 LOAD_ATTR 13 (clamp) | |
234 LOAD_FAST 1 (hidden_states) | |
236 LOAD_FAST 19 (clamp_value) | |
238 UNARY_NEGATIVE | |
240 LOAD_FAST 19 (clamp_value) | |
242 LOAD_CONST 14 (('min', 'max')) | |
244 CALL_FUNCTION_KW 3 | |
246 STORE_FAST 1 (hidden_states) | |
681 >> 248 LOAD_FAST 0 (self) | |
250 LOAD_ATTR 0 (is_decoder) | |
252 EXTENDED_ARG 1 | |
254 JUMP_IF_FALSE_OR_POP 262 | |
256 LOAD_FAST 4 (encoder_hidden_states) | |
258 LOAD_CONST 0 (None) | |
260 COMPARE_OP 9 (is not) | |
>> 262 STORE_FAST 20 (do_cross_attention) | |
682 264 LOAD_FAST 20 (do_cross_attention) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_FALSE 444 | |
685 270 LOAD_FAST 17 (present_key_value_state) | |
272 LOAD_CONST 0 (None) | |
274 COMPARE_OP 9 (is not) | |
276 EXTENDED_ARG 1 | |
278 POP_JUMP_IF_FALSE 296 | |
686 280 LOAD_FAST 17 (present_key_value_state) | |
282 LOAD_CONST 11 (0) | |
284 BINARY_SUBSCR | |
286 LOAD_ATTR 14 (shape) | |
288 LOAD_CONST 2 (2) | |
290 BINARY_SUBSCR | |
292 STORE_FAST 21 (query_length) | |
294 JUMP_FORWARD 4 (to 300) | |
688 >> 296 LOAD_CONST 0 (None) | |
298 STORE_FAST 21 (query_length) | |
690 >> 300 LOAD_FAST 0 (self) | |
302 LOAD_ATTR 5 (layer) | |
304 LOAD_CONST 15 (1) | |
306 BINARY_SUBSCR | |
691 308 LOAD_FAST 1 (hidden_states) | |
692 310 LOAD_FAST 4 (encoder_hidden_states) | |
693 312 LOAD_FAST 5 (encoder_attention_mask) | |
694 314 LOAD_FAST 6 (encoder_decoder_position_bias) | |
695 316 LOAD_FAST 8 (cross_attn_layer_head_mask) | |
696 318 LOAD_FAST 15 (cross_attn_past_key_value) | |
697 320 LOAD_FAST 21 (query_length) | |
698 322 LOAD_FAST 10 (use_cache) | |
699 324 LOAD_FAST 11 (output_attentions) | |
690 326 LOAD_CONST 16 (('key_value_states', 'attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'query_length', 'use_cache', 'output_attentions')) | |
328 CALL_FUNCTION_KW 9 | |
330 STORE_FAST 22 (cross_attention_outputs) | |
701 332 LOAD_FAST 22 (cross_attention_outputs) | |
334 LOAD_CONST 11 (0) | |
336 BINARY_SUBSCR | |
338 STORE_FAST 1 (hidden_states) | |
704 340 LOAD_FAST 1 (hidden_states) | |
342 LOAD_ATTR 6 (dtype) | |
344 LOAD_GLOBAL 7 (torch) | |
346 LOAD_ATTR 8 (float16) | |
348 COMPARE_OP 2 (==) | |
350 EXTENDED_ARG 1 | |
352 POP_JUMP_IF_FALSE 406 | |
354 LOAD_GLOBAL 7 (torch) | |
356 LOAD_METHOD 9 (isinf) | |
358 LOAD_FAST 1 (hidden_states) | |
360 CALL_METHOD 1 | |
362 LOAD_METHOD 10 (any) | |
364 CALL_METHOD 0 | |
366 EXTENDED_ARG 1 | |
368 POP_JUMP_IF_FALSE 406 | |
705 370 LOAD_GLOBAL 7 (torch) | |
372 LOAD_METHOD 11 (finfo) | |
374 LOAD_FAST 1 (hidden_states) | |
376 LOAD_ATTR 6 (dtype) | |
378 CALL_METHOD 1 | |
380 LOAD_ATTR 12 (max) | |
382 LOAD_CONST 13 (1000) | |
384 BINARY_SUBTRACT | |
386 STORE_FAST 19 (clamp_value) | |
706 388 LOAD_GLOBAL 7 (torch) | |
390 LOAD_ATTR 13 (clamp) | |
392 LOAD_FAST 1 (hidden_states) | |
394 LOAD_FAST 19 (clamp_value) | |
396 UNARY_NEGATIVE | |
398 LOAD_FAST 19 (clamp_value) | |
400 LOAD_CONST 14 (('min', 'max')) | |
402 CALL_FUNCTION_KW 3 | |
404 STORE_FAST 1 (hidden_states) | |
709 >> 406 LOAD_FAST 17 (present_key_value_state) | |
408 LOAD_CONST 0 (None) | |
410 COMPARE_OP 9 (is not) | |
412 EXTENDED_ARG 1 | |
414 POP_JUMP_IF_FALSE 428 | |
710 416 LOAD_FAST 17 (present_key_value_state) | |
418 LOAD_FAST 22 (cross_attention_outputs) | |
420 LOAD_CONST 15 (1) | |
422 BINARY_SUBSCR | |
424 BINARY_ADD | |
426 STORE_FAST 17 (present_key_value_state) | |
713 >> 428 LOAD_FAST 18 (attention_outputs) | |
430 LOAD_FAST 22 (cross_attention_outputs) | |
432 LOAD_CONST 2 (2) | |
434 LOAD_CONST 0 (None) | |
436 BUILD_SLICE 2 | |
438 BINARY_SUBSCR | |
440 BINARY_ADD | |
442 STORE_FAST 18 (attention_outputs) | |
716 >> 444 LOAD_FAST 0 (self) | |
446 LOAD_ATTR 5 (layer) | |
448 LOAD_CONST 17 (-1) | |
450 BINARY_SUBSCR | |
452 LOAD_FAST 1 (hidden_states) | |
454 CALL_FUNCTION 1 | |
456 STORE_FAST 1 (hidden_states) | |
719 458 LOAD_FAST 1 (hidden_states) | |
460 LOAD_ATTR 6 (dtype) | |
462 LOAD_GLOBAL 7 (torch) | |
464 LOAD_ATTR 8 (float16) | |
466 COMPARE_OP 2 (==) | |
468 EXTENDED_ARG 2 | |
470 POP_JUMP_IF_FALSE 524 | |
472 LOAD_GLOBAL 7 (torch) | |
474 LOAD_METHOD 9 (isinf) | |
476 LOAD_FAST 1 (hidden_states) | |
478 CALL_METHOD 1 | |
480 LOAD_METHOD 10 (any) | |
482 CALL_METHOD 0 | |
484 EXTENDED_ARG 2 | |
486 POP_JUMP_IF_FALSE 524 | |
720 488 LOAD_GLOBAL 7 (torch) | |
490 LOAD_METHOD 11 (finfo) | |
492 LOAD_FAST 1 (hidden_states) | |
494 LOAD_ATTR 6 (dtype) | |
496 CALL_METHOD 1 | |
498 LOAD_ATTR 12 (max) | |
500 LOAD_CONST 13 (1000) | |
502 BINARY_SUBTRACT | |
504 STORE_FAST 19 (clamp_value) | |
721 506 LOAD_GLOBAL 7 (torch) | |
508 LOAD_ATTR 13 (clamp) | |
510 LOAD_FAST 1 (hidden_states) | |
512 LOAD_FAST 19 (clamp_value) | |
514 UNARY_NEGATIVE | |
516 LOAD_FAST 19 (clamp_value) | |
518 LOAD_CONST 14 (('min', 'max')) | |
520 CALL_FUNCTION_KW 3 | |
522 STORE_FAST 1 (hidden_states) | |
723 >> 524 LOAD_FAST 1 (hidden_states) | |
526 BUILD_TUPLE 1 | |
528 STORE_FAST 23 (outputs) | |
725 530 LOAD_FAST 10 (use_cache) | |
532 EXTENDED_ARG 2 | |
534 POP_JUMP_IF_FALSE 552 | |
726 536 LOAD_FAST 23 (outputs) | |
538 LOAD_FAST 17 (present_key_value_state) | |
540 BUILD_TUPLE 1 | |
542 BINARY_ADD | |
544 LOAD_FAST 18 (attention_outputs) | |
546 BINARY_ADD | |
548 STORE_FAST 23 (outputs) | |
550 JUMP_FORWARD 8 (to 560) | |
728 >> 552 LOAD_FAST 23 (outputs) | |
554 LOAD_FAST 18 (attention_outputs) | |
556 BINARY_ADD | |
558 STORE_FAST 23 (outputs) | |
730 >> 560 LOAD_FAST 23 (outputs) | |
562 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 122 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None, None) [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attn_past_key_value [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST cross_attn_past_key_value [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attn_past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('attention_mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
569 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 8 (normed_hidden_states) | |
570 10 LOAD_FAST 0 (self) | |
12 LOAD_ATTR 1 (SelfAttention) | |
571 14 LOAD_FAST 8 (normed_hidden_states) | |
572 16 LOAD_FAST 2 (attention_mask) | |
573 18 LOAD_FAST 3 (position_bias) | |
574 20 LOAD_FAST 4 (layer_head_mask) | |
575 22 LOAD_FAST 5 (past_key_value) | |
576 24 LOAD_FAST 6 (use_cache) | |
577 26 LOAD_FAST 7 (output_attentions) | |
570 28 LOAD_CONST 1 (('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions')) | |
30 CALL_FUNCTION_KW 7 | |
32 STORE_FAST 9 (attention_output) | |
579 34 LOAD_FAST 1 (hidden_states) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_METHOD 2 (dropout) | |
40 LOAD_FAST 9 (attention_output) | |
42 LOAD_CONST 2 (0) | |
44 BINARY_SUBSCR | |
46 CALL_METHOD 1 | |
48 BINARY_ADD | |
50 STORE_FAST 1 (hidden_states) | |
580 52 LOAD_FAST 1 (hidden_states) | |
54 BUILD_TUPLE 1 | |
56 LOAD_FAST 9 (attention_output) | |
58 LOAD_CONST 3 (1) | |
60 LOAD_CONST 0 (None) | |
62 BUILD_SLICE 2 | |
64 BINARY_SUBSCR | |
66 BINARY_ADD | |
68 STORE_FAST 10 (outputs) | |
581 70 LOAD_FAST 10 (outputs) | |
72 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST normed_hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR SelfAttention [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST normed_hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_mask [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [NNModuleVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('mask', 'position_bias', 'layer_head_mask', 'past_key_value', 'use_cache', 'output_attentions') [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 7 [NNModuleVariable(), TensorVariable(), TensorVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(bool), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
454 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_ATTR 0 (shape) | |
4 LOAD_CONST 1 (None) | |
6 LOAD_CONST 2 (2) | |
8 BUILD_SLICE 2 | |
10 BINARY_SUBSCR | |
12 UNPACK_SEQUENCE 2 | |
14 STORE_DEREF 0 (batch_size) | |
16 STORE_FAST 10 (seq_length) | |
456 18 LOAD_FAST 10 (seq_length) | |
20 STORE_FAST 11 (real_seq_length) | |
458 22 LOAD_FAST 5 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 9 (is not) | |
28 POP_JUMP_IF_FALSE 92 | |
460 30 LOAD_GLOBAL 1 (len) | |
32 LOAD_FAST 5 (past_key_value) | |
34 CALL_FUNCTION 1 | |
36 LOAD_CONST 2 (2) | |
38 COMPARE_OP 2 (==) | |
459 40 POP_JUMP_IF_TRUE 62 | |
42 LOAD_GLOBAL 2 (AssertionError) | |
461 44 LOAD_CONST 3 ('past_key_value should have 2 past states: keys and values. Got ') | |
46 LOAD_GLOBAL 1 (len) | |
48 LOAD_FAST 5 (past_key_value) | |
50 CALL_FUNCTION 1 | |
52 FORMAT_VALUE 0 | |
54 LOAD_CONST 4 (' past states') | |
56 BUILD_STRING 3 | |
459 58 CALL_FUNCTION 1 | |
60 RAISE_VARARGS 1 | |
462 >> 62 LOAD_FAST 11 (real_seq_length) | |
64 LOAD_FAST 7 (query_length) | |
66 LOAD_CONST 1 (None) | |
68 COMPARE_OP 8 (is) | |
70 POP_JUMP_IF_FALSE 86 | |
72 LOAD_FAST 5 (past_key_value) | |
74 LOAD_CONST 5 (0) | |
76 BINARY_SUBSCR | |
78 LOAD_ATTR 0 (shape) | |
80 LOAD_CONST 2 (2) | |
82 BINARY_SUBSCR | |
84 JUMP_FORWARD 2 (to 88) | |
>> 86 LOAD_FAST 7 (query_length) | |
>> 88 INPLACE_ADD | |
90 STORE_FAST 11 (real_seq_length) | |
464 >> 92 LOAD_FAST 3 (key_value_states) | |
94 LOAD_CONST 1 (None) | |
96 COMPARE_OP 8 (is) | |
98 POP_JUMP_IF_FALSE 104 | |
100 LOAD_FAST 11 (real_seq_length) | |
102 JUMP_FORWARD 8 (to 112) | |
>> 104 LOAD_FAST 3 (key_value_states) | |
106 LOAD_ATTR 0 (shape) | |
108 LOAD_CONST 6 (1) | |
110 BINARY_SUBSCR | |
>> 112 STORE_FAST 12 (key_length) | |
466 114 LOAD_CLOSURE 0 (batch_size) | |
116 LOAD_CLOSURE 1 (self) | |
118 BUILD_TUPLE 2 | |
120 LOAD_CONST 7 (<code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466>) | |
122 LOAD_CONST 8 ('T5Attention.forward.<locals>.shape') | |
124 MAKE_FUNCTION 8 (closure) | |
126 STORE_DEREF 2 (shape) | |
470 128 LOAD_CLOSURE 0 (batch_size) | |
130 LOAD_CLOSURE 1 (self) | |
132 BUILD_TUPLE 2 | |
134 LOAD_CONST 9 (<code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470>) | |
136 LOAD_CONST 10 ('T5Attention.forward.<locals>.unshape') | |
138 MAKE_FUNCTION 8 (closure) | |
140 STORE_FAST 13 (unshape) | |
474 142 LOAD_CLOSURE 2 (shape) | |
144 BUILD_TUPLE 1 | |
146 LOAD_CONST 11 (<code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474>) | |
148 LOAD_CONST 12 ('T5Attention.forward.<locals>.project') | |
150 MAKE_FUNCTION 8 (closure) | |
152 STORE_FAST 14 (project) | |
496 154 LOAD_DEREF 2 (shape) | |
156 LOAD_DEREF 1 (self) | |
158 LOAD_METHOD 3 (q) | |
160 LOAD_FAST 1 (hidden_states) | |
162 CALL_METHOD 1 | |
164 CALL_FUNCTION 1 | |
166 STORE_FAST 15 (query_states) | |
499 168 LOAD_FAST 14 (project) | |
500 170 LOAD_FAST 1 (hidden_states) | |
172 LOAD_DEREF 1 (self) | |
174 LOAD_ATTR 4 (k) | |
176 LOAD_FAST 3 (key_value_states) | |
178 LOAD_FAST 5 (past_key_value) | |
180 LOAD_CONST 1 (None) | |
182 COMPARE_OP 9 (is not) | |
184 POP_JUMP_IF_FALSE 194 | |
186 LOAD_FAST 5 (past_key_value) | |
188 LOAD_CONST 5 (0) | |
190 BINARY_SUBSCR | |
192 JUMP_FORWARD 2 (to 196) | |
>> 194 LOAD_CONST 1 (None) | |
499 >> 196 CALL_FUNCTION 4 | |
198 STORE_FAST 16 (key_states) | |
502 200 LOAD_FAST 14 (project) | |
503 202 LOAD_FAST 1 (hidden_states) | |
204 LOAD_DEREF 1 (self) | |
206 LOAD_ATTR 5 (v) | |
208 LOAD_FAST 3 (key_value_states) | |
210 LOAD_FAST 5 (past_key_value) | |
212 LOAD_CONST 1 (None) | |
214 COMPARE_OP 9 (is not) | |
216 POP_JUMP_IF_FALSE 226 | |
218 LOAD_FAST 5 (past_key_value) | |
220 LOAD_CONST 6 (1) | |
222 BINARY_SUBSCR | |
224 JUMP_FORWARD 2 (to 228) | |
>> 226 LOAD_CONST 1 (None) | |
502 >> 228 CALL_FUNCTION 4 | |
230 STORE_FAST 17 (value_states) | |
507 232 LOAD_GLOBAL 6 (torch) | |
234 LOAD_METHOD 7 (matmul) | |
508 236 LOAD_FAST 15 (query_states) | |
238 LOAD_FAST 16 (key_states) | |
240 LOAD_METHOD 8 (transpose) | |
242 LOAD_CONST 13 (3) | |
244 LOAD_CONST 2 (2) | |
246 CALL_METHOD 2 | |
507 248 CALL_METHOD 2 | |
250 STORE_FAST 18 (scores) | |
511 252 LOAD_FAST 4 (position_bias) | |
254 LOAD_CONST 1 (None) | |
256 COMPARE_OP 8 (is) | |
258 EXTENDED_ARG 1 | |
260 POP_JUMP_IF_FALSE 410 | |
512 262 LOAD_DEREF 1 (self) | |
264 LOAD_ATTR 9 (has_relative_attention_bias) | |
266 EXTENDED_ARG 1 | |
268 POP_JUMP_IF_TRUE 324 | |
513 270 LOAD_GLOBAL 6 (torch) | |
272 LOAD_ATTR 10 (zeros) | |
514 274 LOAD_CONST 6 (1) | |
276 LOAD_DEREF 1 (self) | |
278 LOAD_ATTR 11 (n_heads) | |
280 LOAD_FAST 11 (real_seq_length) | |
282 LOAD_FAST 12 (key_length) | |
284 BUILD_TUPLE 4 | |
286 LOAD_FAST 18 (scores) | |
288 LOAD_ATTR 12 (device) | |
290 LOAD_FAST 18 (scores) | |
292 LOAD_ATTR 13 (dtype) | |
513 294 LOAD_CONST 14 (('device', 'dtype')) | |
296 CALL_FUNCTION_KW 3 | |
298 STORE_FAST 4 (position_bias) | |
516 300 LOAD_DEREF 1 (self) | |
302 LOAD_ATTR 14 (gradient_checkpointing) | |
304 EXTENDED_ARG 1 | |
306 POP_JUMP_IF_FALSE 342 | |
308 LOAD_DEREF 1 (self) | |
310 LOAD_ATTR 15 (training) | |
312 EXTENDED_ARG 1 | |
314 POP_JUMP_IF_FALSE 342 | |
517 316 LOAD_CONST 15 (True) | |
318 LOAD_FAST 4 (position_bias) | |
320 STORE_ATTR 16 (requires_grad) | |
322 JUMP_FORWARD 18 (to 342) | |
519 >> 324 LOAD_DEREF 1 (self) | |
326 LOAD_ATTR 17 (compute_bias) | |
328 LOAD_FAST 11 (real_seq_length) | |
330 LOAD_FAST 12 (key_length) | |
332 LOAD_FAST 18 (scores) | |
334 LOAD_ATTR 12 (device) | |
336 LOAD_CONST 16 (('device',)) | |
338 CALL_FUNCTION_KW 3 | |
340 STORE_FAST 4 (position_bias) | |
523 >> 342 LOAD_FAST 5 (past_key_value) | |
344 LOAD_CONST 1 (None) | |
346 COMPARE_OP 9 (is not) | |
348 EXTENDED_ARG 1 | |
350 POP_JUMP_IF_FALSE 392 | |
524 352 LOAD_FAST 4 (position_bias) | |
354 LOAD_CONST 1 (None) | |
356 LOAD_CONST 1 (None) | |
358 BUILD_SLICE 2 | |
360 LOAD_CONST 1 (None) | |
362 LOAD_CONST 1 (None) | |
364 BUILD_SLICE 2 | |
366 LOAD_FAST 1 (hidden_states) | |
368 LOAD_METHOD 18 (size) | |
370 LOAD_CONST 6 (1) | |
372 CALL_METHOD 1 | |
374 UNARY_NEGATIVE | |
376 LOAD_CONST 1 (None) | |
378 BUILD_SLICE 2 | |
380 LOAD_CONST 1 (None) | |
382 LOAD_CONST 1 (None) | |
384 BUILD_SLICE 2 | |
386 BUILD_TUPLE 4 | |
388 BINARY_SUBSCR | |
390 STORE_FAST 4 (position_bias) | |
526 >> 392 LOAD_FAST 2 (mask) | |
394 LOAD_CONST 1 (None) | |
396 COMPARE_OP 9 (is not) | |
398 EXTENDED_ARG 1 | |
400 POP_JUMP_IF_FALSE 410 | |
527 402 LOAD_FAST 4 (position_bias) | |
404 LOAD_FAST 2 (mask) | |
406 BINARY_ADD | |
408 STORE_FAST 4 (position_bias) | |
529 >> 410 LOAD_FAST 18 (scores) | |
412 LOAD_FAST 4 (position_bias) | |
414 INPLACE_ADD | |
416 STORE_FAST 18 (scores) | |
530 418 LOAD_GLOBAL 19 (nn) | |
420 LOAD_ATTR 20 (functional) | |
422 LOAD_ATTR 21 (softmax) | |
424 LOAD_FAST 18 (scores) | |
426 LOAD_METHOD 22 (float) | |
428 CALL_METHOD 0 | |
430 LOAD_CONST 17 (-1) | |
432 LOAD_CONST 18 (('dim',)) | |
434 CALL_FUNCTION_KW 2 | |
436 LOAD_METHOD 23 (type_as) | |
531 438 LOAD_FAST 18 (scores) | |
530 440 CALL_METHOD 1 | |
442 STORE_FAST 19 (attn_weights) | |
533 444 LOAD_GLOBAL 19 (nn) | |
446 LOAD_ATTR 20 (functional) | |
448 LOAD_ATTR 24 (dropout) | |
534 450 LOAD_FAST 19 (attn_weights) | |
452 LOAD_DEREF 1 (self) | |
454 LOAD_ATTR 24 (dropout) | |
456 LOAD_DEREF 1 (self) | |
458 LOAD_ATTR 15 (training) | |
533 460 LOAD_CONST 19 (('p', 'training')) | |
462 CALL_FUNCTION_KW 3 | |
464 STORE_FAST 19 (attn_weights) | |
538 466 LOAD_FAST 6 (layer_head_mask) | |
468 LOAD_CONST 1 (None) | |
470 COMPARE_OP 9 (is not) | |
472 EXTENDED_ARG 1 | |
474 POP_JUMP_IF_FALSE 484 | |
539 476 LOAD_FAST 19 (attn_weights) | |
478 LOAD_FAST 6 (layer_head_mask) | |
480 BINARY_MULTIPLY | |
482 STORE_FAST 19 (attn_weights) | |
541 >> 484 LOAD_FAST 13 (unshape) | |
486 LOAD_GLOBAL 6 (torch) | |
488 LOAD_METHOD 7 (matmul) | |
490 LOAD_FAST 19 (attn_weights) | |
492 LOAD_FAST 17 (value_states) | |
494 CALL_METHOD 2 | |
496 CALL_FUNCTION 1 | |
498 STORE_FAST 20 (attn_output) | |
542 500 LOAD_DEREF 1 (self) | |
502 LOAD_METHOD 25 (o) | |
504 LOAD_FAST 20 (attn_output) | |
506 CALL_METHOD 1 | |
508 STORE_FAST 20 (attn_output) | |
544 510 LOAD_DEREF 1 (self) | |
512 LOAD_ATTR 26 (is_decoder) | |
514 EXTENDED_ARG 2 | |
516 POP_JUMP_IF_FALSE 532 | |
518 LOAD_FAST 8 (use_cache) | |
520 EXTENDED_ARG 2 | |
522 POP_JUMP_IF_FALSE 532 | |
524 LOAD_FAST 16 (key_states) | |
526 LOAD_FAST 17 (value_states) | |
528 BUILD_TUPLE 2 | |
530 JUMP_FORWARD 2 (to 534) | |
>> 532 LOAD_CONST 1 (None) | |
>> 534 STORE_FAST 21 (present_key_value_state) | |
545 536 LOAD_FAST 20 (attn_output) | |
538 BUILD_TUPLE 1 | |
540 LOAD_FAST 21 (present_key_value_state) | |
542 BUILD_TUPLE 1 | |
544 BINARY_ADD | |
546 LOAD_FAST 4 (position_bias) | |
548 BUILD_TUPLE 1 | |
550 BINARY_ADD | |
552 STORE_FAST 22 (outputs) | |
547 554 LOAD_FAST 9 (output_attentions) | |
556 EXTENDED_ARG 2 | |
558 POP_JUMP_IF_FALSE 570 | |
548 560 LOAD_FAST 22 (outputs) | |
562 LOAD_FAST 19 (attn_weights) | |
564 BUILD_TUPLE 1 | |
566 BINARY_ADD | |
568 STORE_FAST 22 (outputs) | |
549 >> 570 LOAD_FAST 22 (outputs) | |
572 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ShapeVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ShapeVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ShapeVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF batch_size [ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST real_seq_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 92 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 104 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST real_seq_length [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 112 [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_length [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.shape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_DEREF shape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE batch_size [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE self [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [NewCellVariable(), NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.unshape [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST unshape [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CLOSURE shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [NewCellVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST T5Attention.forward.<locals>.project [TupleVariable(), ConstantVariable(code)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE MAKE_FUNCTION 8 [TupleVariable(), ConstantVariable(code), ConstantVariable(str)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST project [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR q [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST query_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR k [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 194 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST key_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST project [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR v [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 226 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [NestedUserFunctionVariable(), TensorVariable(), NNModuleVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
476 0 LOAD_FAST 2 (key_value_states) | |
2 LOAD_CONST 1 (None) | |
4 COMPARE_OP 8 (is) | |
6 POP_JUMP_IF_FALSE 22 | |
479 8 LOAD_DEREF 0 (shape) | |
10 LOAD_FAST 1 (proj_layer) | |
12 LOAD_FAST 0 (hidden_states) | |
14 CALL_FUNCTION 1 | |
16 CALL_FUNCTION 1 | |
18 STORE_FAST 0 (hidden_states) | |
20 JUMP_FORWARD 20 (to 42) | |
480 >> 22 LOAD_FAST 3 (past_key_value) | |
24 LOAD_CONST 1 (None) | |
26 COMPARE_OP 8 (is) | |
28 POP_JUMP_IF_FALSE 42 | |
483 30 LOAD_DEREF 0 (shape) | |
32 LOAD_FAST 1 (proj_layer) | |
34 LOAD_FAST 2 (key_value_states) | |
36 CALL_FUNCTION 1 | |
38 CALL_FUNCTION 1 | |
40 STORE_FAST 0 (hidden_states) | |
485 >> 42 LOAD_FAST 3 (past_key_value) | |
44 LOAD_CONST 1 (None) | |
46 COMPARE_OP 9 (is not) | |
48 POP_JUMP_IF_FALSE 82 | |
486 50 LOAD_FAST 2 (key_value_states) | |
52 LOAD_CONST 1 (None) | |
54 COMPARE_OP 8 (is) | |
56 POP_JUMP_IF_FALSE 78 | |
489 58 LOAD_GLOBAL 0 (torch) | |
60 LOAD_ATTR 1 (cat) | |
62 LOAD_FAST 3 (past_key_value) | |
64 LOAD_FAST 0 (hidden_states) | |
66 BUILD_LIST 2 | |
68 LOAD_CONST 2 (2) | |
70 LOAD_CONST 3 (('dim',)) | |
72 CALL_FUNCTION_KW 2 | |
74 STORE_FAST 0 (hidden_states) | |
76 JUMP_FORWARD 4 (to 82) | |
492 >> 78 LOAD_FAST 3 (past_key_value) | |
80 STORE_FAST 0 (hidden_states) | |
493 >> 82 LOAD_FAST 0 (hidden_states) | |
84 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_value_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 22 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF shape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST proj_layer [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NestedUserFunctionVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
468 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (view) | |
4 LOAD_DEREF 0 (batch_size) | |
6 LOAD_CONST 1 (-1) | |
8 LOAD_DEREF 1 (self) | |
10 LOAD_ATTR 1 (n_heads) | |
12 LOAD_DEREF 1 (self) | |
14 LOAD_ATTR 2 (key_value_proj_dim) | |
16 CALL_METHOD 4 | |
18 LOAD_METHOD 3 (transpose) | |
20 LOAD_CONST 2 (1) | |
22 LOAD_CONST 3 (2) | |
24 CALL_METHOD 2 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR n_heads [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR key_value_proj_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 4 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object shape at 0x7f3f100aebe0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 466> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 42 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST past_key_value [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object project at 0x7f3f100aed40, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 474> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST value_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST query_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST key_states [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 3 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 410 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE INPLACE_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST scores [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR softmax [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [TorchVariable(<function softmax at 0x7f3f1bf53e50>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), GetAttrVariable(TensorVariable(), float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('dim',) [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [TorchVariable(<function softmax at 0x7f3f1bf53e50>), TensorVariable(), ConstantVariable(int), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR type_as [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST scores [GetAttrVariable(TensorVariable(), type_as)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), type_as), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL nn [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR functional [TorchVariable(<module 'torch.nn' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<module 'torch.nn.functional' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/functional.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [TorchVariable(<function dropout at 0x7f3f1bf53310>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR training [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('p', 'training') [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 3 [TorchVariable(<function dropout at 0x7f3f1bf53310>), TensorVariable(), ConstantVariable(float), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_weights [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_head_mask [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 484 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST unshape [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [NestedUserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR matmul [NestedUserFunctionVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_weights [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST value_states [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [NestedUserFunctionVariable(), TorchVariable(<built-in method matmul of type object at 0x7f40b0102e20>), TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NestedUserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
472 0 LOAD_FAST 0 (states) | |
2 LOAD_METHOD 0 (transpose) | |
4 LOAD_CONST 1 (1) | |
6 LOAD_CONST 2 (2) | |
8 CALL_METHOD 2 | |
10 LOAD_METHOD 1 (contiguous) | |
12 CALL_METHOD 0 | |
14 LOAD_METHOD 2 (view) | |
16 LOAD_DEREF 0 (batch_size) | |
18 LOAD_CONST 3 (-1) | |
20 LOAD_DEREF 1 (self) | |
22 LOAD_ATTR 3 (inner_dim) | |
24 CALL_METHOD 3 | |
26 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR transpose [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [GetAttrVariable(TensorVariable(), transpose)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), transpose), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR contiguous [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), contiguous)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR view [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF batch_size [GetAttrVariable(TensorVariable(), view)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR inner_dim [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 3 [GetAttrVariable(TensorVariable(), view), ConstantVariable(int), ConstantVariable(int), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object unshape at 0x7f3f100aec90, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 470> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR o [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attn_output [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 532 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attn_output [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_state [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST position_bias [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TupleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_attentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 570 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100aedf0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 436> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TensorVariable(), NNModuleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), NNModuleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_output [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 559> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST self_attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self_attention_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 248 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_IF_FALSE_OR_POP 262 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST do_cross_attention [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST do_cross_attention [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 444 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [NNModuleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
325 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (layer_norm) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 2 (forwarded_states) | |
326 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (DenseReluDense) | |
14 LOAD_FAST 2 (forwarded_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 2 (forwarded_states) | |
327 20 LOAD_FAST 1 (hidden_states) | |
22 LOAD_FAST 0 (self) | |
24 LOAD_METHOD 2 (dropout) | |
26 LOAD_FAST 2 (forwarded_states) | |
28 CALL_METHOD 1 | |
30 BINARY_ADD | |
32 STORE_FAST 1 (hidden_states) | |
328 34 LOAD_FAST 1 (hidden_states) | |
36 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR DenseReluDense [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
288 0 LOAD_FAST 0 (self) | |
2 LOAD_METHOD 0 (wi) | |
4 LOAD_FAST 1 (hidden_states) | |
6 CALL_METHOD 1 | |
8 STORE_FAST 1 (hidden_states) | |
289 10 LOAD_FAST 0 (self) | |
12 LOAD_METHOD 1 (act) | |
14 LOAD_FAST 1 (hidden_states) | |
16 CALL_METHOD 1 | |
18 STORE_FAST 1 (hidden_states) | |
290 20 LOAD_FAST 0 (self) | |
22 LOAD_METHOD 2 (dropout) | |
24 LOAD_FAST 1 (hidden_states) | |
26 CALL_METHOD 1 | |
28 STORE_FAST 1 (hidden_states) | |
291 30 LOAD_FAST 0 (self) | |
32 LOAD_METHOD 3 (wo) | |
34 LOAD_FAST 1 (hidden_states) | |
36 CALL_METHOD 1 | |
38 STORE_FAST 1 (hidden_states) | |
292 40 LOAD_FAST 1 (hidden_states) | |
42 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wi [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR act [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR wo [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae3a0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 287> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST forwarded_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST forwarded_states [TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae7c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 324> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 524 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 1 [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST use_cache [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 552 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST attention_outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST outputs [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100b4450, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 631> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST False [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ListIteratorVariable(), ConstantVariable(bool), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1096 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (None,) [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), TupleVariable(), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [ListIteratorVariable(), TupleVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST layer_outputs [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [ListIteratorVariable(), TupleVariable(), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE UNPACK_SEQUENCE 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [ListIteratorVariable(), ConstantVariable(NoneType), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST present_key_value_state [ListIteratorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST layer_outputs [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [ListIteratorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [ListIteratorVariable(), TupleVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST position_bias [ListIteratorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR is_decoder [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1156 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF use_cache [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1172 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_DEREF output_attentions [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1214 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [ListIteratorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 724 [ListIteratorVariable(), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE FOR_ITER 1304 [ListIteratorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR final_layer_norm [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
255 0 LOAD_FAST 1 (hidden_states) | |
2 LOAD_METHOD 0 (to) | |
4 LOAD_GLOBAL 1 (torch) | |
6 LOAD_ATTR 2 (float32) | |
8 CALL_METHOD 1 | |
10 LOAD_METHOD 3 (pow) | |
12 LOAD_CONST 1 (2) | |
14 CALL_METHOD 1 | |
16 LOAD_ATTR 4 (mean) | |
18 LOAD_CONST 2 (-1) | |
20 LOAD_CONST 3 (True) | |
22 LOAD_CONST 4 (('keepdim',)) | |
24 CALL_FUNCTION_KW 2 | |
26 STORE_FAST 2 (variance) | |
256 28 LOAD_FAST 1 (hidden_states) | |
30 LOAD_GLOBAL 1 (torch) | |
32 LOAD_METHOD 5 (rsqrt) | |
34 LOAD_FAST 2 (variance) | |
36 LOAD_FAST 0 (self) | |
38 LOAD_ATTR 6 (variance_epsilon) | |
40 BINARY_ADD | |
42 CALL_METHOD 1 | |
44 BINARY_MULTIPLY | |
46 STORE_FAST 1 (hidden_states) | |
259 48 LOAD_FAST 0 (self) | |
50 LOAD_ATTR 7 (weight) | |
52 LOAD_ATTR 8 (dtype) | |
54 LOAD_GLOBAL 1 (torch) | |
56 LOAD_ATTR 9 (float16) | |
58 LOAD_GLOBAL 1 (torch) | |
60 LOAD_ATTR 10 (bfloat16) | |
62 BUILD_TUPLE 2 | |
64 COMPARE_OP 6 (in) | |
66 POP_JUMP_IF_FALSE 82 | |
260 68 LOAD_FAST 1 (hidden_states) | |
70 LOAD_METHOD 0 (to) | |
72 LOAD_FAST 0 (self) | |
74 LOAD_ATTR 7 (weight) | |
76 LOAD_ATTR 8 (dtype) | |
78 CALL_METHOD 1 | |
80 STORE_FAST 1 (hidden_states) | |
262 >> 82 LOAD_FAST 0 (self) | |
84 LOAD_ATTR 7 (weight) | |
86 LOAD_FAST 1 (hidden_states) | |
88 BINARY_MULTIPLY | |
90 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR to [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [GetAttrVariable(TensorVariable(), to)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float32 [GetAttrVariable(TensorVariable(), to), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), to), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pow [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 2 [GetAttrVariable(TensorVariable(), pow)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), pow), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR mean [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [GetAttrVariable(TensorVariable(), mean)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST True [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('keepdim',) [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 2 [GetAttrVariable(TensorVariable(), mean), ConstantVariable(int), ConstantVariable(bool), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST variance [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR rsqrt [TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST variance [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR variance_epsilon [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_ADD None [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(float)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TensorVariable(), TorchVariable(<built-in method rsqrt of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dtype [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR float16 [TorchVariable(torch.float32), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [TorchVariable(torch.float32), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR bfloat16 [TorchVariable(torch.float32), ConstantVariable(dtype), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TorchVariable(torch.float32), ConstantVariable(dtype), ConstantVariable(dtype)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP in [TorchVariable(torch.float32), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 82 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR weight [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_MULTIPLY None [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ae190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 248> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR dropout [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [NNModuleVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST output_hidden_states [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 1340 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST return_dict [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_TRUE 1374 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL BaseModelOutputWithPastAndCrossAttentions [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST hidden_states [UserDefinedClassVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST present_key_value_states [UserDefinedClassVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST all_hidden_states [UserDefinedClassVariable(), TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST all_attentions [UserDefinedClassVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST all_cross_attentions [UserDefinedClassVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST ('last_hidden_state', 'past_key_values', 'hidden_states', 'attentions', 'cross_attentions') [UserDefinedClassVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION_KW 5 [UserDefinedClassVariable(), TensorVariable(), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(NoneType), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [DataClassVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object forward at 0x7f3f100ba030, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 892> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST encoder_outputs [DataClassVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE JUMP_FORWARD 194 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST encoder_outputs [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [DataClassVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [DataClassVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST hidden_states [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR model_parallel [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 224 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST labels [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [TensorVariable(), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 264 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST decoder_input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 264 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST decoder_inputs_embeds [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is [ConstantVariable(NoneType), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 264 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR _shift_right [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST labels [UserMethodVariable(<function T5PreTrainedModel._shift_right at 0x7f3ef771cee0>, NNModuleVariable())] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [UserMethodVariable(<function T5PreTrainedModel._shift_right at 0x7f3ef771cee0>, NNModuleVariable()), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object _shift_right at 0x7f3f100b47c0, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 805> | |
806 0 LOAD_FAST 0 (self) | |
2 LOAD_ATTR 0 (config) | |
4 LOAD_ATTR 1 (decoder_start_token_id) | |
6 STORE_FAST 2 (decoder_start_token_id) | |
807 8 LOAD_FAST 0 (self) | |
10 LOAD_ATTR 0 (config) | |
12 LOAD_ATTR 2 (pad_token_id) | |
14 STORE_FAST 3 (pad_token_id) | |
809 16 LOAD_FAST 2 (decoder_start_token_id) | |
18 LOAD_CONST 0 (None) | |
20 COMPARE_OP 9 (is not) | |
22 POP_JUMP_IF_TRUE 32 | |
24 LOAD_GLOBAL 3 (AssertionError) | |
810 26 LOAD_CONST 1 ('self.model.config.decoder_start_token_id has to be defined. In T5 it is usually set to the pad_token_id. See T5 docs for more information') | |
809 28 CALL_FUNCTION 1 | |
30 RAISE_VARARGS 1 | |
815 >> 32 LOAD_GLOBAL 4 (is_torch_fx_proxy) | |
34 LOAD_FAST 1 (input_ids) | |
36 CALL_FUNCTION 1 | |
38 POP_JUMP_IF_FALSE 98 | |
817 40 LOAD_GLOBAL 5 (torch) | |
42 LOAD_METHOD 6 (full) | |
44 LOAD_FAST 1 (input_ids) | |
46 LOAD_ATTR 7 (shape) | |
48 LOAD_CONST 0 (None) | |
50 LOAD_CONST 2 (-1) | |
52 BUILD_SLICE 2 | |
54 BINARY_SUBSCR | |
56 LOAD_CONST 3 ((1,)) | |
58 BINARY_ADD | |
60 LOAD_FAST 2 (decoder_start_token_id) | |
62 CALL_METHOD 2 | |
64 STORE_FAST 4 (shifted_input_ids) | |
818 66 LOAD_GLOBAL 5 (torch) | |
68 LOAD_ATTR 8 (cat) | |
70 LOAD_FAST 4 (shifted_input_ids) | |
72 LOAD_FAST 1 (input_ids) | |
74 LOAD_CONST 4 (Ellipsis) | |
76 LOAD_CONST 0 (None) | |
78 LOAD_CONST 2 (-1) | |
80 BUILD_SLICE 2 | |
82 BUILD_TUPLE 2 | |
84 BINARY_SUBSCR | |
86 BUILD_LIST 2 | |
88 LOAD_CONST 2 (-1) | |
90 LOAD_CONST 5 (('dim',)) | |
92 CALL_FUNCTION_KW 2 | |
94 STORE_FAST 4 (shifted_input_ids) | |
96 JUMP_FORWARD 52 (to 150) | |
820 >> 98 LOAD_FAST 1 (input_ids) | |
100 LOAD_METHOD 9 (new_zeros) | |
102 LOAD_FAST 1 (input_ids) | |
104 LOAD_ATTR 7 (shape) | |
106 CALL_METHOD 1 | |
108 STORE_FAST 4 (shifted_input_ids) | |
821 110 LOAD_FAST 1 (input_ids) | |
112 LOAD_CONST 4 (Ellipsis) | |
114 LOAD_CONST 0 (None) | |
116 LOAD_CONST 2 (-1) | |
118 BUILD_SLICE 2 | |
120 BUILD_TUPLE 2 | |
122 BINARY_SUBSCR | |
124 LOAD_METHOD 10 (clone) | |
126 CALL_METHOD 0 | |
128 LOAD_FAST 4 (shifted_input_ids) | |
130 LOAD_CONST 4 (Ellipsis) | |
132 LOAD_CONST 6 (1) | |
134 LOAD_CONST 0 (None) | |
136 BUILD_SLICE 2 | |
138 BUILD_TUPLE 2 | |
140 STORE_SUBSCR | |
822 142 LOAD_FAST 2 (decoder_start_token_id) | |
144 LOAD_FAST 4 (shifted_input_ids) | |
146 LOAD_CONST 7 ((Ellipsis, 0)) | |
148 STORE_SUBSCR | |
824 >> 150 LOAD_FAST 3 (pad_token_id) | |
152 LOAD_CONST 0 (None) | |
154 COMPARE_OP 9 (is not) | |
156 POP_JUMP_IF_TRUE 166 | |
158 LOAD_GLOBAL 3 (AssertionError) | |
160 LOAD_CONST 8 ('self.model.config.pad_token_id has to be defined.') | |
162 CALL_FUNCTION 1 | |
164 RAISE_VARARGS 1 | |
826 >> 166 LOAD_FAST 4 (shifted_input_ids) | |
168 LOAD_METHOD 11 (masked_fill_) | |
170 LOAD_FAST 4 (shifted_input_ids) | |
172 LOAD_CONST 9 (-100) | |
174 COMPARE_OP 2 (==) | |
176 LOAD_FAST 3 (pad_token_id) | |
178 CALL_METHOD 2 | |
180 POP_TOP | |
828 182 LOAD_GLOBAL 5 (torch) | |
184 LOAD_METHOD 12 (all) | |
186 LOAD_FAST 4 (shifted_input_ids) | |
188 LOAD_CONST 10 (0) | |
190 COMPARE_OP 5 (>=) | |
192 CALL_METHOD 1 | |
194 LOAD_METHOD 13 (item) | |
196 CALL_METHOD 0 | |
198 POP_JUMP_IF_TRUE 208 | |
200 LOAD_GLOBAL 3 (AssertionError) | |
202 LOAD_CONST 11 ('Verify that `shifted_input_ids` has only positive values') | |
204 CALL_FUNCTION 1 | |
206 RAISE_VARARGS 1 | |
830 >> 208 LOAD_FAST 4 (shifted_input_ids) | |
210 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR decoder_start_token_id [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST decoder_start_token_id [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST self [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR config [NNModuleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR pad_token_id [HFPretrainedConfigVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST pad_token_id [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST decoder_start_token_id [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_TRUE 32 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL is_torch_fx_proxy [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [UserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [UserFunctionVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object is_torch_fx_proxy at 0x7f3f198c6870, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/utils/import_utils.py", line 848> | |
849 0 LOAD_GLOBAL 0 (is_torch_fx_available) | |
2 CALL_FUNCTION 0 | |
4 POP_JUMP_IF_FALSE 28 | |
850 6 LOAD_CONST 1 (0) | |
8 LOAD_CONST 0 (None) | |
10 IMPORT_NAME 1 (torch.fx) | |
12 STORE_FAST 1 (torch) | |
852 14 LOAD_GLOBAL 2 (isinstance) | |
16 LOAD_FAST 0 (x) | |
18 LOAD_FAST 1 (torch) | |
20 LOAD_ATTR 3 (fx) | |
22 LOAD_ATTR 4 (Proxy) | |
24 CALL_FUNCTION 2 | |
26 RETURN_VALUE | |
853 >> 28 LOAD_CONST 2 (False) | |
30 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL is_torch_fx_available [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [UserFunctionVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] INLINING <code object is_torch_fx_available at 0x7f3f198c1190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/utils/import_utils.py", line 359> | |
360 0 LOAD_GLOBAL 0 (_torch_fx_available) | |
2 RETURN_VALUE | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL _torch_fx_available [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object is_torch_fx_available at 0x7f3f198c1190, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/utils/import_utils.py", line 359> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 28 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE IMPORT_NAME torch.fx [ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST torch [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL isinstance [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST x [BuiltinVariable(isinstance)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST torch [BuiltinVariable(isinstance), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR fx [BuiltinVariable(isinstance), TensorVariable(), TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR Proxy [BuiltinVariable(isinstance), TensorVariable(), TorchVariable(<module 'torch.fx' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [BuiltinVariable(isinstance), TensorVariable(), TorchVariable(<class 'torch.fx.proxy.Proxy'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE RETURN_VALUE None [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] DONE INLINING <code object is_torch_fx_proxy at 0x7f3f198c6870, file "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/utils/import_utils.py", line 848> | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_FALSE 98 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR new_zeros [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [GetAttrVariable(TensorVariable(), new_zeros)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR shape [GetAttrVariable(TensorVariable(), new_zeros), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [GetAttrVariable(TensorVariable(), new_zeros), ShapeVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_FAST shifted_input_ids [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST Ellipsis [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), ConstantVariable(ellipsis)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -1 [TensorVariable(), ConstantVariable(ellipsis), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TensorVariable(), ConstantVariable(ellipsis), ConstantVariable(NoneType), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TensorVariable(), ConstantVariable(ellipsis), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BINARY_SUBSCR None [TensorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR clone [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), clone)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST shifted_input_ids [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST Ellipsis [TensorVariable(), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 1 [TensorVariable(), TensorVariable(), ConstantVariable(ellipsis)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [TensorVariable(), TensorVariable(), ConstantVariable(ellipsis), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_SLICE 2 [TensorVariable(), TensorVariable(), ConstantVariable(ellipsis), ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE BUILD_TUPLE 2 [TensorVariable(), TensorVariable(), ConstantVariable(ellipsis), SliceVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_SUBSCR None [TensorVariable(), TensorVariable(), TupleVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST decoder_start_token_id [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST shifted_input_ids [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST (Ellipsis, 0) [ConstantVariable(int), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE STORE_SUBSCR None [ConstantVariable(int), TensorVariable(), ConstantVariable(tuple)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST pad_token_id [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST None [ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP is not [ConstantVariable(int), ConstantVariable(NoneType)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_JUMP_IF_TRUE 166 [ConstantVariable(bool)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST shifted_input_ids [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR masked_fill_ [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST shifted_input_ids [GetAttrVariable(TensorVariable(), masked_fill_)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST -100 [GetAttrVariable(TensorVariable(), masked_fill_), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP == [GetAttrVariable(TensorVariable(), masked_fill_), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST pad_token_id [GetAttrVariable(TensorVariable(), masked_fill_), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 2 [GetAttrVariable(TensorVariable(), masked_fill_), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE POP_TOP None [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_GLOBAL torch [] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR all [TorchVariable(<module 'torch' from '/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/__init__.py'>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_FAST shifted_input_ids [TorchVariable(<built-in method all of type object at 0x7f40b0102e20>)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_CONST 0 [TorchVariable(<built-in method all of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE COMPARE_OP >= [TorchVariable(<built-in method all of type object at 0x7f40b0102e20>), TensorVariable(), ConstantVariable(int)] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 1 [TorchVariable(<built-in method all of type object at 0x7f40b0102e20>), TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE LOAD_ATTR item [TensorVariable()] | |
torchdynamo.symbolic_convert: [DEBUG] TRACE CALL_FUNCTION 0 [GetAttrVariable(TensorVariable(), item)] | |
torchdynamo.symbolic_convert: [WARNING] Graph break: Tensor.item from user code at File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 1624, in forward | |
decoder_input_ids = self._shift_right(labels) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/transformers/models/t5/modeling_t5.py", line 828, in _shift_right | |
assert torch.all(shifted_input_ids >= 0).item(), "Verify that `shifted_input_ids` has only positive values" | |
torchdynamo.output_graph: [INFO] TRACED GRAPH | |
__compiled_fn_1 <eval_with_key>.3 opcode name target args kwargs | |
------------- --------------------------------------------------------------------- -------------------------------------------------------------------- ---------------------------------------------------------------------------------- -------------------------------------------------------------- | |
placeholder input_ids input_ids () {} | |
call_method view view (input_ids, -1, 1024) {} | |
call_module self_encoder_embed_tokens self_encoder_embed_tokens (view,) {} | |
call_function ones <built-in method ones of type object at 0x7f40b0102e20> (8, 1024) {} | |
call_method to to (ones, device(type='cuda', index=0)) {} | |
call_function getitem <built-in function getitem> (to, (slice(None, None, None), None, None, slice(None, None, None))) {} | |
call_method to_1 to (getitem,) {'dtype': torch.float32} | |
call_function sub <built-in function sub> (1.0, to_1) {} | |
call_function mul <built-in function mul> (sub, -10000.0) {} | |
call_module self_encoder_dropout self_encoder_dropout (self_encoder_embed_tokens,) {} | |
call_method to_2 to (self_encoder_dropout, torch.float32) {} | |
call_method pow_1 pow (to_2, 2) {} | |
call_method mean mean (pow_1, -1) {'keepdim': True} | |
call_function add <built-in function add> (mean, 1e-06) {} | |
call_function rsqrt <built-in method rsqrt of type object at 0x7f40b0102e20> (add,) {} | |
call_function mul_1 <built-in function mul> (self_encoder_dropout, rsqrt) {} | |
get_attr self_encoder_block_0__layer_0__layer_norm_weight_2 self_encoder_block_0__layer_0__layer_norm_weight () {} | |
call_function mul_2 <built-in function mul> (self_encoder_block_0__layer_0__layer_norm_weight_2, mul_1) {} | |
call_module self_encoder_block_0__layer_0__self_attention_q self_encoder_block_0__layer_0__SelfAttention_q (mul_2,) {} | |
call_method view_1 view (self_encoder_block_0__layer_0__self_attention_q, 8, -1, 8, 64) {} | |
call_method transpose transpose (view_1, 1, 2) {} | |
call_module self_encoder_block_0__layer_0__self_attention_k self_encoder_block_0__layer_0__SelfAttention_k (mul_2,) {} | |
call_method view_2 view (self_encoder_block_0__layer_0__self_attention_k, 8, -1, 8, 64) {} | |
call_method transpose_1 transpose (view_2, 1, 2) {} | |
call_module self_encoder_block_0__layer_0__self_attention_v self_encoder_block_0__layer_0__SelfAttention_v (mul_2,) {} | |
call_method view_3 view (self_encoder_block_0__layer_0__self_attention_v, 8, -1, 8, 64) {} | |
call_method transpose_2 transpose (view_3, 1, 2) {} | |
call_method transpose_3 transpose (transpose_1, 3, 2) {} | |
call_function matmul <built-in method matmul of type object at 0x7f40b0102e20> (transpose, transpose_3) {} | |
call_function arange <built-in method arange of type object at 0x7f40b0102e20> (1024,) {'dtype': torch.int64, 'device': device(type='cuda', index=0)} | |
call_function getitem_1 <built-in function getitem> (arange, (slice(None, None, None), None)) {} | |
call_function arange_1 <built-in method arange of type object at 0x7f40b0102e20> (1024,) {'dtype': torch.int64, 'device': device(type='cuda', index=0)} | |
call_function getitem_2 <built-in function getitem> (arange_1, (None, slice(None, None, None))) {} | |
call_function sub_1 <built-in function sub> (getitem_2, getitem_1) {} | |
call_function gt <built-in function gt> (sub_1, 0) {} | |
call_method to_3 to (gt, torch.int64) {} | |
call_function mul_3 <built-in function mul> (to_3, 16) {} | |
call_function add_1 <built-in function add> (mul_3, 0) {} | |
call_function abs_1 <built-in method abs of type object at 0x7f40b0102e20> (sub_1,) {} | |
call_function lt <built-in function lt> (abs_1, 8) {} | |
call_method float_1 float (abs_1,) {} | |
call_function truediv <built-in function truediv> (float_1, 8) {} | |
call_function log <built-in method log of type object at 0x7f40b0102e20> (truediv,) {} | |
call_function truediv_1 <built-in function truediv> (log, 2.772588722239781) {} | |
call_function mul_4 <built-in function mul> (truediv_1, 8) {} | |
call_method to_4 to (mul_4, torch.int64) {} | |
call_function add_2 <built-in function add> (8, to_4) {} | |
call_function full_like <built-in method full_like of type object at 0x7f40b0102e20> (add_2, 15) {} | |
call_function min_1 <built-in method min of type object at 0x7f40b0102e20> (add_2, full_like) {} | |
call_function where <built-in method where of type object at 0x7f40b0102e20> (lt, abs_1, min_1) {} | |
call_function iadd <built-in function iadd> (add_1, where) {} | |
call_module self_encoder_block_0__layer_0__self_attention_relative_attention_bias self_encoder_block_0__layer_0__SelfAttention_relative_attention_bias (iadd,) {} | |
call_method permute permute (self_encoder_block_0__layer_0__self_attention_relative_attention_bias, [2, 0, 1]) {} | |
call_method unsqueeze unsqueeze (permute, 0) {} | |
call_function add_3 <built-in function add> (unsqueeze, mul) {} | |
call_function iadd_1 <built-in function iadd> (matmul, add_3) {} | |
call_method float_2 float (iadd_1,) {} | |
call_function softmax <function softmax at 0x7f3f1bf53e50> (float_2,) {'dim': -1} | |
call_method type_as type_as (softmax, iadd_1) {} | |
call_function lowmem_dropout_5 <function lowmem_dropout at 0x7f3ef6b84d30> (type_as,) {'p': 0.1, 'training': True} | |
call_function matmul_1 <built-in method matmul of type object at 0x7f40b0102e20> (lowmem_dropout_5, transpose_2) {} | |
call_method transpose_4 transpose (matmul_1, 1, 2) {} | |
call_method contiguous contiguous (transpose_4,) {} | |
call_method view_4 view (contiguous, 8, -1, 512) {} | |
call_module self_encoder_block_0__layer_0__self_attention_o self_encoder_block_0__layer_0__SelfAttention_o (view_4,) {} | |
call_module self_encoder_block_0__layer_0__dropout self_encoder_block_0__layer_0__dropout (self_encoder_block_0__layer_0__self_attention_o,) {} | |
call_function add_4 <built-in function add> (self_encoder_dropout, self_encoder_block_0__layer_0__dropout) {} | |
call_method to_5 to (add_4, torch.float32) {} | |
call_method pow_2 pow (to_5, 2) {} | |
call_method mean_1 mean (pow_2, -1) {'keepdim': True} | |
call_function add_5 <built-in function add> (mean_1, 1e-06) {} | |
call_function rsqrt_1 <built-in method rsqrt of type object at 0x7f40b0102e20> (add_5,) {} | |
call_function mul_5 <built-in function mul> (add_4, rsqrt_1) {} | |
get_attr self_encoder_block_0__layer_1__layer_norm_weight_2 self_encoder_block_0__layer_1__layer_norm_weight () {} | |
call_function mul_6 <built-in function mul> (self_encoder_block_0__layer_1__layer_norm_weight_2, mul_5) {} | |
call_module self_encoder_block_0__layer__1__dense_relu_dense_wi self_encoder_block_0__layer__1__DenseReluDense_wi (mul_6,) {} | |
call_module self_encoder_block_0__layer__1__dense_relu_dense_act self_encoder_block_0__layer__1__DenseReluDense_act (self_encoder_block_0__layer__1__dense_relu_dense_wi,) {} | |
call_module self_encoder_block_0__layer__1__dense_relu_dense_dropout self_encoder_block_0__layer__1__DenseReluDense_dropout (self_encoder_block_0__layer__1__dense_relu_dense_act,) {} | |
call_module self_encoder_block_0__layer__1__dense_relu_dense_wo self_encoder_block_0__layer__1__DenseReluDense_wo (self_encoder_block_0__layer__1__dense_relu_dense_dropout,) {} | |
call_module self_encoder_block_0__layer__1__dropout self_encoder_block_0__layer__1__dropout (self_encoder_block_0__layer__1__dense_relu_dense_wo,) {} | |
call_function add_6 <built-in function add> (add_4, self_encoder_block_0__layer__1__dropout) {} | |
call_method to_6 to (add_6, torch.float32) {} | |
call_method pow_3 pow (to_6, 2) {} | |
call_method mean_2 mean (pow_3, -1) {'keepdim': True} | |
call_function add_7 <built-in function add> (mean_2, 1e-06) {} | |
call_function rsqrt_2 <built-in method rsqrt of type object at 0x7f40b0102e20> (add_7,) {} | |
call_function mul_7 <built-in function mul> (add_6, rsqrt_2) {} | |
get_attr self_encoder_block_1__layer_0__layer_norm_weight_2 self_encoder_block_1__layer_0__layer_norm_weight () {} | |
call_function mul_8 <built-in function mul> (self_encoder_block_1__layer_0__layer_norm_weight_2, mul_7) {} | |
call_module self_encoder_block_1__layer_0__self_attention_q self_encoder_block_1__layer_0__SelfAttention_q (mul_8,) {} | |
call_method view_5 view (self_encoder_block_1__layer_0__self_attention_q, 8, -1, 8, 64) {} | |
call_method transpose_5 transpose (view_5, 1, 2) {} | |
call_module self_encoder_block_1__layer_0__self_attention_k self_encoder_block_1__layer_0__SelfAttention_k (mul_8,) {} | |
call_method view_6 view (self_encoder_block_1__layer_0__self_attention_k, 8, -1, 8, 64) {} | |
call_method transpose_6 transpose (view_6, 1, 2) {} | |
call_module self_encoder_block_1__layer_0__self_attention_v self_encoder_block_1__layer_0__SelfAttention_v (mul_8,) {} | |
call_method view_7 view (self_encoder_block_1__layer_0__self_attention_v, 8, -1, 8, 64) {} | |
call_method transpose_7 transpose (view_7, 1, 2) {} | |
call_method transpose_8 transpose (transpose_6, 3, 2) {} | |
call_function matmul_2 <built-in method matmul of type object at 0x7f40b0102e20> (transpose_5, transpose_8) {} | |
call_function iadd_2 <built-in function iadd> (matmul_2, add_3) {} | |
call_method float_3 float (iadd_2,) {} | |
call_function softmax_1 <function softmax at 0x7f3f1bf53e50> (float_3,) {'dim': -1} | |
call_method type_as_1 type_as (softmax_1, iadd_2) {} | |
call_function lowmem_dropout_4 <function lowmem_dropout at 0x7f3ef6b84d30> (type_as_1,) {'p': 0.1, 'training': True} | |
call_function matmul_3 <built-in method matmul of type object at 0x7f40b0102e20> (lowmem_dropout_4, transpose_7) {} | |
call_method transpose_9 transpose (matmul_3, 1, 2) {} | |
call_method contiguous_1 contiguous (transpose_9,) {} | |
call_method view_8 view (contiguous_1, 8, -1, 512) {} | |
call_module self_encoder_block_1__layer_0__self_attention_o self_encoder_block_1__layer_0__SelfAttention_o (view_8,) {} | |
call_module self_encoder_block_1__layer_0__dropout self_encoder_block_1__layer_0__dropout (self_encoder_block_1__layer_0__self_attention_o,) {} | |
call_function add_8 <built-in function add> (add_6, self_encoder_block_1__layer_0__dropout) {} | |
call_method to_7 to (add_8, torch.float32) {} | |
call_method pow_4 pow (to_7, 2) {} | |
call_method mean_3 mean (pow_4, -1) {'keepdim': True} | |
call_function add_9 <built-in function add> (mean_3, 1e-06) {} | |
call_function rsqrt_3 <built-in method rsqrt of type object at 0x7f40b0102e20> (add_9,) {} | |
call_function mul_9 <built-in function mul> (add_8, rsqrt_3) {} | |
get_attr self_encoder_block_1__layer_1__layer_norm_weight_2 self_encoder_block_1__layer_1__layer_norm_weight () {} | |
call_function mul_10 <built-in function mul> (self_encoder_block_1__layer_1__layer_norm_weight_2, mul_9) {} | |
call_module self_encoder_block_1__layer__1__dense_relu_dense_wi self_encoder_block_1__layer__1__DenseReluDense_wi (mul_10,) {} | |
call_module self_encoder_block_0__layer__1__dense_relu_dense_act_1 self_encoder_block_0__layer__1__DenseReluDense_act (self_encoder_block_1__layer__1__dense_relu_dense_wi,) {} | |
call_module self_encoder_block_1__layer__1__dense_relu_dense_dropout self_encoder_block_1__layer__1__DenseReluDense_dropout (self_encoder_block_0__layer__1__dense_relu_dense_act_1,) {} | |
call_module self_encoder_block_1__layer__1__dense_relu_dense_wo self_encoder_block_1__layer__1__DenseReluDense_wo (self_encoder_block_1__layer__1__dense_relu_dense_dropout,) {} | |
call_module self_encoder_block_1__layer__1__dropout self_encoder_block_1__layer__1__dropout (self_encoder_block_1__layer__1__dense_relu_dense_wo,) {} | |
call_function add_10 <built-in function add> (add_8, self_encoder_block_1__layer__1__dropout) {} | |
call_method to_8 to (add_10, torch.float32) {} | |
call_method pow_5 pow (to_8, 2) {} | |
call_method mean_4 mean (pow_5, -1) {'keepdim': True} | |
call_function add_11 <built-in function add> (mean_4, 1e-06) {} | |
call_function rsqrt_4 <built-in method rsqrt of type object at 0x7f40b0102e20> (add_11,) {} | |
call_function mul_11 <built-in function mul> (add_10, rsqrt_4) {} | |
get_attr self_encoder_block_2__layer_0__layer_norm_weight_2 self_encoder_block_2__layer_0__layer_norm_weight () {} | |
call_function mul_12 <built-in function mul> (self_encoder_block_2__layer_0__layer_norm_weight_2, mul_11) {} | |
call_module self_encoder_block_2__layer_0__self_attention_q self_encoder_block_2__layer_0__SelfAttention_q (mul_12,) {} | |
call_method view_9 view (self_encoder_block_2__layer_0__self_attention_q, 8, -1, 8, 64) {} | |
call_method transpose_10 transpose (view_9, 1, 2) {} | |
call_module self_encoder_block_2__layer_0__self_attention_k self_encoder_block_2__layer_0__SelfAttention_k (mul_12,) {} | |
call_method view_10 view (self_encoder_block_2__layer_0__self_attention_k, 8, -1, 8, 64) {} | |
call_method transpose_11 transpose (view_10, 1, 2) {} | |
call_module self_encoder_block_2__layer_0__self_attention_v self_encoder_block_2__layer_0__SelfAttention_v (mul_12,) {} | |
call_method view_11 view (self_encoder_block_2__layer_0__self_attention_v, 8, -1, 8, 64) {} | |
call_method transpose_12 transpose (view_11, 1, 2) {} | |
call_method transpose_13 transpose (transpose_11, 3, 2) {} | |
call_function matmul_4 <built-in method matmul of type object at 0x7f40b0102e20> (transpose_10, transpose_13) {} | |
call_function iadd_3 <built-in function iadd> (matmul_4, add_3) {} | |
call_method float_4 float (iadd_3,) {} | |
call_function softmax_2 <function softmax at 0x7f3f1bf53e50> (float_4,) {'dim': -1} | |
call_method type_as_2 type_as (softmax_2, iadd_3) {} | |
call_function lowmem_dropout_3 <function lowmem_dropout at 0x7f3ef6b84d30> (type_as_2,) {'p': 0.1, 'training': True} | |
call_function matmul_5 <built-in method matmul of type object at 0x7f40b0102e20> (lowmem_dropout_3, transpose_12) {} | |
call_method transpose_14 transpose (matmul_5, 1, 2) {} | |
call_method contiguous_2 contiguous (transpose_14,) {} | |
call_method view_12 view (contiguous_2, 8, -1, 512) {} | |
call_module self_encoder_block_2__layer_0__self_attention_o self_encoder_block_2__layer_0__SelfAttention_o (view_12,) {} | |
call_module self_encoder_block_2__layer_0__dropout self_encoder_block_2__layer_0__dropout (self_encoder_block_2__layer_0__self_attention_o,) {} | |
call_function add_12 <built-in function add> (add_10, self_encoder_block_2__layer_0__dropout) {} | |
call_method to_9 to (add_12, torch.float32) {} | |
call_method pow_6 pow (to_9, 2) {} | |
call_method mean_5 mean (pow_6, -1) {'keepdim': True} | |
call_function add_13 <built-in function add> (mean_5, 1e-06) {} | |
call_function rsqrt_5 <built-in method rsqrt of type object at 0x7f40b0102e20> (add_13,) {} | |
call_function mul_13 <built-in function mul> (add_12, rsqrt_5) {} | |
get_attr self_encoder_block_2__layer_1__layer_norm_weight_2 self_encoder_block_2__layer_1__layer_norm_weight () {} | |
call_function mul_14 <built-in function mul> (self_encoder_block_2__layer_1__layer_norm_weight_2, mul_13) {} | |
call_module self_encoder_block_2__layer__1__dense_relu_dense_wi self_encoder_block_2__layer__1__DenseReluDense_wi (mul_14,) {} | |
call_module self_encoder_block_0__layer__1__dense_relu_dense_act_2 self_encoder_block_0__layer__1__DenseReluDense_act (self_encoder_block_2__layer__1__dense_relu_dense_wi,) {} | |
call_module self_encoder_block_2__layer__1__dense_relu_dense_dropout self_encoder_block_2__layer__1__DenseReluDense_dropout (self_encoder_block_0__layer__1__dense_relu_dense_act_2,) {} | |
call_module self_encoder_block_2__layer__1__dense_relu_dense_wo self_encoder_block_2__layer__1__DenseReluDense_wo (self_encoder_block_2__layer__1__dense_relu_dense_dropout,) {} | |
call_module self_encoder_block_2__layer__1__dropout self_encoder_block_2__layer__1__dropout (self_encoder_block_2__layer__1__dense_relu_dense_wo,) {} | |
call_function add_14 <built-in function add> (add_12, self_encoder_block_2__layer__1__dropout) {} | |
call_method to_10 to (add_14, torch.float32) {} | |
call_method pow_7 pow (to_10, 2) {} | |
call_method mean_6 mean (pow_7, -1) {'keepdim': True} | |
call_function add_15 <built-in function add> (mean_6, 1e-06) {} | |
call_function rsqrt_6 <built-in method rsqrt of type object at 0x7f40b0102e20> (add_15,) {} | |
call_function mul_15 <built-in function mul> (add_14, rsqrt_6) {} | |
get_attr self_encoder_block_3__layer_0__layer_norm_weight_2 self_encoder_block_3__layer_0__layer_norm_weight () {} | |
call_function mul_16 <built-in function mul> (self_encoder_block_3__layer_0__layer_norm_weight_2, mul_15) {} | |
call_module self_encoder_block_3__layer_0__self_attention_q self_encoder_block_3__layer_0__SelfAttention_q (mul_16,) {} | |
call_method view_13 view (self_encoder_block_3__layer_0__self_attention_q, 8, -1, 8, 64) {} | |
call_method transpose_15 transpose (view_13, 1, 2) {} | |
call_module self_encoder_block_3__layer_0__self_attention_k self_encoder_block_3__layer_0__SelfAttention_k (mul_16,) {} | |
call_method view_14 view (self_encoder_block_3__layer_0__self_attention_k, 8, -1, 8, 64) {} | |
call_method transpose_16 transpose (view_14, 1, 2) {} | |
call_module self_encoder_block_3__layer_0__self_attention_v self_encoder_block_3__layer_0__SelfAttention_v (mul_16,) {} | |
call_method view_15 view (self_encoder_block_3__layer_0__self_attention_v, 8, -1, 8, 64) {} | |
call_method transpose_17 transpose (view_15, 1, 2) {} | |
call_method transpose_18 transpose (transpose_16, 3, 2) {} | |
call_function matmul_6 <built-in method matmul of type object at 0x7f40b0102e20> (transpose_15, transpose_18) {} | |
call_function iadd_4 <built-in function iadd> (matmul_6, add_3) {} | |
call_method float_5 float (iadd_4,) {} | |
call_function softmax_3 <function softmax at 0x7f3f1bf53e50> (float_5,) {'dim': -1} | |
call_method type_as_3 type_as (softmax_3, iadd_4) {} | |
call_function lowmem_dropout_2 <function lowmem_dropout at 0x7f3ef6b84d30> (type_as_3,) {'p': 0.1, 'training': True} | |
call_function matmul_7 <built-in method matmul of type object at 0x7f40b0102e20> (lowmem_dropout_2, transpose_17) {} | |
call_method transpose_19 transpose (matmul_7, 1, 2) {} | |
call_method contiguous_3 contiguous (transpose_19,) {} | |
call_method view_16 view (contiguous_3, 8, -1, 512) {} | |
call_module self_encoder_block_3__layer_0__self_attention_o self_encoder_block_3__layer_0__SelfAttention_o (view_16,) {} | |
call_module self_encoder_block_3__layer_0__dropout self_encoder_block_3__layer_0__dropout (self_encoder_block_3__layer_0__self_attention_o,) {} | |
call_func |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment