Last active
October 27, 2022 02:53
-
-
Save davidberard98/b09c4b3bfc4b9b7a41785c65c612fec8 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead. | |
warnings.warn( | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or `None` for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing `weights=ResNet50_Weights.IMAGENET1K_V1`. You can also use `weights=ResNet50_Weights.DEFAULT` to get the most up-to-date weights. | |
warnings.warn(msg) | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead. | |
warnings.warn( | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or `None` for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing `weights=ResNet50_Weights.IMAGENET1K_V1`. You can also use `weights=ResNet50_Weights.DEFAULT` to get the most up-to-date weights. | |
warnings.warn(msg) | |
[2022-10-26 04:14:27,885] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:14:29,204] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:21:22,358] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 04:21:22,361] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
[2022-10-26 04:21:22,540] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:21:22,761] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:21:22,979] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:22:40,674] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 04:22:40,675] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead. | |
warnings.warn( | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or `None` for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing `weights=ResNet50_Weights.IMAGENET1K_V1`. You can also use `weights=ResNet50_Weights.DEFAULT` to get the most up-to-date weights. | |
warnings.warn(msg) | |
[2022-10-26 04:23:03,878] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:23:05,196] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 04:23:17,615] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 04:23:17,615] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 269, in __call__ | |
return super(self.cls, obj).__call__(*args, **kwargs) # type: ignore[misc] | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl | |
return forward_call(*input, **kwargs) | |
File "<eval_with_key>.19", line 7, in forward | |
self_layer3_0_bn3 += self_layer3_0_downsample_1; iadd = self_layer3_0_bn3; self_layer3_0_bn3 = self_layer3_0_downsample_1 = None | |
RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
Call using an FX-traced Module, line 7 of the traced Module's generated forward function: | |
self_layer3_0_downsample_1 = self.self_layer3_0_downsample_1(self_layer3_0_downsample_0); self_layer3_0_downsample_0 = None | |
self_layer3_0_bn3 += self_layer3_0_downsample_1; iadd = self_layer3_0_bn3; self_layer3_0_bn3 = self_layer3_0_downsample_1 = None | |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ <--- HERE | |
self_layer3_0_relu = self.self_layer3_0_relu(iadd); iadd = None | |
self_layer3_1_conv1 = self.self_layer3_1_conv1(self_layer3_0_relu) | |
[2022-10-26 04:23:17,639] torch._dynamo.output_graph: [WARNING] ---------------------------------------- | |
[2022-10-26 04:23:17,639] torch._dynamo.output_graph: [WARNING] TORCHDYNAMO: backend compiler failed | |
[2022-10-26 04:23:17,639] torch._dynamo.output_graph: [WARNING] Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 428, in call_user_compiler | |
compiled_fn = self.compiler_fn(gm, self.example_inputs()) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 207, in compile_fn | |
submod_compiler.run(*example_inputs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/interpreter.py", line 130, in run | |
self.env[node] = self.run_node(node) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 204, in run_node | |
return getattr(self, n.op)(n.target, args, kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/interpreter.py", line 288, in call_module | |
return submod(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl | |
return forward_call(*input, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 160, in forward | |
x = self.compiled_submod(*args) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 660, in call_wrapped | |
return self._wrapped_call(self, *args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 277, in __call__ | |
raise e.with_traceback(None) | |
RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
[2022-10-26 04:23:17,647] torch._dynamo.output_graph: [WARNING] ---------------------------------------- | |
[2022-10-26 04:23:17,648] torch._dynamo.convert_frame: [ERROR] WON'T CONVERT forward /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/resnet.py line 284 | |
due to: | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 277, in __call__ | |
raise e.with_traceback(None) | |
RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
The above exception was the direct cause of the following exception: | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 437, in call_user_compiler | |
raise BackendCompilerFailed(self.compiler_fn, e) from e | |
torch._dynamo.exc.BackendCompilerFailed: compile_fn raised RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
You can suppress this exception and fall back to eager by setting: | |
torchdynamo.config.suppress_errors = True | |
from user code: | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/resnet.py", line 285, in forward | |
return self._forward_impl(x) | |
Set torch._dynamo.config.verbose=True for more information | |
========== | |
[2022-10-26 04:23:17,649] torch._dynamo.eval_frame: [ERROR] Error while processing frame | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 428, in call_user_compiler | |
compiled_fn = self.compiler_fn(gm, self.example_inputs()) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 207, in compile_fn | |
submod_compiler.run(*example_inputs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/interpreter.py", line 130, in run | |
self.env[node] = self.run_node(node) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 204, in run_node | |
return getattr(self, n.op)(n.target, args, kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/interpreter.py", line 288, in call_module | |
return submod(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl | |
return forward_call(*input, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 160, in forward | |
x = self.compiled_submod(*args) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 660, in call_wrapped | |
return self._wrapped_call(self, *args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 277, in __call__ | |
raise e.with_traceback(None) | |
RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
The above exception was the direct cause of the following exception: | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/eval_frame.py", line 241, in catch_errors | |
return hijacked_callback(frame, cache_size) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 452, in _convert_frame | |
result = inner_convert(frame, cache_size) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 118, in _fn | |
return fn(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/utils.py", line 86, in time_wrapper | |
r = func(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 325, in _convert_frame_assert | |
return _compile( | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 380, in _compile | |
out_code = transform_code_object(code, transform) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/bytecode_transformation.py", line 341, in transform_code_object | |
transformations(instructions, code_options) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 368, in transform | |
tracer.run() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 1447, in run | |
super().run() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 349, in run | |
and self.step() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 322, in step | |
getattr(self, inst.opname)(inst) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 1509, in RETURN_VALUE | |
self.output.compile_subgraph(self) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 328, in compile_subgraph | |
self.compile_and_call_fx_graph(tx, list(reversed(stack_values)), root) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 397, in compile_and_call_fx_graph | |
compiled_fn = self.call_user_compiler(gm) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 437, in call_user_compiler | |
raise BackendCompilerFailed(self.compiler_fn, e) from e | |
torch._dynamo.exc.BackendCompilerFailed: compile_fn raised RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
You can suppress this exception and fall back to eager by setting: | |
torchdynamo.config.suppress_errors = True | |
Process Process-3: | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 428, in call_user_compiler | |
compiled_fn = self.compiler_fn(gm, self.example_inputs()) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 207, in compile_fn | |
submod_compiler.run(*example_inputs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/interpreter.py", line 130, in run | |
self.env[node] = self.run_node(node) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 204, in run_node | |
return getattr(self, n.op)(n.target, args, kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/interpreter.py", line 288, in call_module | |
return submod(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl | |
return forward_call(*input, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/optimizations/distributed.py", line 160, in forward | |
x = self.compiled_submod(*args) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 660, in call_wrapped | |
return self._wrapped_call(self, *args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/fx/graph_module.py", line 277, in __call__ | |
raise e.with_traceback(None) | |
RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
The above exception was the direct cause of the following exception: | |
Traceback (most recent call last): | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/multiprocessing/process.py", line 315, in _bootstrap | |
self.run() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/multiprocessing/process.py", line 108, in run | |
self._target(*self._args, **self._kwargs) | |
File "userbenchmark/ddp_experiments/__init__.py", line 203, in run_once | |
result = trainer.measure() | |
File "/fsx/users/dberard/scratch-local/bench-fast/benchmark/torchbenchmark/util/distributed/core_model/trainer.py", line 89, in measure | |
self.benchmark.invoke() | |
File "/fsx/users/dberard/scratch-local/bench-fast/benchmark/torchbenchmark/util/model.py", line 243, in invoke | |
self.train() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/eval_frame.py", line 160, in _fn | |
return fn(*args, **kwargs) | |
File "/fsx/users/dberard/scratch-local/bench-fast/benchmark/torchbenchmark/util/framework/vision/model_factory.py", line 69, in train | |
pred = self.model(data) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl | |
return forward_call(*input, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py", line 1040, in forward | |
output = self._run_ddp_forward(*inputs, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/parallel/distributed.py", line 1000, in _run_ddp_forward | |
return module_to_run(*inputs[0], **kwargs[0]) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1363, in _call_impl | |
return forward_call(*input, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/eval_frame.py", line 241, in catch_errors | |
return hijacked_callback(frame, cache_size) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 452, in _convert_frame | |
result = inner_convert(frame, cache_size) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 118, in _fn | |
return fn(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/utils.py", line 86, in time_wrapper | |
r = func(*args, **kwargs) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 325, in _convert_frame_assert | |
return _compile( | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 380, in _compile | |
out_code = transform_code_object(code, transform) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/bytecode_transformation.py", line 341, in transform_code_object | |
transformations(instructions, code_options) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/convert_frame.py", line 368, in transform | |
tracer.run() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 1447, in run | |
super().run() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 349, in run | |
and self.step() | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 322, in step | |
getattr(self, inst.opname)(inst) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/symbolic_convert.py", line 1509, in RETURN_VALUE | |
self.output.compile_subgraph(self) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 328, in compile_subgraph | |
self.compile_and_call_fx_graph(tx, list(reversed(stack_values)), root) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 397, in compile_and_call_fx_graph | |
compiled_fn = self.call_user_compiler(gm) | |
File "/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/_dynamo/output_graph.py", line 437, in call_user_compiler | |
raise BackendCompilerFailed(self.compiler_fn, e) from e | |
torch._dynamo.exc.BackendCompilerFailed: compile_fn raised RuntimeError: Output 1 of CompiledFunctionBackward is a view and is being modified inplace. This view is the output of a function that returns multiple views. Such functions do not allow the output views to be modified inplace. You should replace the inplace operation by an out-of-place one. | |
Exception raised from handle_view_on_rebase at /scratch/dberard/bench-fast/pytorch/torch/csrc/autograd/variable.cpp:819 (most recent call first): | |
frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7fa6759cc2fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7fa6759c7afe in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libc10.so) | |
frame #2: torch::autograd::handle_view_on_rebase(torch::autograd::DifferentiableViewMeta*, bool) + 0x37b (0x7fa68ae8f97b in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #3: <unknown function> + 0x5ec24aa (0x7fa68a3864aa in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #4: <unknown function> + 0x60ef8ea (0x7fa68a5b38ea in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #5: at::_ops::add__Tensor::call(at::Tensor&, at::Tensor const&, c10::Scalar const&) + 0x15b (0x7fa688c25ccb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_cpu.so) | |
frame #6: <unknown function> + 0x5e58f8 (0x7fa6924eb8f8 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #7: <unknown function> + 0x5e5d36 (0x7fa6924ebd36 in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #8: <unknown function> + 0x1b9c03 (0x563bbf75ec03 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #9: <unknown function> + 0x194a64 (0x563bbf739a64 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #10: <unknown function> + 0x230eac (0x563bbf7d5eac in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #11: PyNumber_InPlaceAdd + 0x279 (0x563bbf74a0f9 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #12: _PyEval_EvalFrameDefault + 0x16b1 (0x563bbf77d051 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #13: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #14: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #15: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #16: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #17: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #18: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #19: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #20: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #21: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #22: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #23: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #24: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #25: _PyEval_EvalCodeWithName + 0x888 (0x563bbf771c28 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #26: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #27: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #28: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #29: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #30: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #31: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #32: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #33: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #34: _PyObject_FastCallDict + 0x5f (0x563bbf70362f in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #35: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #36: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #37: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #38: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #39: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #40: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #41: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #42: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #43: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #44: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #45: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #46: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #47: _PyObject_FastCallDict + 0xe7 (0x563bbf7036b7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #48: <unknown function> + 0x194d2b (0x563bbf739d2b in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #49: PyObject_Call + 0x45d (0x563bbf6dc5ed in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #50: _PyEval_EvalFrameDefault + 0x1f07 (0x563bbf77d8a7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #51: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #52: _PyFunction_Vectorcall + 0x1b7 (0x563bbf7727e7 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #53: <unknown function> + 0x166b2e (0x563bbf70bb2e in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #54: _PyEval_EvalFrameDefault + 0x71b (0x563bbf77c0bb in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #55: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #56: _PyEval_EvalCodeWithName + 0x260 (0x563bbf771600 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #57: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #58: _PyEval_EvalFrameDefault + 0x4c0 (0x563bbf77be60 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #59: <unknown function> + 0x8932fb (0x7fa6927992fb in /data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torch/lib/libtorch_python.so) | |
frame #60: _PyEval_EvalCodeWithName + 0xd5f (0x563bbf7720ff in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #61: _PyFunction_Vectorcall + 0x594 (0x563bbf772bc4 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #62: <unknown function> + 0x166bf8 (0x563bbf70bbf8 in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
frame #63: PyObject_Call + 0x7d (0x563bbf6dc20d in /data/home/dberard/miniconda/envs/bench-fast/bin/python) | |
While executing %submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
Original traceback: | |
None | |
You can suppress this exception and fall back to eager by setting: | |
torchdynamo.config.suppress_errors = True |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
submitit INFO (2022-10-26 04:13:31,105) - Starting with JobEnvironment(job_id=74200, hostname=a100-st-p4d24xlarge-47, local_rank=0(8), node=0(1), global_rank=0(8)) | |
submitit INFO (2022-10-26 04:13:31,106) - Loading pickle: /fsx/users/dberard/scratch-local/bench-fast/benchmark/logs_oct25/74200_submitted.pkl | |
This is node 0 | |
run_once | |
Process group: 8 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-47 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
a100-st-p4d24xlarge-47:40610:40610 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-47:40610:40610 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-47:40610:40610 [0] NCCL INFO Bootstrap : Using ens32:10.200.88.223<0> | |
a100-st-p4d24xlarge-47:40610:40610 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-47:40610:40610 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-47:40610:40610 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002dd0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002e10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002e50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002e90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002ed0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002f10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002f50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002f90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002fd0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003010 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003050 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003090 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80030d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003110 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003150 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003190 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80031d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003210 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003250 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003290 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80032d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003310 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003350 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003390 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80033d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003410 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003450 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003490 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80034d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 28 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003510 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 29 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003550 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003590 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 08/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80035d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 09/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003610 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 10/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003650 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 11/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003690 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 12/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80036d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 13/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003710 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 14/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003750 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 15/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003790 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 16/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80037d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 17/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003810 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 18/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003850 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 19/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003890 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 20/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80038d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 44 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 21/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003910 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 45 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 22/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003950 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 46 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Channel 23/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003990 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 47 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80039d0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 48 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003a10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 49 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 50 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003a50 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003a90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 51 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003ad0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 52 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003b10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 53 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003b50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 54 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003b90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 55 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003bd0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 56 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003c10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 57 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003c50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 58 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003c90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 59 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003cd0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 60 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003d10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 61 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003d50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 62 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003d90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 63 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003dd0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 64 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003e10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 65 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003e50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 66 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003e90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 67 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003ed0 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 68 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003f10 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 69 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003f50 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 70 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003f90 | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy recv connection 71 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer | |
a100-st-p4d24xlarge-47:40610:41202 [0] NCCL INFO New proxy send connection 72 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003fd0 | |
a100-st-p4d24xlarge-47:40610:41147 [0] NCCL INFO comm 0x563c6a5c7760 rank 0 nranks 8 cudaDev 0 busId 101c0 - Init COMPLETE | |
didn't get the result, queue empty | |
result {'latency_median': 62.83059310913086, 'latency_stdev': 0.47123895276864264} | |
has been placed into the queue | |
got exit 0 | |
got the result! | |
exit code: 0 and result: {'nodes': 1, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'eager', 'has_breaks': False, 'static_graph': False, 'result': {'latency_median': 62.83059310913086, 'latency_stdev': 0.47123895276864264}} | |
<RESULT>{"nodes": 1, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "eager", "has_breaks": false, "static_graph": false, "result": {"latency_median": 62.83059310913086, "latency_stdev": 0.47123895276864264}}</RESULT> | |
run_once | |
Process group: 8 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-47 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
a100-st-p4d24xlarge-47:41346:41346 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-47:41346:41346 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-47:41346:41346 [0] NCCL INFO Bootstrap : Using ens32:10.200.88.223<0> | |
a100-st-p4d24xlarge-47:41346:41346 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-47:41346:41346 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-47:41346:41346 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002dd0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002e10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002e50 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002e90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002ed0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002f10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002f50 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002f90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8002fd0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003010 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003050 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003090 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80030d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003110 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003150 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003190 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80031d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003210 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003250 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003290 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80032d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003310 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003350 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003390 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80033d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003410 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003450 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003490 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80034d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 28 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003510 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 29 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003550 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003590 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 08/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80035d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 09/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003610 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 10/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003650 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 11/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003690 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 12/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80036d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 13/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003710 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 14/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003750 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 15/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003790 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 16/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80037d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 17/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003810 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 18/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003850 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 19/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003890 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 20/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80038d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 44 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 21/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003910 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 45 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 22/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003950 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 46 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Channel 23/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003990 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 47 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c80039d0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 48 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 49 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003a10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 50 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003a50 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003a90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 51 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003ad0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 52 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003b10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 53 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003b50 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 54 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003b90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 55 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003bd0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 56 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003c10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 57 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003c50 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 58 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003c90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 59 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003cd0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 60 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003d10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 61 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003d50 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 62 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003d90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 63 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003dd0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 64 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003e10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 65 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003e50 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 66 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003e90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 67 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003ed0 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 68 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003f10 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 69 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003f50 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 70 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003f90 | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy recv connection 71 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer | |
a100-st-p4d24xlarge-47:41346:42521 [0] NCCL INFO New proxy send connection 72 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2c8003fd0 | |
a100-st-p4d24xlarge-47:41346:42475 [0] NCCL INFO comm 0x563c6c315c50 rank 0 nranks 8 cudaDev 0 busId 101c0 - Init COMPLETE | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
result {'latency_median': 80.85042953491211, 'latency_stdev': 0.2639016516493464} | |
has been placed into the queue | |
got exit 0 | |
got the result! | |
exit code: 0 and result: {'nodes': 1, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'torchdynamo_inductor', 'has_breaks': False, 'static_graph': False, 'result': {'latency_median': 80.85042953491211, 'latency_stdev': 0.2639016516493464}} | |
<RESULT>{"nodes": 1, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "torchdynamo_inductor", "has_breaks": false, "static_graph": false, "result": {"latency_median": 80.85042953491211, "latency_stdev": 0.2639016516493464}}</RESULT> | |
run_once | |
Process group: 8 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-47 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
a100-st-p4d24xlarge-47:87091:87091 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-47:87091:87091 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-47:87091:87091 [0] NCCL INFO Bootstrap : Using ens32:10.200.88.223<0> | |
a100-st-p4d24xlarge-47:87091:87091 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-47:87091:87091 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-47:87091:87091 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002dd0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002e10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002e50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002e90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002ed0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002f10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002f50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002f90 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc002fd0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003010 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003050 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003090 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0030d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003110 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003150 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003190 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0031d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003210 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003250 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003290 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0032d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003310 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003350 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003390 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0033d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003410 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003450 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003490 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0034d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 28 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003510 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 29 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003550 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003590 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 08/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0035d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 09/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003610 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 10/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003650 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 11/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003690 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 12/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0036d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 13/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003710 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 14/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003750 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 15/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003790 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 16/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0037d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 17/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003810 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 18/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003850 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 19/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003890 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 20/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0038d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 44 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 21/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003910 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 45 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 22/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003950 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 46 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Channel 23/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003990 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 47 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc0039d0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 48 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 49 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003a10 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003a50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 50 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003a90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 51 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003ad0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 52 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003b10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 53 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003b50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 54 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003b90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 55 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003bd0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 56 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003c10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 57 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003c50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 58 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003c90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 59 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003cd0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 60 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003d10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 61 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003d50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 62 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003d90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 63 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003dd0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 64 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003e10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 65 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003e50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 66 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003e90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 67 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003ed0 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 68 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003f10 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 69 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003f50 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 70 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003f90 | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy recv connection 71 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer | |
a100-st-p4d24xlarge-47:87091:88232 [0] NCCL INFO New proxy send connection 72 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7fa2bc003fd0 | |
a100-st-p4d24xlarge-47:87091:88189 [0] NCCL INFO comm 0x563c6b00afa0 rank 0 nranks 8 cudaDev 0 busId 101c0 - Init COMPLETE | |
didn't get the result, queue empty | |
DDPOptimizer used bucket cap 26214400 and produced the following buckets: | |
Index Size (b) Param Names | |
------- ---------- --------------------------------- | |
0 8196000 self_fc_weight | |
self_fc_bias | |
1 31502336 self_layer4_2_bn3_weight | |
self_layer4_2_bn3_bias | |
self_layer4_2_conv3_weight | |
self_layer4_2_bn2_weight | |
self_layer4_2_bn2_bias | |
self_layer4_2_conv2_weight | |
self_layer4_2_bn1_weight | |
self_layer4_2_bn1_bias | |
self_layer4_2_conv1_weight | |
self_layer4_1_bn3_weight | |
self_layer4_1_bn3_bias | |
self_layer4_1_conv3_weight | |
self_layer4_1_bn2_weight | |
self_layer4_1_bn2_bias | |
self_layer4_1_conv2_weight | |
2 26255360 self_layer4_1_bn1_weight | |
self_layer4_1_bn1_bias | |
self_layer4_1_conv1_weight | |
self_layer4_0_downsample_1_weight | |
self_layer4_0_downsample_1_bias | |
self_layer4_0_downsample_0_weight | |
self_layer4_0_bn3_weight | |
self_layer4_0_bn3_bias | |
self_layer4_0_conv3_weight | |
self_layer4_0_bn2_weight | |
self_layer4_0_bn2_bias | |
self_layer4_0_conv2_weight | |
3 26550272 self_layer4_0_bn1_weight | |
self_layer4_0_bn1_bias | |
self_layer4_0_conv1_weight | |
self_layer3_5_bn3_weight | |
self_layer3_5_bn3_bias | |
self_layer3_5_conv3_weight | |
self_layer3_5_bn2_weight | |
self_layer3_5_bn2_bias | |
self_layer3_5_conv2_weight | |
self_layer3_5_bn1_weight | |
self_layer3_5_bn1_bias | |
self_layer3_5_conv1_weight | |
self_layer3_4_bn3_weight | |
self_layer3_4_bn3_bias | |
self_layer3_4_conv3_weight | |
self_layer3_4_bn2_weight | |
self_layer3_4_bn2_bias | |
self_layer3_4_conv2_weight | |
self_layer3_4_bn1_weight | |
self_layer3_4_bn1_bias | |
self_layer3_4_conv1_weight | |
self_layer3_3_bn3_weight | |
self_layer3_3_bn3_bias | |
self_layer3_3_conv3_weight | |
self_layer3_3_bn2_weight | |
self_layer3_3_bn2_bias | |
self_layer3_3_conv2_weight | |
self_layer3_3_bn1_weight | |
self_layer3_3_bn1_bias | |
self_layer3_3_conv1_weight | |
self_layer3_2_bn3_weight | |
self_layer3_2_bn3_bias | |
self_layer3_2_conv3_weight | |
self_layer3_2_bn2_weight | |
self_layer3_2_bn2_bias | |
self_layer3_2_conv2_weight | |
self_layer3_2_bn1_weight | |
self_layer3_2_bn1_bias | |
self_layer3_2_conv1_weight | |
self_layer3_1_bn3_weight | |
self_layer3_1_bn3_bias | |
self_layer3_1_conv3_weight | |
self_layer3_1_bn2_weight | |
self_layer3_1_bn2_bias | |
self_layer3_1_conv2_weight | |
self_layer3_1_bn1_weight | |
self_layer3_1_bn1_bias | |
self_layer3_1_conv1_weight | |
self_layer3_0_downsample_1_weight | |
self_layer3_0_downsample_1_bias | |
self_layer3_0_downsample_0_weight | |
4 9724160 self_layer3_0_bn3_weight | |
self_layer3_0_bn3_bias | |
self_layer3_0_conv3_weight | |
self_layer3_0_bn2_weight | |
self_layer3_0_bn2_bias | |
self_layer3_0_conv2_weight | |
self_layer3_0_bn1_weight | |
self_layer3_0_bn1_bias | |
self_layer3_0_conv1_weight | |
self_layer2_3_bn3_weight | |
self_layer2_3_bn3_bias | |
self_layer2_3_conv3_weight | |
self_layer2_3_bn2_weight | |
self_layer2_3_bn2_bias | |
self_layer2_3_conv2_weight | |
self_layer2_3_bn1_weight | |
self_layer2_3_bn1_bias | |
self_layer2_3_conv1_weight | |
self_layer2_2_bn3_weight | |
self_layer2_2_bn3_bias | |
self_layer2_2_conv3_weight | |
self_layer2_2_bn2_weight | |
self_layer2_2_bn2_bias | |
self_layer2_2_conv2_weight | |
self_layer2_2_bn1_weight | |
self_layer2_2_bn1_bias | |
self_layer2_2_conv1_weight | |
self_layer2_1_bn3_weight | |
self_layer2_1_bn3_bias | |
self_layer2_1_conv3_weight | |
self_layer2_1_bn2_weight | |
self_layer2_1_bn2_bias | |
self_layer2_1_conv2_weight | |
self_layer2_1_bn1_weight | |
self_layer2_1_bn1_bias | |
self_layer2_1_conv1_weight | |
self_layer2_0_downsample_1_weight | |
self_layer2_0_downsample_1_bias | |
self_layer2_0_downsample_0_weight | |
self_layer2_0_bn3_weight | |
self_layer2_0_bn3_bias | |
self_layer2_0_conv3_weight | |
self_layer2_0_bn2_weight | |
self_layer2_0_bn2_bias | |
self_layer2_0_conv2_weight | |
self_layer2_0_bn1_weight | |
self_layer2_0_bn1_bias | |
self_layer2_0_conv1_weight | |
self_layer1_2_bn3_weight | |
self_layer1_2_bn3_bias | |
self_layer1_2_conv3_weight | |
self_layer1_2_bn2_weight | |
self_layer1_2_bn2_bias | |
self_layer1_2_conv2_weight | |
self_layer1_2_bn1_weight | |
self_layer1_2_bn1_bias | |
self_layer1_2_conv1_weight | |
self_layer1_1_bn3_weight | |
self_layer1_1_bn3_bias | |
self_layer1_1_conv3_weight | |
self_layer1_1_bn2_weight | |
self_layer1_1_bn2_bias | |
self_layer1_1_conv2_weight | |
self_layer1_1_bn1_weight | |
self_layer1_1_bn1_bias | |
self_layer1_1_conv1_weight | |
self_layer1_0_downsample_1_weight | |
self_layer1_0_downsample_1_bias | |
self_layer1_0_downsample_0_weight | |
self_layer1_0_bn3_weight | |
self_layer1_0_bn3_bias | |
self_layer1_0_conv3_weight | |
self_layer1_0_bn2_weight | |
self_layer1_0_bn2_bias | |
self_layer1_0_conv2_weight | |
self_layer1_0_bn1_weight | |
self_layer1_0_bn1_bias | |
self_layer1_0_conv1_weight | |
self_bn1_weight | |
self_bn1_bias | |
self_conv1_weight | |
---orig graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%self_conv1 : [#users=1] = call_module[target=self_conv1](args = (%x,), kwargs = {}) | |
%self_bn1 : [#users=1] = call_module[target=self_bn1](args = (%self_conv1,), kwargs = {}) | |
%self_relu : [#users=1] = call_module[target=self_relu](args = (%self_bn1,), kwargs = {}) | |
%self_maxpool : [#users=2] = call_module[target=self_maxpool](args = (%self_relu,), kwargs = {}) | |
%self_layer1_0_conv1 : [#users=1] = call_module[target=self_layer1_0_conv1](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_bn1 : [#users=1] = call_module[target=self_layer1_0_bn1](args = (%self_layer1_0_conv1,), kwargs = {}) | |
%self_layer1_0_relu : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn1,), kwargs = {}) | |
%self_layer1_0_conv2 : [#users=1] = call_module[target=self_layer1_0_conv2](args = (%self_layer1_0_relu,), kwargs = {}) | |
%self_layer1_0_bn2 : [#users=1] = call_module[target=self_layer1_0_bn2](args = (%self_layer1_0_conv2,), kwargs = {}) | |
%self_layer1_0_relu_1 : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn2,), kwargs = {}) | |
%self_layer1_0_conv3 : [#users=1] = call_module[target=self_layer1_0_conv3](args = (%self_layer1_0_relu_1,), kwargs = {}) | |
%self_layer1_0_bn3 : [#users=1] = call_module[target=self_layer1_0_bn3](args = (%self_layer1_0_conv3,), kwargs = {}) | |
%self_layer1_0_downsample_0 : [#users=1] = call_module[target=self_layer1_0_downsample_0](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_downsample_1 : [#users=1] = call_module[target=self_layer1_0_downsample_1](args = (%self_layer1_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_0_bn3, %self_layer1_0_downsample_1), kwargs = {}) | |
%self_layer1_0_relu_2 : [#users=2] = call_module[target=self_layer1_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer1_1_conv1 : [#users=1] = call_module[target=self_layer1_1_conv1](args = (%self_layer1_0_relu_2,), kwargs = {}) | |
%self_layer1_1_bn1 : [#users=1] = call_module[target=self_layer1_1_bn1](args = (%self_layer1_1_conv1,), kwargs = {}) | |
%self_layer1_1_relu : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn1,), kwargs = {}) | |
%self_layer1_1_conv2 : [#users=1] = call_module[target=self_layer1_1_conv2](args = (%self_layer1_1_relu,), kwargs = {}) | |
%self_layer1_1_bn2 : [#users=1] = call_module[target=self_layer1_1_bn2](args = (%self_layer1_1_conv2,), kwargs = {}) | |
%self_layer1_1_relu_1 : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn2,), kwargs = {}) | |
%self_layer1_1_conv3 : [#users=1] = call_module[target=self_layer1_1_conv3](args = (%self_layer1_1_relu_1,), kwargs = {}) | |
%self_layer1_1_bn3 : [#users=1] = call_module[target=self_layer1_1_bn3](args = (%self_layer1_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_1_bn3, %self_layer1_0_relu_2), kwargs = {}) | |
%self_layer1_1_relu_2 : [#users=2] = call_module[target=self_layer1_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer1_2_conv1 : [#users=1] = call_module[target=self_layer1_2_conv1](args = (%self_layer1_1_relu_2,), kwargs = {}) | |
%self_layer1_2_bn1 : [#users=1] = call_module[target=self_layer1_2_bn1](args = (%self_layer1_2_conv1,), kwargs = {}) | |
%self_layer1_2_relu : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn1,), kwargs = {}) | |
%self_layer1_2_conv2 : [#users=1] = call_module[target=self_layer1_2_conv2](args = (%self_layer1_2_relu,), kwargs = {}) | |
%self_layer1_2_bn2 : [#users=1] = call_module[target=self_layer1_2_bn2](args = (%self_layer1_2_conv2,), kwargs = {}) | |
%self_layer1_2_relu_1 : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn2,), kwargs = {}) | |
%self_layer1_2_conv3 : [#users=1] = call_module[target=self_layer1_2_conv3](args = (%self_layer1_2_relu_1,), kwargs = {}) | |
%self_layer1_2_bn3 : [#users=1] = call_module[target=self_layer1_2_bn3](args = (%self_layer1_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_2_bn3, %self_layer1_1_relu_2), kwargs = {}) | |
%self_layer1_2_relu_2 : [#users=2] = call_module[target=self_layer1_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer2_0_conv1 : [#users=1] = call_module[target=self_layer2_0_conv1](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_bn1 : [#users=1] = call_module[target=self_layer2_0_bn1](args = (%self_layer2_0_conv1,), kwargs = {}) | |
%self_layer2_0_relu : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn1,), kwargs = {}) | |
%self_layer2_0_conv2 : [#users=1] = call_module[target=self_layer2_0_conv2](args = (%self_layer2_0_relu,), kwargs = {}) | |
%self_layer2_0_bn2 : [#users=1] = call_module[target=self_layer2_0_bn2](args = (%self_layer2_0_conv2,), kwargs = {}) | |
%self_layer2_0_relu_1 : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn2,), kwargs = {}) | |
%self_layer2_0_conv3 : [#users=1] = call_module[target=self_layer2_0_conv3](args = (%self_layer2_0_relu_1,), kwargs = {}) | |
%self_layer2_0_bn3 : [#users=1] = call_module[target=self_layer2_0_bn3](args = (%self_layer2_0_conv3,), kwargs = {}) | |
%self_layer2_0_downsample_0 : [#users=1] = call_module[target=self_layer2_0_downsample_0](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_downsample_1 : [#users=1] = call_module[target=self_layer2_0_downsample_1](args = (%self_layer2_0_downsample_0,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_0_bn3, %self_layer2_0_downsample_1), kwargs = {}) | |
%self_layer2_0_relu_2 : [#users=2] = call_module[target=self_layer2_0_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer2_1_conv1 : [#users=1] = call_module[target=self_layer2_1_conv1](args = (%self_layer2_0_relu_2,), kwargs = {}) | |
%self_layer2_1_bn1 : [#users=1] = call_module[target=self_layer2_1_bn1](args = (%self_layer2_1_conv1,), kwargs = {}) | |
%self_layer2_1_relu : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn1,), kwargs = {}) | |
%self_layer2_1_conv2 : [#users=1] = call_module[target=self_layer2_1_conv2](args = (%self_layer2_1_relu,), kwargs = {}) | |
%self_layer2_1_bn2 : [#users=1] = call_module[target=self_layer2_1_bn2](args = (%self_layer2_1_conv2,), kwargs = {}) | |
%self_layer2_1_relu_1 : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn2,), kwargs = {}) | |
%self_layer2_1_conv3 : [#users=1] = call_module[target=self_layer2_1_conv3](args = (%self_layer2_1_relu_1,), kwargs = {}) | |
%self_layer2_1_bn3 : [#users=1] = call_module[target=self_layer2_1_bn3](args = (%self_layer2_1_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_1_bn3, %self_layer2_0_relu_2), kwargs = {}) | |
%self_layer2_1_relu_2 : [#users=2] = call_module[target=self_layer2_1_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer2_2_conv1 : [#users=1] = call_module[target=self_layer2_2_conv1](args = (%self_layer2_1_relu_2,), kwargs = {}) | |
%self_layer2_2_bn1 : [#users=1] = call_module[target=self_layer2_2_bn1](args = (%self_layer2_2_conv1,), kwargs = {}) | |
%self_layer2_2_relu : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn1,), kwargs = {}) | |
%self_layer2_2_conv2 : [#users=1] = call_module[target=self_layer2_2_conv2](args = (%self_layer2_2_relu,), kwargs = {}) | |
%self_layer2_2_bn2 : [#users=1] = call_module[target=self_layer2_2_bn2](args = (%self_layer2_2_conv2,), kwargs = {}) | |
%self_layer2_2_relu_1 : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn2,), kwargs = {}) | |
%self_layer2_2_conv3 : [#users=1] = call_module[target=self_layer2_2_conv3](args = (%self_layer2_2_relu_1,), kwargs = {}) | |
%self_layer2_2_bn3 : [#users=1] = call_module[target=self_layer2_2_bn3](args = (%self_layer2_2_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_2_bn3, %self_layer2_1_relu_2), kwargs = {}) | |
%self_layer2_2_relu_2 : [#users=2] = call_module[target=self_layer2_2_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer2_3_conv1 : [#users=1] = call_module[target=self_layer2_3_conv1](args = (%self_layer2_2_relu_2,), kwargs = {}) | |
%self_layer2_3_bn1 : [#users=1] = call_module[target=self_layer2_3_bn1](args = (%self_layer2_3_conv1,), kwargs = {}) | |
%self_layer2_3_relu : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn1,), kwargs = {}) | |
%self_layer2_3_conv2 : [#users=1] = call_module[target=self_layer2_3_conv2](args = (%self_layer2_3_relu,), kwargs = {}) | |
%self_layer2_3_bn2 : [#users=1] = call_module[target=self_layer2_3_bn2](args = (%self_layer2_3_conv2,), kwargs = {}) | |
%self_layer2_3_relu_1 : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn2,), kwargs = {}) | |
%self_layer2_3_conv3 : [#users=1] = call_module[target=self_layer2_3_conv3](args = (%self_layer2_3_relu_1,), kwargs = {}) | |
%self_layer2_3_bn3 : [#users=1] = call_module[target=self_layer2_3_bn3](args = (%self_layer2_3_conv3,), kwargs = {}) | |
%iadd_6 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_3_bn3, %self_layer2_2_relu_2), kwargs = {}) | |
%self_layer2_3_relu_2 : [#users=2] = call_module[target=self_layer2_3_relu](args = (%iadd_6,), kwargs = {}) | |
%self_layer3_0_conv1 : [#users=1] = call_module[target=self_layer3_0_conv1](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_bn1 : [#users=1] = call_module[target=self_layer3_0_bn1](args = (%self_layer3_0_conv1,), kwargs = {}) | |
%self_layer3_0_relu : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn1,), kwargs = {}) | |
%self_layer3_0_conv2 : [#users=1] = call_module[target=self_layer3_0_conv2](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_0_bn2 : [#users=1] = call_module[target=self_layer3_0_bn2](args = (%self_layer3_0_conv2,), kwargs = {}) | |
%self_layer3_0_relu_1 : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn2,), kwargs = {}) | |
%self_layer3_0_conv3 : [#users=1] = call_module[target=self_layer3_0_conv3](args = (%self_layer3_0_relu_1,), kwargs = {}) | |
%self_layer3_0_bn3 : [#users=1] = call_module[target=self_layer3_0_bn3](args = (%self_layer3_0_conv3,), kwargs = {}) | |
%self_layer3_0_downsample_0 : [#users=1] = call_module[target=self_layer3_0_downsample_0](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_downsample_1 : [#users=1] = call_module[target=self_layer3_0_downsample_1](args = (%self_layer3_0_downsample_0,), kwargs = {}) | |
%iadd_7 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_0_bn3, %self_layer3_0_downsample_1), kwargs = {}) | |
%self_layer3_0_relu_2 : [#users=2] = call_module[target=self_layer3_0_relu](args = (%iadd_7,), kwargs = {}) | |
%self_layer3_1_conv1 : [#users=1] = call_module[target=self_layer3_1_conv1](args = (%self_layer3_0_relu_2,), kwargs = {}) | |
%self_layer3_1_bn1 : [#users=1] = call_module[target=self_layer3_1_bn1](args = (%self_layer3_1_conv1,), kwargs = {}) | |
%self_layer3_1_relu : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn1,), kwargs = {}) | |
%self_layer3_1_conv2 : [#users=1] = call_module[target=self_layer3_1_conv2](args = (%self_layer3_1_relu,), kwargs = {}) | |
%self_layer3_1_bn2 : [#users=1] = call_module[target=self_layer3_1_bn2](args = (%self_layer3_1_conv2,), kwargs = {}) | |
%self_layer3_1_relu_1 : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn2,), kwargs = {}) | |
%self_layer3_1_conv3 : [#users=1] = call_module[target=self_layer3_1_conv3](args = (%self_layer3_1_relu_1,), kwargs = {}) | |
%self_layer3_1_bn3 : [#users=1] = call_module[target=self_layer3_1_bn3](args = (%self_layer3_1_conv3,), kwargs = {}) | |
%iadd_8 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_1_bn3, %self_layer3_0_relu_2), kwargs = {}) | |
%self_layer3_1_relu_2 : [#users=2] = call_module[target=self_layer3_1_relu](args = (%iadd_8,), kwargs = {}) | |
%self_layer3_2_conv1 : [#users=1] = call_module[target=self_layer3_2_conv1](args = (%self_layer3_1_relu_2,), kwargs = {}) | |
%self_layer3_2_bn1 : [#users=1] = call_module[target=self_layer3_2_bn1](args = (%self_layer3_2_conv1,), kwargs = {}) | |
%self_layer3_2_relu : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn1,), kwargs = {}) | |
%self_layer3_2_conv2 : [#users=1] = call_module[target=self_layer3_2_conv2](args = (%self_layer3_2_relu,), kwargs = {}) | |
%self_layer3_2_bn2 : [#users=1] = call_module[target=self_layer3_2_bn2](args = (%self_layer3_2_conv2,), kwargs = {}) | |
%self_layer3_2_relu_1 : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn2,), kwargs = {}) | |
%self_layer3_2_conv3 : [#users=1] = call_module[target=self_layer3_2_conv3](args = (%self_layer3_2_relu_1,), kwargs = {}) | |
%self_layer3_2_bn3 : [#users=1] = call_module[target=self_layer3_2_bn3](args = (%self_layer3_2_conv3,), kwargs = {}) | |
%iadd_9 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_2_bn3, %self_layer3_1_relu_2), kwargs = {}) | |
%self_layer3_2_relu_2 : [#users=2] = call_module[target=self_layer3_2_relu](args = (%iadd_9,), kwargs = {}) | |
%self_layer3_3_conv1 : [#users=1] = call_module[target=self_layer3_3_conv1](args = (%self_layer3_2_relu_2,), kwargs = {}) | |
%self_layer3_3_bn1 : [#users=1] = call_module[target=self_layer3_3_bn1](args = (%self_layer3_3_conv1,), kwargs = {}) | |
%self_layer3_3_relu : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn1,), kwargs = {}) | |
%self_layer3_3_conv2 : [#users=1] = call_module[target=self_layer3_3_conv2](args = (%self_layer3_3_relu,), kwargs = {}) | |
%self_layer3_3_bn2 : [#users=1] = call_module[target=self_layer3_3_bn2](args = (%self_layer3_3_conv2,), kwargs = {}) | |
%self_layer3_3_relu_1 : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn2,), kwargs = {}) | |
%self_layer3_3_conv3 : [#users=1] = call_module[target=self_layer3_3_conv3](args = (%self_layer3_3_relu_1,), kwargs = {}) | |
%self_layer3_3_bn3 : [#users=1] = call_module[target=self_layer3_3_bn3](args = (%self_layer3_3_conv3,), kwargs = {}) | |
%iadd_10 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_3_bn3, %self_layer3_2_relu_2), kwargs = {}) | |
%self_layer3_3_relu_2 : [#users=2] = call_module[target=self_layer3_3_relu](args = (%iadd_10,), kwargs = {}) | |
%self_layer3_4_conv1 : [#users=1] = call_module[target=self_layer3_4_conv1](args = (%self_layer3_3_relu_2,), kwargs = {}) | |
%self_layer3_4_bn1 : [#users=1] = call_module[target=self_layer3_4_bn1](args = (%self_layer3_4_conv1,), kwargs = {}) | |
%self_layer3_4_relu : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn1,), kwargs = {}) | |
%self_layer3_4_conv2 : [#users=1] = call_module[target=self_layer3_4_conv2](args = (%self_layer3_4_relu,), kwargs = {}) | |
%self_layer3_4_bn2 : [#users=1] = call_module[target=self_layer3_4_bn2](args = (%self_layer3_4_conv2,), kwargs = {}) | |
%self_layer3_4_relu_1 : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn2,), kwargs = {}) | |
%self_layer3_4_conv3 : [#users=1] = call_module[target=self_layer3_4_conv3](args = (%self_layer3_4_relu_1,), kwargs = {}) | |
%self_layer3_4_bn3 : [#users=1] = call_module[target=self_layer3_4_bn3](args = (%self_layer3_4_conv3,), kwargs = {}) | |
%iadd_11 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_4_bn3, %self_layer3_3_relu_2), kwargs = {}) | |
%self_layer3_4_relu_2 : [#users=2] = call_module[target=self_layer3_4_relu](args = (%iadd_11,), kwargs = {}) | |
%self_layer3_5_conv1 : [#users=1] = call_module[target=self_layer3_5_conv1](args = (%self_layer3_4_relu_2,), kwargs = {}) | |
%self_layer3_5_bn1 : [#users=1] = call_module[target=self_layer3_5_bn1](args = (%self_layer3_5_conv1,), kwargs = {}) | |
%self_layer3_5_relu : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn1,), kwargs = {}) | |
%self_layer3_5_conv2 : [#users=1] = call_module[target=self_layer3_5_conv2](args = (%self_layer3_5_relu,), kwargs = {}) | |
%self_layer3_5_bn2 : [#users=1] = call_module[target=self_layer3_5_bn2](args = (%self_layer3_5_conv2,), kwargs = {}) | |
%self_layer3_5_relu_1 : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn2,), kwargs = {}) | |
%self_layer3_5_conv3 : [#users=1] = call_module[target=self_layer3_5_conv3](args = (%self_layer3_5_relu_1,), kwargs = {}) | |
%self_layer3_5_bn3 : [#users=1] = call_module[target=self_layer3_5_bn3](args = (%self_layer3_5_conv3,), kwargs = {}) | |
%iadd_12 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_5_bn3, %self_layer3_4_relu_2), kwargs = {}) | |
%self_layer3_5_relu_2 : [#users=2] = call_module[target=self_layer3_5_relu](args = (%iadd_12,), kwargs = {}) | |
%self_layer4_0_conv1 : [#users=1] = call_module[target=self_layer4_0_conv1](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_bn1 : [#users=1] = call_module[target=self_layer4_0_bn1](args = (%self_layer4_0_conv1,), kwargs = {}) | |
%self_layer4_0_relu : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn1,), kwargs = {}) | |
%self_layer4_0_conv2 : [#users=1] = call_module[target=self_layer4_0_conv2](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_0_bn2 : [#users=1] = call_module[target=self_layer4_0_bn2](args = (%self_layer4_0_conv2,), kwargs = {}) | |
%self_layer4_0_relu_1 : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn2,), kwargs = {}) | |
%self_layer4_0_conv3 : [#users=1] = call_module[target=self_layer4_0_conv3](args = (%self_layer4_0_relu_1,), kwargs = {}) | |
%self_layer4_0_bn3 : [#users=1] = call_module[target=self_layer4_0_bn3](args = (%self_layer4_0_conv3,), kwargs = {}) | |
%self_layer4_0_downsample_0 : [#users=1] = call_module[target=self_layer4_0_downsample_0](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_downsample_1 : [#users=1] = call_module[target=self_layer4_0_downsample_1](args = (%self_layer4_0_downsample_0,), kwargs = {}) | |
%iadd_13 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_0_bn3, %self_layer4_0_downsample_1), kwargs = {}) | |
%self_layer4_0_relu_2 : [#users=2] = call_module[target=self_layer4_0_relu](args = (%iadd_13,), kwargs = {}) | |
%self_layer4_1_conv1 : [#users=1] = call_module[target=self_layer4_1_conv1](args = (%self_layer4_0_relu_2,), kwargs = {}) | |
%self_layer4_1_bn1 : [#users=1] = call_module[target=self_layer4_1_bn1](args = (%self_layer4_1_conv1,), kwargs = {}) | |
%self_layer4_1_relu : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn1,), kwargs = {}) | |
%self_layer4_1_conv2 : [#users=1] = call_module[target=self_layer4_1_conv2](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_1_bn2 : [#users=1] = call_module[target=self_layer4_1_bn2](args = (%self_layer4_1_conv2,), kwargs = {}) | |
%self_layer4_1_relu_1 : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn2,), kwargs = {}) | |
%self_layer4_1_conv3 : [#users=1] = call_module[target=self_layer4_1_conv3](args = (%self_layer4_1_relu_1,), kwargs = {}) | |
%self_layer4_1_bn3 : [#users=1] = call_module[target=self_layer4_1_bn3](args = (%self_layer4_1_conv3,), kwargs = {}) | |
%iadd_14 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_1_bn3, %self_layer4_0_relu_2), kwargs = {}) | |
%self_layer4_1_relu_2 : [#users=2] = call_module[target=self_layer4_1_relu](args = (%iadd_14,), kwargs = {}) | |
%self_layer4_2_conv1 : [#users=1] = call_module[target=self_layer4_2_conv1](args = (%self_layer4_1_relu_2,), kwargs = {}) | |
%self_layer4_2_bn1 : [#users=1] = call_module[target=self_layer4_2_bn1](args = (%self_layer4_2_conv1,), kwargs = {}) | |
%self_layer4_2_relu : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn1,), kwargs = {}) | |
%self_layer4_2_conv2 : [#users=1] = call_module[target=self_layer4_2_conv2](args = (%self_layer4_2_relu,), kwargs = {}) | |
%self_layer4_2_bn2 : [#users=1] = call_module[target=self_layer4_2_bn2](args = (%self_layer4_2_conv2,), kwargs = {}) | |
%self_layer4_2_relu_1 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn2,), kwargs = {}) | |
%self_layer4_2_conv3 : [#users=1] = call_module[target=self_layer4_2_conv3](args = (%self_layer4_2_relu_1,), kwargs = {}) | |
%self_layer4_2_bn3 : [#users=1] = call_module[target=self_layer4_2_bn3](args = (%self_layer4_2_conv3,), kwargs = {}) | |
%iadd_15 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_2_bn3, %self_layer4_1_relu_2), kwargs = {}) | |
%self_layer4_2_relu_2 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%iadd_15,), kwargs = {}) | |
%self_avgpool : [#users=1] = call_module[target=self_avgpool](args = (%self_layer4_2_relu_2,), kwargs = {}) | |
%flatten : [#users=1] = call_function[target=torch.flatten](args = (%self_avgpool, 1), kwargs = {}) | |
%self_fc : [#users=1] = call_module[target=self_fc](args = (%flatten,), kwargs = {}) | |
return (self_fc,) | |
---split graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
%submod_4 : [#users=1] = call_module[target=submod_4](args = (%submod_3,), kwargs = {}) | |
return (submod_4,) | |
--- graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
%submod_4 : [#users=1] = call_module[target=submod_4](args = (%submod_3,), kwargs = {}) | |
return (submod_4,) | |
---submod_0 graph--- | |
graph(): | |
%x : [#users=1] = placeholder[target=x] | |
%self_conv1 : [#users=1] = call_module[target=self_conv1](args = (%x,), kwargs = {}) | |
%self_bn1 : [#users=1] = call_module[target=self_bn1](args = (%self_conv1,), kwargs = {}) | |
%self_relu : [#users=1] = call_module[target=self_relu](args = (%self_bn1,), kwargs = {}) | |
%self_maxpool : [#users=2] = call_module[target=self_maxpool](args = (%self_relu,), kwargs = {}) | |
%self_layer1_0_conv1 : [#users=1] = call_module[target=self_layer1_0_conv1](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_bn1 : [#users=1] = call_module[target=self_layer1_0_bn1](args = (%self_layer1_0_conv1,), kwargs = {}) | |
%self_layer1_0_relu : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn1,), kwargs = {}) | |
%self_layer1_0_conv2 : [#users=1] = call_module[target=self_layer1_0_conv2](args = (%self_layer1_0_relu,), kwargs = {}) | |
%self_layer1_0_bn2 : [#users=1] = call_module[target=self_layer1_0_bn2](args = (%self_layer1_0_conv2,), kwargs = {}) | |
%self_layer1_0_relu_1 : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn2,), kwargs = {}) | |
%self_layer1_0_conv3 : [#users=1] = call_module[target=self_layer1_0_conv3](args = (%self_layer1_0_relu_1,), kwargs = {}) | |
%self_layer1_0_bn3 : [#users=1] = call_module[target=self_layer1_0_bn3](args = (%self_layer1_0_conv3,), kwargs = {}) | |
%self_layer1_0_downsample_0 : [#users=1] = call_module[target=self_layer1_0_downsample_0](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_downsample_1 : [#users=1] = call_module[target=self_layer1_0_downsample_1](args = (%self_layer1_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_0_bn3, %self_layer1_0_downsample_1), kwargs = {}) | |
%self_layer1_0_relu_2 : [#users=2] = call_module[target=self_layer1_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer1_1_conv1 : [#users=1] = call_module[target=self_layer1_1_conv1](args = (%self_layer1_0_relu_2,), kwargs = {}) | |
%self_layer1_1_bn1 : [#users=1] = call_module[target=self_layer1_1_bn1](args = (%self_layer1_1_conv1,), kwargs = {}) | |
%self_layer1_1_relu : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn1,), kwargs = {}) | |
%self_layer1_1_conv2 : [#users=1] = call_module[target=self_layer1_1_conv2](args = (%self_layer1_1_relu,), kwargs = {}) | |
%self_layer1_1_bn2 : [#users=1] = call_module[target=self_layer1_1_bn2](args = (%self_layer1_1_conv2,), kwargs = {}) | |
%self_layer1_1_relu_1 : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn2,), kwargs = {}) | |
%self_layer1_1_conv3 : [#users=1] = call_module[target=self_layer1_1_conv3](args = (%self_layer1_1_relu_1,), kwargs = {}) | |
%self_layer1_1_bn3 : [#users=1] = call_module[target=self_layer1_1_bn3](args = (%self_layer1_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_1_bn3, %self_layer1_0_relu_2), kwargs = {}) | |
%self_layer1_1_relu_2 : [#users=2] = call_module[target=self_layer1_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer1_2_conv1 : [#users=1] = call_module[target=self_layer1_2_conv1](args = (%self_layer1_1_relu_2,), kwargs = {}) | |
%self_layer1_2_bn1 : [#users=1] = call_module[target=self_layer1_2_bn1](args = (%self_layer1_2_conv1,), kwargs = {}) | |
%self_layer1_2_relu : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn1,), kwargs = {}) | |
%self_layer1_2_conv2 : [#users=1] = call_module[target=self_layer1_2_conv2](args = (%self_layer1_2_relu,), kwargs = {}) | |
%self_layer1_2_bn2 : [#users=1] = call_module[target=self_layer1_2_bn2](args = (%self_layer1_2_conv2,), kwargs = {}) | |
%self_layer1_2_relu_1 : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn2,), kwargs = {}) | |
%self_layer1_2_conv3 : [#users=1] = call_module[target=self_layer1_2_conv3](args = (%self_layer1_2_relu_1,), kwargs = {}) | |
%self_layer1_2_bn3 : [#users=1] = call_module[target=self_layer1_2_bn3](args = (%self_layer1_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_2_bn3, %self_layer1_1_relu_2), kwargs = {}) | |
%self_layer1_2_relu_2 : [#users=2] = call_module[target=self_layer1_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer2_0_conv1 : [#users=1] = call_module[target=self_layer2_0_conv1](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_bn1 : [#users=1] = call_module[target=self_layer2_0_bn1](args = (%self_layer2_0_conv1,), kwargs = {}) | |
%self_layer2_0_relu : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn1,), kwargs = {}) | |
%self_layer2_0_conv2 : [#users=1] = call_module[target=self_layer2_0_conv2](args = (%self_layer2_0_relu,), kwargs = {}) | |
%self_layer2_0_bn2 : [#users=1] = call_module[target=self_layer2_0_bn2](args = (%self_layer2_0_conv2,), kwargs = {}) | |
%self_layer2_0_relu_1 : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn2,), kwargs = {}) | |
%self_layer2_0_conv3 : [#users=1] = call_module[target=self_layer2_0_conv3](args = (%self_layer2_0_relu_1,), kwargs = {}) | |
%self_layer2_0_bn3 : [#users=1] = call_module[target=self_layer2_0_bn3](args = (%self_layer2_0_conv3,), kwargs = {}) | |
%self_layer2_0_downsample_0 : [#users=1] = call_module[target=self_layer2_0_downsample_0](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_downsample_1 : [#users=1] = call_module[target=self_layer2_0_downsample_1](args = (%self_layer2_0_downsample_0,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_0_bn3, %self_layer2_0_downsample_1), kwargs = {}) | |
%self_layer2_0_relu_2 : [#users=2] = call_module[target=self_layer2_0_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer2_1_conv1 : [#users=1] = call_module[target=self_layer2_1_conv1](args = (%self_layer2_0_relu_2,), kwargs = {}) | |
%self_layer2_1_bn1 : [#users=1] = call_module[target=self_layer2_1_bn1](args = (%self_layer2_1_conv1,), kwargs = {}) | |
%self_layer2_1_relu : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn1,), kwargs = {}) | |
%self_layer2_1_conv2 : [#users=1] = call_module[target=self_layer2_1_conv2](args = (%self_layer2_1_relu,), kwargs = {}) | |
%self_layer2_1_bn2 : [#users=1] = call_module[target=self_layer2_1_bn2](args = (%self_layer2_1_conv2,), kwargs = {}) | |
%self_layer2_1_relu_1 : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn2,), kwargs = {}) | |
%self_layer2_1_conv3 : [#users=1] = call_module[target=self_layer2_1_conv3](args = (%self_layer2_1_relu_1,), kwargs = {}) | |
%self_layer2_1_bn3 : [#users=1] = call_module[target=self_layer2_1_bn3](args = (%self_layer2_1_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_1_bn3, %self_layer2_0_relu_2), kwargs = {}) | |
%self_layer2_1_relu_2 : [#users=2] = call_module[target=self_layer2_1_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer2_2_conv1 : [#users=1] = call_module[target=self_layer2_2_conv1](args = (%self_layer2_1_relu_2,), kwargs = {}) | |
%self_layer2_2_bn1 : [#users=1] = call_module[target=self_layer2_2_bn1](args = (%self_layer2_2_conv1,), kwargs = {}) | |
%self_layer2_2_relu : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn1,), kwargs = {}) | |
%self_layer2_2_conv2 : [#users=1] = call_module[target=self_layer2_2_conv2](args = (%self_layer2_2_relu,), kwargs = {}) | |
%self_layer2_2_bn2 : [#users=1] = call_module[target=self_layer2_2_bn2](args = (%self_layer2_2_conv2,), kwargs = {}) | |
%self_layer2_2_relu_1 : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn2,), kwargs = {}) | |
%self_layer2_2_conv3 : [#users=1] = call_module[target=self_layer2_2_conv3](args = (%self_layer2_2_relu_1,), kwargs = {}) | |
%self_layer2_2_bn3 : [#users=1] = call_module[target=self_layer2_2_bn3](args = (%self_layer2_2_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_2_bn3, %self_layer2_1_relu_2), kwargs = {}) | |
%self_layer2_2_relu_2 : [#users=2] = call_module[target=self_layer2_2_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer2_3_conv1 : [#users=1] = call_module[target=self_layer2_3_conv1](args = (%self_layer2_2_relu_2,), kwargs = {}) | |
%self_layer2_3_bn1 : [#users=1] = call_module[target=self_layer2_3_bn1](args = (%self_layer2_3_conv1,), kwargs = {}) | |
%self_layer2_3_relu : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn1,), kwargs = {}) | |
%self_layer2_3_conv2 : [#users=1] = call_module[target=self_layer2_3_conv2](args = (%self_layer2_3_relu,), kwargs = {}) | |
%self_layer2_3_bn2 : [#users=1] = call_module[target=self_layer2_3_bn2](args = (%self_layer2_3_conv2,), kwargs = {}) | |
%self_layer2_3_relu_1 : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn2,), kwargs = {}) | |
%self_layer2_3_conv3 : [#users=1] = call_module[target=self_layer2_3_conv3](args = (%self_layer2_3_relu_1,), kwargs = {}) | |
%self_layer2_3_bn3 : [#users=1] = call_module[target=self_layer2_3_bn3](args = (%self_layer2_3_conv3,), kwargs = {}) | |
%iadd_6 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_3_bn3, %self_layer2_2_relu_2), kwargs = {}) | |
%self_layer2_3_relu_2 : [#users=2] = call_module[target=self_layer2_3_relu](args = (%iadd_6,), kwargs = {}) | |
%self_layer3_0_conv1 : [#users=1] = call_module[target=self_layer3_0_conv1](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_bn1 : [#users=1] = call_module[target=self_layer3_0_bn1](args = (%self_layer3_0_conv1,), kwargs = {}) | |
%self_layer3_0_relu : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn1,), kwargs = {}) | |
%self_layer3_0_conv2 : [#users=1] = call_module[target=self_layer3_0_conv2](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_0_bn2 : [#users=1] = call_module[target=self_layer3_0_bn2](args = (%self_layer3_0_conv2,), kwargs = {}) | |
%self_layer3_0_relu_1 : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn2,), kwargs = {}) | |
%self_layer3_0_conv3 : [#users=1] = call_module[target=self_layer3_0_conv3](args = (%self_layer3_0_relu_1,), kwargs = {}) | |
%self_layer3_0_bn3 : [#users=1] = call_module[target=self_layer3_0_bn3](args = (%self_layer3_0_conv3,), kwargs = {}) | |
return (self_layer2_3_relu_2, self_layer3_0_bn3) | |
---submod_1 graph--- | |
graph(): | |
%self_layer2_3_relu_2 : [#users=1] = placeholder[target=self_layer2_3_relu_2] | |
%self_layer3_0_bn3 : [#users=1] = placeholder[target=self_layer3_0_bn3] | |
%self_layer3_0_downsample_0 : [#users=1] = call_module[target=self_layer3_0_downsample_0](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_downsample_1 : [#users=1] = call_module[target=self_layer3_0_downsample_1](args = (%self_layer3_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_0_bn3, %self_layer3_0_downsample_1), kwargs = {}) | |
%self_layer3_0_relu : [#users=2] = call_module[target=self_layer3_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer3_1_conv1 : [#users=1] = call_module[target=self_layer3_1_conv1](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_1_bn1 : [#users=1] = call_module[target=self_layer3_1_bn1](args = (%self_layer3_1_conv1,), kwargs = {}) | |
%self_layer3_1_relu : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn1,), kwargs = {}) | |
%self_layer3_1_conv2 : [#users=1] = call_module[target=self_layer3_1_conv2](args = (%self_layer3_1_relu,), kwargs = {}) | |
%self_layer3_1_bn2 : [#users=1] = call_module[target=self_layer3_1_bn2](args = (%self_layer3_1_conv2,), kwargs = {}) | |
%self_layer3_1_relu_1 : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn2,), kwargs = {}) | |
%self_layer3_1_conv3 : [#users=1] = call_module[target=self_layer3_1_conv3](args = (%self_layer3_1_relu_1,), kwargs = {}) | |
%self_layer3_1_bn3 : [#users=1] = call_module[target=self_layer3_1_bn3](args = (%self_layer3_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_1_bn3, %self_layer3_0_relu), kwargs = {}) | |
%self_layer3_1_relu_2 : [#users=2] = call_module[target=self_layer3_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer3_2_conv1 : [#users=1] = call_module[target=self_layer3_2_conv1](args = (%self_layer3_1_relu_2,), kwargs = {}) | |
%self_layer3_2_bn1 : [#users=1] = call_module[target=self_layer3_2_bn1](args = (%self_layer3_2_conv1,), kwargs = {}) | |
%self_layer3_2_relu : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn1,), kwargs = {}) | |
%self_layer3_2_conv2 : [#users=1] = call_module[target=self_layer3_2_conv2](args = (%self_layer3_2_relu,), kwargs = {}) | |
%self_layer3_2_bn2 : [#users=1] = call_module[target=self_layer3_2_bn2](args = (%self_layer3_2_conv2,), kwargs = {}) | |
%self_layer3_2_relu_1 : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn2,), kwargs = {}) | |
%self_layer3_2_conv3 : [#users=1] = call_module[target=self_layer3_2_conv3](args = (%self_layer3_2_relu_1,), kwargs = {}) | |
%self_layer3_2_bn3 : [#users=1] = call_module[target=self_layer3_2_bn3](args = (%self_layer3_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_2_bn3, %self_layer3_1_relu_2), kwargs = {}) | |
%self_layer3_2_relu_2 : [#users=2] = call_module[target=self_layer3_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer3_3_conv1 : [#users=1] = call_module[target=self_layer3_3_conv1](args = (%self_layer3_2_relu_2,), kwargs = {}) | |
%self_layer3_3_bn1 : [#users=1] = call_module[target=self_layer3_3_bn1](args = (%self_layer3_3_conv1,), kwargs = {}) | |
%self_layer3_3_relu : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn1,), kwargs = {}) | |
%self_layer3_3_conv2 : [#users=1] = call_module[target=self_layer3_3_conv2](args = (%self_layer3_3_relu,), kwargs = {}) | |
%self_layer3_3_bn2 : [#users=1] = call_module[target=self_layer3_3_bn2](args = (%self_layer3_3_conv2,), kwargs = {}) | |
%self_layer3_3_relu_1 : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn2,), kwargs = {}) | |
%self_layer3_3_conv3 : [#users=1] = call_module[target=self_layer3_3_conv3](args = (%self_layer3_3_relu_1,), kwargs = {}) | |
%self_layer3_3_bn3 : [#users=1] = call_module[target=self_layer3_3_bn3](args = (%self_layer3_3_conv3,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_3_bn3, %self_layer3_2_relu_2), kwargs = {}) | |
%self_layer3_3_relu_2 : [#users=2] = call_module[target=self_layer3_3_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer3_4_conv1 : [#users=1] = call_module[target=self_layer3_4_conv1](args = (%self_layer3_3_relu_2,), kwargs = {}) | |
%self_layer3_4_bn1 : [#users=1] = call_module[target=self_layer3_4_bn1](args = (%self_layer3_4_conv1,), kwargs = {}) | |
%self_layer3_4_relu : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn1,), kwargs = {}) | |
%self_layer3_4_conv2 : [#users=1] = call_module[target=self_layer3_4_conv2](args = (%self_layer3_4_relu,), kwargs = {}) | |
%self_layer3_4_bn2 : [#users=1] = call_module[target=self_layer3_4_bn2](args = (%self_layer3_4_conv2,), kwargs = {}) | |
%self_layer3_4_relu_1 : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn2,), kwargs = {}) | |
%self_layer3_4_conv3 : [#users=1] = call_module[target=self_layer3_4_conv3](args = (%self_layer3_4_relu_1,), kwargs = {}) | |
%self_layer3_4_bn3 : [#users=1] = call_module[target=self_layer3_4_bn3](args = (%self_layer3_4_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_4_bn3, %self_layer3_3_relu_2), kwargs = {}) | |
%self_layer3_4_relu_2 : [#users=2] = call_module[target=self_layer3_4_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer3_5_conv1 : [#users=1] = call_module[target=self_layer3_5_conv1](args = (%self_layer3_4_relu_2,), kwargs = {}) | |
%self_layer3_5_bn1 : [#users=1] = call_module[target=self_layer3_5_bn1](args = (%self_layer3_5_conv1,), kwargs = {}) | |
%self_layer3_5_relu : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn1,), kwargs = {}) | |
%self_layer3_5_conv2 : [#users=1] = call_module[target=self_layer3_5_conv2](args = (%self_layer3_5_relu,), kwargs = {}) | |
%self_layer3_5_bn2 : [#users=1] = call_module[target=self_layer3_5_bn2](args = (%self_layer3_5_conv2,), kwargs = {}) | |
%self_layer3_5_relu_1 : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn2,), kwargs = {}) | |
%self_layer3_5_conv3 : [#users=1] = call_module[target=self_layer3_5_conv3](args = (%self_layer3_5_relu_1,), kwargs = {}) | |
%self_layer3_5_bn3 : [#users=1] = call_module[target=self_layer3_5_bn3](args = (%self_layer3_5_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_5_bn3, %self_layer3_4_relu_2), kwargs = {}) | |
%self_layer3_5_relu_2 : [#users=2] = call_module[target=self_layer3_5_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer4_0_conv1 : [#users=1] = call_module[target=self_layer4_0_conv1](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_bn1 : [#users=1] = call_module[target=self_layer4_0_bn1](args = (%self_layer4_0_conv1,), kwargs = {}) | |
%self_layer4_0_relu : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn1,), kwargs = {}) | |
return (self_layer4_0_relu, self_layer3_5_relu_2) | |
---submod_2 graph--- | |
graph(): | |
%self_layer4_0_relu : [#users=1] = placeholder[target=self_layer4_0_relu] | |
%self_layer3_5_relu_2 : [#users=1] = placeholder[target=self_layer3_5_relu_2] | |
%self_layer4_0_conv2 : [#users=1] = call_module[target=self_layer4_0_conv2](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_0_bn2 : [#users=1] = call_module[target=self_layer4_0_bn2](args = (%self_layer4_0_conv2,), kwargs = {}) | |
%self_layer4_0_relu_1 : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn2,), kwargs = {}) | |
%self_layer4_0_conv3 : [#users=1] = call_module[target=self_layer4_0_conv3](args = (%self_layer4_0_relu_1,), kwargs = {}) | |
%self_layer4_0_bn3 : [#users=1] = call_module[target=self_layer4_0_bn3](args = (%self_layer4_0_conv3,), kwargs = {}) | |
%self_layer4_0_downsample_0 : [#users=1] = call_module[target=self_layer4_0_downsample_0](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_downsample_1 : [#users=1] = call_module[target=self_layer4_0_downsample_1](args = (%self_layer4_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_0_bn3, %self_layer4_0_downsample_1), kwargs = {}) | |
%self_layer4_0_relu_2 : [#users=2] = call_module[target=self_layer4_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer4_1_conv1 : [#users=1] = call_module[target=self_layer4_1_conv1](args = (%self_layer4_0_relu_2,), kwargs = {}) | |
%self_layer4_1_bn1 : [#users=1] = call_module[target=self_layer4_1_bn1](args = (%self_layer4_1_conv1,), kwargs = {}) | |
%self_layer4_1_relu : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn1,), kwargs = {}) | |
return (self_layer4_1_relu, self_layer4_0_relu_2) | |
---submod_3 graph--- | |
graph(): | |
%self_layer4_1_relu : [#users=1] = placeholder[target=self_layer4_1_relu] | |
%self_layer4_0_relu_2 : [#users=1] = placeholder[target=self_layer4_0_relu_2] | |
%self_layer4_1_conv2 : [#users=1] = call_module[target=self_layer4_1_conv2](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_1_bn2 : [#users=1] = call_module[target=self_layer4_1_bn2](args = (%self_layer4_1_conv2,), kwargs = {}) | |
%self_layer4_1_relu_1 : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn2,), kwargs = {}) | |
%self_layer4_1_conv3 : [#users=1] = call_module[target=self_layer4_1_conv3](args = (%self_layer4_1_relu_1,), kwargs = {}) | |
%self_layer4_1_bn3 : [#users=1] = call_module[target=self_layer4_1_bn3](args = (%self_layer4_1_conv3,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_1_bn3, %self_layer4_0_relu_2), kwargs = {}) | |
%self_layer4_1_relu_2 : [#users=2] = call_module[target=self_layer4_1_relu](args = (%iadd,), kwargs = {}) | |
%self_layer4_2_conv1 : [#users=1] = call_module[target=self_layer4_2_conv1](args = (%self_layer4_1_relu_2,), kwargs = {}) | |
%self_layer4_2_bn1 : [#users=1] = call_module[target=self_layer4_2_bn1](args = (%self_layer4_2_conv1,), kwargs = {}) | |
%self_layer4_2_relu : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn1,), kwargs = {}) | |
%self_layer4_2_conv2 : [#users=1] = call_module[target=self_layer4_2_conv2](args = (%self_layer4_2_relu,), kwargs = {}) | |
%self_layer4_2_bn2 : [#users=1] = call_module[target=self_layer4_2_bn2](args = (%self_layer4_2_conv2,), kwargs = {}) | |
%self_layer4_2_relu_1 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn2,), kwargs = {}) | |
%self_layer4_2_conv3 : [#users=1] = call_module[target=self_layer4_2_conv3](args = (%self_layer4_2_relu_1,), kwargs = {}) | |
%self_layer4_2_bn3 : [#users=1] = call_module[target=self_layer4_2_bn3](args = (%self_layer4_2_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_2_bn3, %self_layer4_1_relu_2), kwargs = {}) | |
%self_layer4_2_relu_2 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%iadd_1,), kwargs = {}) | |
%self_avgpool : [#users=1] = call_module[target=self_avgpool](args = (%self_layer4_2_relu_2,), kwargs = {}) | |
%flatten : [#users=1] = call_function[target=torch.flatten](args = (%self_avgpool, 1), kwargs = {}) | |
return flatten | |
---submod_4 graph--- | |
graph(): | |
%flatten : [#users=1] = placeholder[target=flatten] | |
%self_fc : [#users=1] = call_module[target=self_fc](args = (%flatten,), kwargs = {}) | |
return self_fc | |
--------------- | |
run_node placeholder, x got args tuple() | |
run_node call_module, submod_0 got args tuple(T[torch.Size([32, 3, 224, 224])]) | |
still waiting for process to exit | |
didn't get the result, queue empty | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 28, 28])], T[torch.Size([32, 1024, 14, 14])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 28, 28])], T[torch.Size([32, 1024, 14, 14])]), 1) | |
run_node call_module, submod_1 got args tuple(T[torch.Size([32, 512, 28, 28])], T[torch.Size([32, 1024, 14, 14])]) | |
got exit 1 | |
didn't get the result, queue empty | |
exit code: 1 and result: {'nodes': 1, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'torchdynamo_inductor', 'has_breaks': True, 'static_graph': False, 'result': None} | |
<RESULT>{"nodes": 1, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "torchdynamo_inductor", "has_breaks": true, "static_graph": false, "result": null}</RESULT> | |
submitit INFO (2022-10-26 04:23:25,105) - Job completed successfully |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
submitit INFO (2022-10-26 02:47:41,285) - Starting with JobEnvironment(job_id=74173, hostname=a100-st-p4d24xlarge-24, local_rank=0(8), node=0(2), global_rank=0(16)) | |
submitit INFO (2022-10-26 02:47:41,286) - Loading pickle: /fsx/users/dberard/scratch-local/bench-fast/benchmark/logs_oct25/74173_submitted.pkl | |
This is node 0 | |
run_once | |
Process group: 16 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-24 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-24:97518:97518 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-24:97518:97518 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-24:97518:97518 [0] NCCL INFO Bootstrap : Using ens32:10.200.74.168<0> | |
a100-st-p4d24xlarge-24:97518:97518 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-24:97518:97518 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-24:97518:97518 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
still waiting for process to exit | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 00/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 01/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 02/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 03/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 04/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 05/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 06/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 07/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Trees [0] 1/8/-1->0->-1 [1] 1/-1/-1->0->7 [2] 1/-1/-1->0->7 [3] 1/-1/-1->0->7 [4] 1/-1/-1->0->8 [5] 1/-1/-1->0->7 [6] 1/-1/-1->0->7 [7] 1/-1/-1->0->7 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002dd0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002e10 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002e50 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002e90 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002ed0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 00/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002f10 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 04/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002f50 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002f90 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002fd0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003010 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003050 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003090 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380030d0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003110 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003150 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003190 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380031d0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003210 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003250 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003290 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380032d0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003310 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003350 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003390 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380033d0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003410 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003450 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003490 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380034d0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 28 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 00/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 29 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003510 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 04/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003550 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003590 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380035d0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003610 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003650 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003690 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380036d0 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003710 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003750 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003790 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy recv connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO threadThresholds 8/8/64 | 128/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO 8 coll channels, 8 p2p channels, 2 p2p channels per peer | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 41 from local rank 1, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380037d0 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 2 -> connection 0x7f60dc003690 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 4 -> connection 0x7f4b6c003690 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO Connection to proxy localRank 6 -> connection 0x7f7c6c003690 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 42 from local rank 7, transport 2 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 43 from local rank 6, transport 2 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 44 from local rank 2, transport 2 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 45 from local rank 5, transport 2 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 46 from local rank 3, transport 2 | |
a100-st-p4d24xlarge-24:97518:98071 [0] NCCL INFO New proxy send connection 47 from local rank 4, transport 2 | |
a100-st-p4d24xlarge-24:97518:98032 [0] NCCL INFO comm 0x563e7e05bc30 rank 0 nranks 16 cudaDev 0 busId 101c0 - Init COMPLETE | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
result {'latency_median': 76.04889678955078, 'latency_stdev': 1.490786349319905} | |
has been placed into the queue | |
got exit 0 | |
got the result! | |
exit code: 0 and result: {'nodes': 2, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'eager', 'has_breaks': False, 'static_graph': False, 'result': {'latency_median': 76.04889678955078, 'latency_stdev': 1.490786349319905}} | |
<RESULT>{"nodes": 2, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "eager", "has_breaks": false, "static_graph": false, "result": {"latency_median": 76.04889678955078, "latency_stdev": 1.490786349319905}}</RESULT> | |
run_once | |
Process group: 16 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-24 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-24:917:917 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-24:917:917 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-24:917:917 [0] NCCL INFO Bootstrap : Using ens32:10.200.74.168<0> | |
a100-st-p4d24xlarge-24:917:917 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-24:917:917 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-24:917:917 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Using network AWS Libfabric | |
still waiting for process to exit | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 00/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 01/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 02/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 03/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 04/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 05/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 06/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 07/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Trees [0] 1/8/-1->0->-1 [1] 1/-1/-1->0->7 [2] 1/-1/-1->0->7 [3] 1/-1/-1->0->7 [4] 1/-1/-1->0->8 [5] 1/-1/-1->0->7 [6] 1/-1/-1->0->7 [7] 1/-1/-1->0->7 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002dd0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002e10 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002e50 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002e90 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002ed0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 00/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002f10 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 04/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002f50 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002f90 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538002fd0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003010 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003050 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003090 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380030d0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003110 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003150 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003190 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380031d0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003210 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003250 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003290 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380032d0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003310 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003350 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003390 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380033d0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003410 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003450 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003490 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 28 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380034d0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 00/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 29 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003510 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 04/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003550 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003590 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380035d0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003610 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003650 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003690 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380036d0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003710 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003750 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3538003790 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy recv connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO threadThresholds 8/8/64 | 128/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO 8 coll channels, 8 p2p channels, 2 p2p channels per peer | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 41 from local rank 1, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35380037d0 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 2 -> connection 0x7f60e0003690 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 4 -> connection 0x7f4b70003690 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO Connection to proxy localRank 6 -> connection 0x7f7c70003690 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 42 from local rank 7, transport 2 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 43 from local rank 2, transport 2 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 44 from local rank 6, transport 2 | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 45 from local rank 5, transport 2 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 46 from local rank 3, transport 2 | |
a100-st-p4d24xlarge-24:917:2158 [0] NCCL INFO New proxy send connection 47 from local rank 4, transport 2 | |
a100-st-p4d24xlarge-24:917:2014 [0] NCCL INFO comm 0x563e7ef92c90 rank 0 nranks 16 cudaDev 0 busId 101c0 - Init COMPLETE | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
result {'latency_median': 93.18195343017578, 'latency_stdev': 0.26483998867502734} | |
has been placed into the queue | |
got the result! | |
got exit 0 | |
(already got results) | |
exit code: 0 and result: {'nodes': 2, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'torchdynamo_inductor', 'has_breaks': False, 'static_graph': False, 'result': {'latency_median': 93.18195343017578, 'latency_stdev': 0.26483998867502734}} | |
<RESULT>{"nodes": 2, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "torchdynamo_inductor", "has_breaks": false, "static_graph": false, "result": {"latency_median": 93.18195343017578, "latency_stdev": 0.26483998867502734}}</RESULT> | |
run_once | |
Process group: 16 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-24 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
a100-st-p4d24xlarge-24:6123:6123 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-24:6123:6123 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-24:6123:6123 [0] NCCL INFO Bootstrap : Using ens32:10.200.74.168<0> | |
a100-st-p4d24xlarge-24:6123:6123 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-24:6123:6123 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-24:6123:6123 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 00/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 01/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 02/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 03/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 04/08 : 0 7 6 5 4 3 2 1 8 15 14 13 12 11 10 9 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 05/08 : 0 3 10 15 14 13 12 9 8 11 2 7 6 5 4 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 06/08 : 0 7 6 5 12 11 10 9 8 15 14 13 4 3 2 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 07/08 : 0 5 4 7 14 11 10 9 8 13 12 15 6 3 2 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Trees [0] 1/8/-1->0->-1 [1] 1/-1/-1->0->7 [2] 1/-1/-1->0->7 [3] 1/-1/-1->0->7 [4] 1/-1/-1->0->8 [5] 1/-1/-1->0->7 [6] 1/-1/-1->0->7 [7] 1/-1/-1->0->7 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002dd0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 3[201d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002e10 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002e50 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 5[901d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002e90 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002ed0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 00/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002f10 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 04/0 : 9[101d0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002f50 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002f90 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548002fd0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003010 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003050 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003090 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480030d0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003110 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003150 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003190 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480031d0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003210 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003250 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003290 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480032d0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003310 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003350 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003390 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480033d0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003410 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003450 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 7[a01d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003490 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 28 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480034d0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 00/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 29 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003510 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 04/0 : 8[101c0] -> 0[101c0] [receive] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003550 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003590 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 8[101c0] [send] via NET/AWS Libfabric/0/GDRDMA | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480035d0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003610 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003650 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003690 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480036d0 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003710 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003750 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f3548003790 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy recv connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO threadThresholds 8/8/64 | 128/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO 8 coll channels, 8 p2p channels, 2 p2p channels per peer | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 41 from local rank 1, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f35480037d0 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 1 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 2 -> connection 0x7f60d8003590 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 4 -> connection 0x7f4b70003110 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO Connection to proxy localRank 6 -> connection 0x7f7c6c003590 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 42 from local rank 7, transport 2 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 43 from local rank 6, transport 2 | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 44 from local rank 2, transport 2 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 45 from local rank 5, transport 2 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 46 from local rank 3, transport 2 | |
a100-st-p4d24xlarge-24:6123:7261 [0] NCCL INFO New proxy send connection 47 from local rank 4, transport 2 | |
a100-st-p4d24xlarge-24:6123:7127 [0] NCCL INFO comm 0x563e7e6ef4b0 rank 0 nranks 16 cudaDev 0 busId 101c0 - Init COMPLETE | |
DDPOptimizer used bucket cap 26214400 and produced the following buckets: | |
Index Size (b) Param Names | |
------- ---------- --------------------------------- | |
0 30257056 self_fc_weight | |
self_fc_bias | |
self_layer4_2_bn3_weight | |
self_layer4_2_bn3_bias | |
self_layer4_2_conv3_weight | |
self_layer4_2_bn2_weight | |
self_layer4_2_bn2_bias | |
self_layer4_2_conv2_weight | |
self_layer4_2_bn1_weight | |
self_layer4_2_bn1_bias | |
self_layer4_2_conv1_weight | |
self_layer4_1_bn3_weight | |
self_layer4_1_bn3_bias | |
self_layer4_1_conv3_weight | |
1 26255360 self_layer4_1_bn2_weight | |
self_layer4_1_bn2_bias | |
self_layer4_1_conv2_weight | |
self_layer4_1_bn1_weight | |
self_layer4_1_bn1_bias | |
self_layer4_1_conv1_weight | |
self_layer4_0_downsample_1_weight | |
self_layer4_0_downsample_1_bias | |
self_layer4_0_downsample_0_weight | |
self_layer4_0_bn3_weight | |
self_layer4_0_bn3_bias | |
self_layer4_0_conv3_weight | |
2 28366848 self_layer4_0_bn2_weight | |
self_layer4_0_bn2_bias | |
self_layer4_0_conv2_weight | |
self_layer4_0_bn1_weight | |
self_layer4_0_bn1_bias | |
self_layer4_0_conv1_weight | |
self_layer3_5_bn3_weight | |
self_layer3_5_bn3_bias | |
self_layer3_5_conv3_weight | |
self_layer3_5_bn2_weight | |
self_layer3_5_bn2_bias | |
self_layer3_5_conv2_weight | |
self_layer3_5_bn1_weight | |
self_layer3_5_bn1_bias | |
self_layer3_5_conv1_weight | |
self_layer3_4_bn3_weight | |
self_layer3_4_bn3_bias | |
self_layer3_4_conv3_weight | |
self_layer3_4_bn2_weight | |
self_layer3_4_bn2_bias | |
self_layer3_4_conv2_weight | |
self_layer3_4_bn1_weight | |
self_layer3_4_bn1_bias | |
self_layer3_4_conv1_weight | |
self_layer3_3_bn3_weight | |
self_layer3_3_bn3_bias | |
self_layer3_3_conv3_weight | |
self_layer3_3_bn2_weight | |
self_layer3_3_bn2_bias | |
self_layer3_3_conv2_weight | |
self_layer3_3_bn1_weight | |
self_layer3_3_bn1_bias | |
self_layer3_3_conv1_weight | |
self_layer3_2_bn3_weight | |
self_layer3_2_bn3_bias | |
self_layer3_2_conv3_weight | |
self_layer3_2_bn2_weight | |
self_layer3_2_bn2_bias | |
self_layer3_2_conv2_weight | |
3 17348864 self_layer3_2_bn1_weight | |
self_layer3_2_bn1_bias | |
self_layer3_2_conv1_weight | |
self_layer3_1_bn3_weight | |
self_layer3_1_bn3_bias | |
self_layer3_1_conv3_weight | |
self_layer3_1_bn2_weight | |
self_layer3_1_bn2_bias | |
self_layer3_1_conv2_weight | |
self_layer3_1_bn1_weight | |
self_layer3_1_bn1_bias | |
self_layer3_1_conv1_weight | |
self_layer3_0_downsample_1_weight | |
self_layer3_0_downsample_1_bias | |
self_layer3_0_downsample_0_weight | |
self_layer3_0_bn3_weight | |
self_layer3_0_bn3_bias | |
self_layer3_0_conv3_weight | |
self_layer3_0_bn2_weight | |
self_layer3_0_bn2_bias | |
self_layer3_0_conv2_weight | |
self_layer3_0_bn1_weight | |
self_layer3_0_bn1_bias | |
self_layer3_0_conv1_weight | |
self_layer2_3_bn3_weight | |
self_layer2_3_bn3_bias | |
self_layer2_3_conv3_weight | |
self_layer2_3_bn2_weight | |
self_layer2_3_bn2_bias | |
self_layer2_3_conv2_weight | |
self_layer2_3_bn1_weight | |
self_layer2_3_bn1_bias | |
self_layer2_3_conv1_weight | |
self_layer2_2_bn3_weight | |
self_layer2_2_bn3_bias | |
self_layer2_2_conv3_weight | |
self_layer2_2_bn2_weight | |
self_layer2_2_bn2_bias | |
self_layer2_2_conv2_weight | |
self_layer2_2_bn1_weight | |
self_layer2_2_bn1_bias | |
self_layer2_2_conv1_weight | |
self_layer2_1_bn3_weight | |
self_layer2_1_bn3_bias | |
self_layer2_1_conv3_weight | |
self_layer2_1_bn2_weight | |
self_layer2_1_bn2_bias | |
self_layer2_1_conv2_weight | |
self_layer2_1_bn1_weight | |
self_layer2_1_bn1_bias | |
self_layer2_1_conv1_weight | |
self_layer2_0_downsample_1_weight | |
self_layer2_0_downsample_1_bias | |
self_layer2_0_downsample_0_weight | |
self_layer2_0_bn3_weight | |
self_layer2_0_bn3_bias | |
self_layer2_0_conv3_weight | |
self_layer2_0_bn2_weight | |
self_layer2_0_bn2_bias | |
self_layer2_0_conv2_weight | |
self_layer2_0_bn1_weight | |
self_layer2_0_bn1_bias | |
self_layer2_0_conv1_weight | |
self_layer1_2_bn3_weight | |
self_layer1_2_bn3_bias | |
self_layer1_2_conv3_weight | |
self_layer1_2_bn2_weight | |
self_layer1_2_bn2_bias | |
self_layer1_2_conv2_weight | |
self_layer1_2_bn1_weight | |
self_layer1_2_bn1_bias | |
self_layer1_2_conv1_weight | |
self_layer1_1_bn3_weight | |
self_layer1_1_bn3_bias | |
self_layer1_1_conv3_weight | |
self_layer1_1_bn2_weight | |
self_layer1_1_bn2_bias | |
self_layer1_1_conv2_weight | |
self_layer1_1_bn1_weight | |
self_layer1_1_bn1_bias | |
self_layer1_1_conv1_weight | |
self_layer1_0_downsample_1_weight | |
self_layer1_0_downsample_1_bias | |
self_layer1_0_downsample_0_weight | |
self_layer1_0_bn3_weight | |
self_layer1_0_bn3_bias | |
self_layer1_0_conv3_weight | |
self_layer1_0_bn2_weight | |
self_layer1_0_bn2_bias | |
self_layer1_0_conv2_weight | |
self_layer1_0_bn1_weight | |
self_layer1_0_bn1_bias | |
self_layer1_0_conv1_weight | |
self_bn1_weight | |
self_bn1_bias | |
self_conv1_weight | |
---orig graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%self_conv1 : [#users=1] = call_module[target=self_conv1](args = (%x,), kwargs = {}) | |
%self_bn1 : [#users=1] = call_module[target=self_bn1](args = (%self_conv1,), kwargs = {}) | |
%self_relu : [#users=1] = call_module[target=self_relu](args = (%self_bn1,), kwargs = {}) | |
%self_maxpool : [#users=2] = call_module[target=self_maxpool](args = (%self_relu,), kwargs = {}) | |
%self_layer1_0_conv1 : [#users=1] = call_module[target=self_layer1_0_conv1](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_bn1 : [#users=1] = call_module[target=self_layer1_0_bn1](args = (%self_layer1_0_conv1,), kwargs = {}) | |
%self_layer1_0_relu : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn1,), kwargs = {}) | |
%self_layer1_0_conv2 : [#users=1] = call_module[target=self_layer1_0_conv2](args = (%self_layer1_0_relu,), kwargs = {}) | |
%self_layer1_0_bn2 : [#users=1] = call_module[target=self_layer1_0_bn2](args = (%self_layer1_0_conv2,), kwargs = {}) | |
%self_layer1_0_relu_1 : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn2,), kwargs = {}) | |
%self_layer1_0_conv3 : [#users=1] = call_module[target=self_layer1_0_conv3](args = (%self_layer1_0_relu_1,), kwargs = {}) | |
%self_layer1_0_bn3 : [#users=1] = call_module[target=self_layer1_0_bn3](args = (%self_layer1_0_conv3,), kwargs = {}) | |
%self_layer1_0_downsample_0 : [#users=1] = call_module[target=self_layer1_0_downsample_0](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_downsample_1 : [#users=1] = call_module[target=self_layer1_0_downsample_1](args = (%self_layer1_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_0_bn3, %self_layer1_0_downsample_1), kwargs = {}) | |
%self_layer1_0_relu_2 : [#users=2] = call_module[target=self_layer1_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer1_1_conv1 : [#users=1] = call_module[target=self_layer1_1_conv1](args = (%self_layer1_0_relu_2,), kwargs = {}) | |
%self_layer1_1_bn1 : [#users=1] = call_module[target=self_layer1_1_bn1](args = (%self_layer1_1_conv1,), kwargs = {}) | |
%self_layer1_1_relu : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn1,), kwargs = {}) | |
%self_layer1_1_conv2 : [#users=1] = call_module[target=self_layer1_1_conv2](args = (%self_layer1_1_relu,), kwargs = {}) | |
%self_layer1_1_bn2 : [#users=1] = call_module[target=self_layer1_1_bn2](args = (%self_layer1_1_conv2,), kwargs = {}) | |
%self_layer1_1_relu_1 : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn2,), kwargs = {}) | |
%self_layer1_1_conv3 : [#users=1] = call_module[target=self_layer1_1_conv3](args = (%self_layer1_1_relu_1,), kwargs = {}) | |
%self_layer1_1_bn3 : [#users=1] = call_module[target=self_layer1_1_bn3](args = (%self_layer1_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_1_bn3, %self_layer1_0_relu_2), kwargs = {}) | |
%self_layer1_1_relu_2 : [#users=2] = call_module[target=self_layer1_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer1_2_conv1 : [#users=1] = call_module[target=self_layer1_2_conv1](args = (%self_layer1_1_relu_2,), kwargs = {}) | |
%self_layer1_2_bn1 : [#users=1] = call_module[target=self_layer1_2_bn1](args = (%self_layer1_2_conv1,), kwargs = {}) | |
%self_layer1_2_relu : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn1,), kwargs = {}) | |
%self_layer1_2_conv2 : [#users=1] = call_module[target=self_layer1_2_conv2](args = (%self_layer1_2_relu,), kwargs = {}) | |
%self_layer1_2_bn2 : [#users=1] = call_module[target=self_layer1_2_bn2](args = (%self_layer1_2_conv2,), kwargs = {}) | |
%self_layer1_2_relu_1 : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn2,), kwargs = {}) | |
%self_layer1_2_conv3 : [#users=1] = call_module[target=self_layer1_2_conv3](args = (%self_layer1_2_relu_1,), kwargs = {}) | |
%self_layer1_2_bn3 : [#users=1] = call_module[target=self_layer1_2_bn3](args = (%self_layer1_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_2_bn3, %self_layer1_1_relu_2), kwargs = {}) | |
%self_layer1_2_relu_2 : [#users=2] = call_module[target=self_layer1_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer2_0_conv1 : [#users=1] = call_module[target=self_layer2_0_conv1](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_bn1 : [#users=1] = call_module[target=self_layer2_0_bn1](args = (%self_layer2_0_conv1,), kwargs = {}) | |
%self_layer2_0_relu : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn1,), kwargs = {}) | |
%self_layer2_0_conv2 : [#users=1] = call_module[target=self_layer2_0_conv2](args = (%self_layer2_0_relu,), kwargs = {}) | |
%self_layer2_0_bn2 : [#users=1] = call_module[target=self_layer2_0_bn2](args = (%self_layer2_0_conv2,), kwargs = {}) | |
%self_layer2_0_relu_1 : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn2,), kwargs = {}) | |
%self_layer2_0_conv3 : [#users=1] = call_module[target=self_layer2_0_conv3](args = (%self_layer2_0_relu_1,), kwargs = {}) | |
%self_layer2_0_bn3 : [#users=1] = call_module[target=self_layer2_0_bn3](args = (%self_layer2_0_conv3,), kwargs = {}) | |
%self_layer2_0_downsample_0 : [#users=1] = call_module[target=self_layer2_0_downsample_0](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_downsample_1 : [#users=1] = call_module[target=self_layer2_0_downsample_1](args = (%self_layer2_0_downsample_0,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_0_bn3, %self_layer2_0_downsample_1), kwargs = {}) | |
%self_layer2_0_relu_2 : [#users=2] = call_module[target=self_layer2_0_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer2_1_conv1 : [#users=1] = call_module[target=self_layer2_1_conv1](args = (%self_layer2_0_relu_2,), kwargs = {}) | |
%self_layer2_1_bn1 : [#users=1] = call_module[target=self_layer2_1_bn1](args = (%self_layer2_1_conv1,), kwargs = {}) | |
%self_layer2_1_relu : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn1,), kwargs = {}) | |
%self_layer2_1_conv2 : [#users=1] = call_module[target=self_layer2_1_conv2](args = (%self_layer2_1_relu,), kwargs = {}) | |
%self_layer2_1_bn2 : [#users=1] = call_module[target=self_layer2_1_bn2](args = (%self_layer2_1_conv2,), kwargs = {}) | |
%self_layer2_1_relu_1 : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn2,), kwargs = {}) | |
%self_layer2_1_conv3 : [#users=1] = call_module[target=self_layer2_1_conv3](args = (%self_layer2_1_relu_1,), kwargs = {}) | |
%self_layer2_1_bn3 : [#users=1] = call_module[target=self_layer2_1_bn3](args = (%self_layer2_1_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_1_bn3, %self_layer2_0_relu_2), kwargs = {}) | |
%self_layer2_1_relu_2 : [#users=2] = call_module[target=self_layer2_1_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer2_2_conv1 : [#users=1] = call_module[target=self_layer2_2_conv1](args = (%self_layer2_1_relu_2,), kwargs = {}) | |
%self_layer2_2_bn1 : [#users=1] = call_module[target=self_layer2_2_bn1](args = (%self_layer2_2_conv1,), kwargs = {}) | |
%self_layer2_2_relu : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn1,), kwargs = {}) | |
%self_layer2_2_conv2 : [#users=1] = call_module[target=self_layer2_2_conv2](args = (%self_layer2_2_relu,), kwargs = {}) | |
%self_layer2_2_bn2 : [#users=1] = call_module[target=self_layer2_2_bn2](args = (%self_layer2_2_conv2,), kwargs = {}) | |
%self_layer2_2_relu_1 : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn2,), kwargs = {}) | |
%self_layer2_2_conv3 : [#users=1] = call_module[target=self_layer2_2_conv3](args = (%self_layer2_2_relu_1,), kwargs = {}) | |
%self_layer2_2_bn3 : [#users=1] = call_module[target=self_layer2_2_bn3](args = (%self_layer2_2_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_2_bn3, %self_layer2_1_relu_2), kwargs = {}) | |
%self_layer2_2_relu_2 : [#users=2] = call_module[target=self_layer2_2_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer2_3_conv1 : [#users=1] = call_module[target=self_layer2_3_conv1](args = (%self_layer2_2_relu_2,), kwargs = {}) | |
%self_layer2_3_bn1 : [#users=1] = call_module[target=self_layer2_3_bn1](args = (%self_layer2_3_conv1,), kwargs = {}) | |
%self_layer2_3_relu : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn1,), kwargs = {}) | |
%self_layer2_3_conv2 : [#users=1] = call_module[target=self_layer2_3_conv2](args = (%self_layer2_3_relu,), kwargs = {}) | |
%self_layer2_3_bn2 : [#users=1] = call_module[target=self_layer2_3_bn2](args = (%self_layer2_3_conv2,), kwargs = {}) | |
%self_layer2_3_relu_1 : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn2,), kwargs = {}) | |
%self_layer2_3_conv3 : [#users=1] = call_module[target=self_layer2_3_conv3](args = (%self_layer2_3_relu_1,), kwargs = {}) | |
%self_layer2_3_bn3 : [#users=1] = call_module[target=self_layer2_3_bn3](args = (%self_layer2_3_conv3,), kwargs = {}) | |
%iadd_6 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_3_bn3, %self_layer2_2_relu_2), kwargs = {}) | |
%self_layer2_3_relu_2 : [#users=2] = call_module[target=self_layer2_3_relu](args = (%iadd_6,), kwargs = {}) | |
%self_layer3_0_conv1 : [#users=1] = call_module[target=self_layer3_0_conv1](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_bn1 : [#users=1] = call_module[target=self_layer3_0_bn1](args = (%self_layer3_0_conv1,), kwargs = {}) | |
%self_layer3_0_relu : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn1,), kwargs = {}) | |
%self_layer3_0_conv2 : [#users=1] = call_module[target=self_layer3_0_conv2](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_0_bn2 : [#users=1] = call_module[target=self_layer3_0_bn2](args = (%self_layer3_0_conv2,), kwargs = {}) | |
%self_layer3_0_relu_1 : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn2,), kwargs = {}) | |
%self_layer3_0_conv3 : [#users=1] = call_module[target=self_layer3_0_conv3](args = (%self_layer3_0_relu_1,), kwargs = {}) | |
%self_layer3_0_bn3 : [#users=1] = call_module[target=self_layer3_0_bn3](args = (%self_layer3_0_conv3,), kwargs = {}) | |
%self_layer3_0_downsample_0 : [#users=1] = call_module[target=self_layer3_0_downsample_0](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_downsample_1 : [#users=1] = call_module[target=self_layer3_0_downsample_1](args = (%self_layer3_0_downsample_0,), kwargs = {}) | |
%iadd_7 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_0_bn3, %self_layer3_0_downsample_1), kwargs = {}) | |
%self_layer3_0_relu_2 : [#users=2] = call_module[target=self_layer3_0_relu](args = (%iadd_7,), kwargs = {}) | |
%self_layer3_1_conv1 : [#users=1] = call_module[target=self_layer3_1_conv1](args = (%self_layer3_0_relu_2,), kwargs = {}) | |
%self_layer3_1_bn1 : [#users=1] = call_module[target=self_layer3_1_bn1](args = (%self_layer3_1_conv1,), kwargs = {}) | |
%self_layer3_1_relu : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn1,), kwargs = {}) | |
%self_layer3_1_conv2 : [#users=1] = call_module[target=self_layer3_1_conv2](args = (%self_layer3_1_relu,), kwargs = {}) | |
%self_layer3_1_bn2 : [#users=1] = call_module[target=self_layer3_1_bn2](args = (%self_layer3_1_conv2,), kwargs = {}) | |
%self_layer3_1_relu_1 : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn2,), kwargs = {}) | |
%self_layer3_1_conv3 : [#users=1] = call_module[target=self_layer3_1_conv3](args = (%self_layer3_1_relu_1,), kwargs = {}) | |
%self_layer3_1_bn3 : [#users=1] = call_module[target=self_layer3_1_bn3](args = (%self_layer3_1_conv3,), kwargs = {}) | |
%iadd_8 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_1_bn3, %self_layer3_0_relu_2), kwargs = {}) | |
%self_layer3_1_relu_2 : [#users=2] = call_module[target=self_layer3_1_relu](args = (%iadd_8,), kwargs = {}) | |
%self_layer3_2_conv1 : [#users=1] = call_module[target=self_layer3_2_conv1](args = (%self_layer3_1_relu_2,), kwargs = {}) | |
%self_layer3_2_bn1 : [#users=1] = call_module[target=self_layer3_2_bn1](args = (%self_layer3_2_conv1,), kwargs = {}) | |
%self_layer3_2_relu : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn1,), kwargs = {}) | |
%self_layer3_2_conv2 : [#users=1] = call_module[target=self_layer3_2_conv2](args = (%self_layer3_2_relu,), kwargs = {}) | |
%self_layer3_2_bn2 : [#users=1] = call_module[target=self_layer3_2_bn2](args = (%self_layer3_2_conv2,), kwargs = {}) | |
%self_layer3_2_relu_1 : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn2,), kwargs = {}) | |
%self_layer3_2_conv3 : [#users=1] = call_module[target=self_layer3_2_conv3](args = (%self_layer3_2_relu_1,), kwargs = {}) | |
%self_layer3_2_bn3 : [#users=1] = call_module[target=self_layer3_2_bn3](args = (%self_layer3_2_conv3,), kwargs = {}) | |
%iadd_9 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_2_bn3, %self_layer3_1_relu_2), kwargs = {}) | |
%self_layer3_2_relu_2 : [#users=2] = call_module[target=self_layer3_2_relu](args = (%iadd_9,), kwargs = {}) | |
%self_layer3_3_conv1 : [#users=1] = call_module[target=self_layer3_3_conv1](args = (%self_layer3_2_relu_2,), kwargs = {}) | |
%self_layer3_3_bn1 : [#users=1] = call_module[target=self_layer3_3_bn1](args = (%self_layer3_3_conv1,), kwargs = {}) | |
%self_layer3_3_relu : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn1,), kwargs = {}) | |
%self_layer3_3_conv2 : [#users=1] = call_module[target=self_layer3_3_conv2](args = (%self_layer3_3_relu,), kwargs = {}) | |
%self_layer3_3_bn2 : [#users=1] = call_module[target=self_layer3_3_bn2](args = (%self_layer3_3_conv2,), kwargs = {}) | |
%self_layer3_3_relu_1 : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn2,), kwargs = {}) | |
%self_layer3_3_conv3 : [#users=1] = call_module[target=self_layer3_3_conv3](args = (%self_layer3_3_relu_1,), kwargs = {}) | |
%self_layer3_3_bn3 : [#users=1] = call_module[target=self_layer3_3_bn3](args = (%self_layer3_3_conv3,), kwargs = {}) | |
%iadd_10 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_3_bn3, %self_layer3_2_relu_2), kwargs = {}) | |
%self_layer3_3_relu_2 : [#users=2] = call_module[target=self_layer3_3_relu](args = (%iadd_10,), kwargs = {}) | |
%self_layer3_4_conv1 : [#users=1] = call_module[target=self_layer3_4_conv1](args = (%self_layer3_3_relu_2,), kwargs = {}) | |
%self_layer3_4_bn1 : [#users=1] = call_module[target=self_layer3_4_bn1](args = (%self_layer3_4_conv1,), kwargs = {}) | |
%self_layer3_4_relu : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn1,), kwargs = {}) | |
%self_layer3_4_conv2 : [#users=1] = call_module[target=self_layer3_4_conv2](args = (%self_layer3_4_relu,), kwargs = {}) | |
%self_layer3_4_bn2 : [#users=1] = call_module[target=self_layer3_4_bn2](args = (%self_layer3_4_conv2,), kwargs = {}) | |
%self_layer3_4_relu_1 : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn2,), kwargs = {}) | |
%self_layer3_4_conv3 : [#users=1] = call_module[target=self_layer3_4_conv3](args = (%self_layer3_4_relu_1,), kwargs = {}) | |
%self_layer3_4_bn3 : [#users=1] = call_module[target=self_layer3_4_bn3](args = (%self_layer3_4_conv3,), kwargs = {}) | |
%iadd_11 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_4_bn3, %self_layer3_3_relu_2), kwargs = {}) | |
%self_layer3_4_relu_2 : [#users=2] = call_module[target=self_layer3_4_relu](args = (%iadd_11,), kwargs = {}) | |
%self_layer3_5_conv1 : [#users=1] = call_module[target=self_layer3_5_conv1](args = (%self_layer3_4_relu_2,), kwargs = {}) | |
%self_layer3_5_bn1 : [#users=1] = call_module[target=self_layer3_5_bn1](args = (%self_layer3_5_conv1,), kwargs = {}) | |
%self_layer3_5_relu : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn1,), kwargs = {}) | |
%self_layer3_5_conv2 : [#users=1] = call_module[target=self_layer3_5_conv2](args = (%self_layer3_5_relu,), kwargs = {}) | |
%self_layer3_5_bn2 : [#users=1] = call_module[target=self_layer3_5_bn2](args = (%self_layer3_5_conv2,), kwargs = {}) | |
%self_layer3_5_relu_1 : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn2,), kwargs = {}) | |
%self_layer3_5_conv3 : [#users=1] = call_module[target=self_layer3_5_conv3](args = (%self_layer3_5_relu_1,), kwargs = {}) | |
%self_layer3_5_bn3 : [#users=1] = call_module[target=self_layer3_5_bn3](args = (%self_layer3_5_conv3,), kwargs = {}) | |
%iadd_12 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_5_bn3, %self_layer3_4_relu_2), kwargs = {}) | |
%self_layer3_5_relu_2 : [#users=2] = call_module[target=self_layer3_5_relu](args = (%iadd_12,), kwargs = {}) | |
%self_layer4_0_conv1 : [#users=1] = call_module[target=self_layer4_0_conv1](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_bn1 : [#users=1] = call_module[target=self_layer4_0_bn1](args = (%self_layer4_0_conv1,), kwargs = {}) | |
%self_layer4_0_relu : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn1,), kwargs = {}) | |
%self_layer4_0_conv2 : [#users=1] = call_module[target=self_layer4_0_conv2](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_0_bn2 : [#users=1] = call_module[target=self_layer4_0_bn2](args = (%self_layer4_0_conv2,), kwargs = {}) | |
%self_layer4_0_relu_1 : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn2,), kwargs = {}) | |
%self_layer4_0_conv3 : [#users=1] = call_module[target=self_layer4_0_conv3](args = (%self_layer4_0_relu_1,), kwargs = {}) | |
%self_layer4_0_bn3 : [#users=1] = call_module[target=self_layer4_0_bn3](args = (%self_layer4_0_conv3,), kwargs = {}) | |
%self_layer4_0_downsample_0 : [#users=1] = call_module[target=self_layer4_0_downsample_0](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_downsample_1 : [#users=1] = call_module[target=self_layer4_0_downsample_1](args = (%self_layer4_0_downsample_0,), kwargs = {}) | |
%iadd_13 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_0_bn3, %self_layer4_0_downsample_1), kwargs = {}) | |
%self_layer4_0_relu_2 : [#users=2] = call_module[target=self_layer4_0_relu](args = (%iadd_13,), kwargs = {}) | |
%self_layer4_1_conv1 : [#users=1] = call_module[target=self_layer4_1_conv1](args = (%self_layer4_0_relu_2,), kwargs = {}) | |
%self_layer4_1_bn1 : [#users=1] = call_module[target=self_layer4_1_bn1](args = (%self_layer4_1_conv1,), kwargs = {}) | |
%self_layer4_1_relu : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn1,), kwargs = {}) | |
%self_layer4_1_conv2 : [#users=1] = call_module[target=self_layer4_1_conv2](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_1_bn2 : [#users=1] = call_module[target=self_layer4_1_bn2](args = (%self_layer4_1_conv2,), kwargs = {}) | |
%self_layer4_1_relu_1 : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn2,), kwargs = {}) | |
%self_layer4_1_conv3 : [#users=1] = call_module[target=self_layer4_1_conv3](args = (%self_layer4_1_relu_1,), kwargs = {}) | |
%self_layer4_1_bn3 : [#users=1] = call_module[target=self_layer4_1_bn3](args = (%self_layer4_1_conv3,), kwargs = {}) | |
%iadd_14 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_1_bn3, %self_layer4_0_relu_2), kwargs = {}) | |
%self_layer4_1_relu_2 : [#users=2] = call_module[target=self_layer4_1_relu](args = (%iadd_14,), kwargs = {}) | |
%self_layer4_2_conv1 : [#users=1] = call_module[target=self_layer4_2_conv1](args = (%self_layer4_1_relu_2,), kwargs = {}) | |
%self_layer4_2_bn1 : [#users=1] = call_module[target=self_layer4_2_bn1](args = (%self_layer4_2_conv1,), kwargs = {}) | |
%self_layer4_2_relu : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn1,), kwargs = {}) | |
%self_layer4_2_conv2 : [#users=1] = call_module[target=self_layer4_2_conv2](args = (%self_layer4_2_relu,), kwargs = {}) | |
%self_layer4_2_bn2 : [#users=1] = call_module[target=self_layer4_2_bn2](args = (%self_layer4_2_conv2,), kwargs = {}) | |
%self_layer4_2_relu_1 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn2,), kwargs = {}) | |
%self_layer4_2_conv3 : [#users=1] = call_module[target=self_layer4_2_conv3](args = (%self_layer4_2_relu_1,), kwargs = {}) | |
%self_layer4_2_bn3 : [#users=1] = call_module[target=self_layer4_2_bn3](args = (%self_layer4_2_conv3,), kwargs = {}) | |
%iadd_15 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_2_bn3, %self_layer4_1_relu_2), kwargs = {}) | |
%self_layer4_2_relu_2 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%iadd_15,), kwargs = {}) | |
%self_avgpool : [#users=1] = call_module[target=self_avgpool](args = (%self_layer4_2_relu_2,), kwargs = {}) | |
%flatten : [#users=1] = call_function[target=torch.flatten](args = (%self_avgpool, 1), kwargs = {}) | |
%self_fc : [#users=1] = call_module[target=self_fc](args = (%flatten,), kwargs = {}) | |
return (self_fc,) | |
---split graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
return (submod_3,) | |
--- graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
return (submod_3,) | |
---submod_0 graph--- | |
graph(): | |
%x : [#users=1] = placeholder[target=x] | |
%self_conv1 : [#users=1] = call_module[target=self_conv1](args = (%x,), kwargs = {}) | |
%self_bn1 : [#users=1] = call_module[target=self_bn1](args = (%self_conv1,), kwargs = {}) | |
%self_relu : [#users=1] = call_module[target=self_relu](args = (%self_bn1,), kwargs = {}) | |
%self_maxpool : [#users=2] = call_module[target=self_maxpool](args = (%self_relu,), kwargs = {}) | |
%self_layer1_0_conv1 : [#users=1] = call_module[target=self_layer1_0_conv1](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_bn1 : [#users=1] = call_module[target=self_layer1_0_bn1](args = (%self_layer1_0_conv1,), kwargs = {}) | |
%self_layer1_0_relu : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn1,), kwargs = {}) | |
%self_layer1_0_conv2 : [#users=1] = call_module[target=self_layer1_0_conv2](args = (%self_layer1_0_relu,), kwargs = {}) | |
%self_layer1_0_bn2 : [#users=1] = call_module[target=self_layer1_0_bn2](args = (%self_layer1_0_conv2,), kwargs = {}) | |
%self_layer1_0_relu_1 : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn2,), kwargs = {}) | |
%self_layer1_0_conv3 : [#users=1] = call_module[target=self_layer1_0_conv3](args = (%self_layer1_0_relu_1,), kwargs = {}) | |
%self_layer1_0_bn3 : [#users=1] = call_module[target=self_layer1_0_bn3](args = (%self_layer1_0_conv3,), kwargs = {}) | |
%self_layer1_0_downsample_0 : [#users=1] = call_module[target=self_layer1_0_downsample_0](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_downsample_1 : [#users=1] = call_module[target=self_layer1_0_downsample_1](args = (%self_layer1_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_0_bn3, %self_layer1_0_downsample_1), kwargs = {}) | |
%self_layer1_0_relu_2 : [#users=2] = call_module[target=self_layer1_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer1_1_conv1 : [#users=1] = call_module[target=self_layer1_1_conv1](args = (%self_layer1_0_relu_2,), kwargs = {}) | |
%self_layer1_1_bn1 : [#users=1] = call_module[target=self_layer1_1_bn1](args = (%self_layer1_1_conv1,), kwargs = {}) | |
%self_layer1_1_relu : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn1,), kwargs = {}) | |
%self_layer1_1_conv2 : [#users=1] = call_module[target=self_layer1_1_conv2](args = (%self_layer1_1_relu,), kwargs = {}) | |
%self_layer1_1_bn2 : [#users=1] = call_module[target=self_layer1_1_bn2](args = (%self_layer1_1_conv2,), kwargs = {}) | |
%self_layer1_1_relu_1 : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn2,), kwargs = {}) | |
%self_layer1_1_conv3 : [#users=1] = call_module[target=self_layer1_1_conv3](args = (%self_layer1_1_relu_1,), kwargs = {}) | |
%self_layer1_1_bn3 : [#users=1] = call_module[target=self_layer1_1_bn3](args = (%self_layer1_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_1_bn3, %self_layer1_0_relu_2), kwargs = {}) | |
%self_layer1_1_relu_2 : [#users=2] = call_module[target=self_layer1_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer1_2_conv1 : [#users=1] = call_module[target=self_layer1_2_conv1](args = (%self_layer1_1_relu_2,), kwargs = {}) | |
%self_layer1_2_bn1 : [#users=1] = call_module[target=self_layer1_2_bn1](args = (%self_layer1_2_conv1,), kwargs = {}) | |
%self_layer1_2_relu : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn1,), kwargs = {}) | |
%self_layer1_2_conv2 : [#users=1] = call_module[target=self_layer1_2_conv2](args = (%self_layer1_2_relu,), kwargs = {}) | |
%self_layer1_2_bn2 : [#users=1] = call_module[target=self_layer1_2_bn2](args = (%self_layer1_2_conv2,), kwargs = {}) | |
%self_layer1_2_relu_1 : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn2,), kwargs = {}) | |
%self_layer1_2_conv3 : [#users=1] = call_module[target=self_layer1_2_conv3](args = (%self_layer1_2_relu_1,), kwargs = {}) | |
%self_layer1_2_bn3 : [#users=1] = call_module[target=self_layer1_2_bn3](args = (%self_layer1_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_2_bn3, %self_layer1_1_relu_2), kwargs = {}) | |
%self_layer1_2_relu_2 : [#users=2] = call_module[target=self_layer1_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer2_0_conv1 : [#users=1] = call_module[target=self_layer2_0_conv1](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_bn1 : [#users=1] = call_module[target=self_layer2_0_bn1](args = (%self_layer2_0_conv1,), kwargs = {}) | |
%self_layer2_0_relu : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn1,), kwargs = {}) | |
%self_layer2_0_conv2 : [#users=1] = call_module[target=self_layer2_0_conv2](args = (%self_layer2_0_relu,), kwargs = {}) | |
%self_layer2_0_bn2 : [#users=1] = call_module[target=self_layer2_0_bn2](args = (%self_layer2_0_conv2,), kwargs = {}) | |
%self_layer2_0_relu_1 : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn2,), kwargs = {}) | |
%self_layer2_0_conv3 : [#users=1] = call_module[target=self_layer2_0_conv3](args = (%self_layer2_0_relu_1,), kwargs = {}) | |
%self_layer2_0_bn3 : [#users=1] = call_module[target=self_layer2_0_bn3](args = (%self_layer2_0_conv3,), kwargs = {}) | |
%self_layer2_0_downsample_0 : [#users=1] = call_module[target=self_layer2_0_downsample_0](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_downsample_1 : [#users=1] = call_module[target=self_layer2_0_downsample_1](args = (%self_layer2_0_downsample_0,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_0_bn3, %self_layer2_0_downsample_1), kwargs = {}) | |
%self_layer2_0_relu_2 : [#users=2] = call_module[target=self_layer2_0_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer2_1_conv1 : [#users=1] = call_module[target=self_layer2_1_conv1](args = (%self_layer2_0_relu_2,), kwargs = {}) | |
%self_layer2_1_bn1 : [#users=1] = call_module[target=self_layer2_1_bn1](args = (%self_layer2_1_conv1,), kwargs = {}) | |
%self_layer2_1_relu : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn1,), kwargs = {}) | |
%self_layer2_1_conv2 : [#users=1] = call_module[target=self_layer2_1_conv2](args = (%self_layer2_1_relu,), kwargs = {}) | |
%self_layer2_1_bn2 : [#users=1] = call_module[target=self_layer2_1_bn2](args = (%self_layer2_1_conv2,), kwargs = {}) | |
%self_layer2_1_relu_1 : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn2,), kwargs = {}) | |
%self_layer2_1_conv3 : [#users=1] = call_module[target=self_layer2_1_conv3](args = (%self_layer2_1_relu_1,), kwargs = {}) | |
%self_layer2_1_bn3 : [#users=1] = call_module[target=self_layer2_1_bn3](args = (%self_layer2_1_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_1_bn3, %self_layer2_0_relu_2), kwargs = {}) | |
%self_layer2_1_relu_2 : [#users=2] = call_module[target=self_layer2_1_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer2_2_conv1 : [#users=1] = call_module[target=self_layer2_2_conv1](args = (%self_layer2_1_relu_2,), kwargs = {}) | |
%self_layer2_2_bn1 : [#users=1] = call_module[target=self_layer2_2_bn1](args = (%self_layer2_2_conv1,), kwargs = {}) | |
%self_layer2_2_relu : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn1,), kwargs = {}) | |
%self_layer2_2_conv2 : [#users=1] = call_module[target=self_layer2_2_conv2](args = (%self_layer2_2_relu,), kwargs = {}) | |
%self_layer2_2_bn2 : [#users=1] = call_module[target=self_layer2_2_bn2](args = (%self_layer2_2_conv2,), kwargs = {}) | |
%self_layer2_2_relu_1 : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn2,), kwargs = {}) | |
%self_layer2_2_conv3 : [#users=1] = call_module[target=self_layer2_2_conv3](args = (%self_layer2_2_relu_1,), kwargs = {}) | |
%self_layer2_2_bn3 : [#users=1] = call_module[target=self_layer2_2_bn3](args = (%self_layer2_2_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_2_bn3, %self_layer2_1_relu_2), kwargs = {}) | |
%self_layer2_2_relu_2 : [#users=2] = call_module[target=self_layer2_2_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer2_3_conv1 : [#users=1] = call_module[target=self_layer2_3_conv1](args = (%self_layer2_2_relu_2,), kwargs = {}) | |
%self_layer2_3_bn1 : [#users=1] = call_module[target=self_layer2_3_bn1](args = (%self_layer2_3_conv1,), kwargs = {}) | |
%self_layer2_3_relu : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn1,), kwargs = {}) | |
%self_layer2_3_conv2 : [#users=1] = call_module[target=self_layer2_3_conv2](args = (%self_layer2_3_relu,), kwargs = {}) | |
%self_layer2_3_bn2 : [#users=1] = call_module[target=self_layer2_3_bn2](args = (%self_layer2_3_conv2,), kwargs = {}) | |
%self_layer2_3_relu_1 : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn2,), kwargs = {}) | |
%self_layer2_3_conv3 : [#users=1] = call_module[target=self_layer2_3_conv3](args = (%self_layer2_3_relu_1,), kwargs = {}) | |
%self_layer2_3_bn3 : [#users=1] = call_module[target=self_layer2_3_bn3](args = (%self_layer2_3_conv3,), kwargs = {}) | |
%iadd_6 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_3_bn3, %self_layer2_2_relu_2), kwargs = {}) | |
%self_layer2_3_relu_2 : [#users=2] = call_module[target=self_layer2_3_relu](args = (%iadd_6,), kwargs = {}) | |
%self_layer3_0_conv1 : [#users=1] = call_module[target=self_layer3_0_conv1](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_bn1 : [#users=1] = call_module[target=self_layer3_0_bn1](args = (%self_layer3_0_conv1,), kwargs = {}) | |
%self_layer3_0_relu : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn1,), kwargs = {}) | |
%self_layer3_0_conv2 : [#users=1] = call_module[target=self_layer3_0_conv2](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_0_bn2 : [#users=1] = call_module[target=self_layer3_0_bn2](args = (%self_layer3_0_conv2,), kwargs = {}) | |
%self_layer3_0_relu_1 : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn2,), kwargs = {}) | |
%self_layer3_0_conv3 : [#users=1] = call_module[target=self_layer3_0_conv3](args = (%self_layer3_0_relu_1,), kwargs = {}) | |
%self_layer3_0_bn3 : [#users=1] = call_module[target=self_layer3_0_bn3](args = (%self_layer3_0_conv3,), kwargs = {}) | |
%self_layer3_0_downsample_0 : [#users=1] = call_module[target=self_layer3_0_downsample_0](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_downsample_1 : [#users=1] = call_module[target=self_layer3_0_downsample_1](args = (%self_layer3_0_downsample_0,), kwargs = {}) | |
%iadd_7 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_0_bn3, %self_layer3_0_downsample_1), kwargs = {}) | |
%self_layer3_0_relu_2 : [#users=2] = call_module[target=self_layer3_0_relu](args = (%iadd_7,), kwargs = {}) | |
%self_layer3_1_conv1 : [#users=1] = call_module[target=self_layer3_1_conv1](args = (%self_layer3_0_relu_2,), kwargs = {}) | |
%self_layer3_1_bn1 : [#users=1] = call_module[target=self_layer3_1_bn1](args = (%self_layer3_1_conv1,), kwargs = {}) | |
%self_layer3_1_relu : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn1,), kwargs = {}) | |
%self_layer3_1_conv2 : [#users=1] = call_module[target=self_layer3_1_conv2](args = (%self_layer3_1_relu,), kwargs = {}) | |
%self_layer3_1_bn2 : [#users=1] = call_module[target=self_layer3_1_bn2](args = (%self_layer3_1_conv2,), kwargs = {}) | |
%self_layer3_1_relu_1 : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn2,), kwargs = {}) | |
%self_layer3_1_conv3 : [#users=1] = call_module[target=self_layer3_1_conv3](args = (%self_layer3_1_relu_1,), kwargs = {}) | |
%self_layer3_1_bn3 : [#users=1] = call_module[target=self_layer3_1_bn3](args = (%self_layer3_1_conv3,), kwargs = {}) | |
%iadd_8 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_1_bn3, %self_layer3_0_relu_2), kwargs = {}) | |
%self_layer3_1_relu_2 : [#users=2] = call_module[target=self_layer3_1_relu](args = (%iadd_8,), kwargs = {}) | |
%self_layer3_2_conv1 : [#users=1] = call_module[target=self_layer3_2_conv1](args = (%self_layer3_1_relu_2,), kwargs = {}) | |
%self_layer3_2_bn1 : [#users=1] = call_module[target=self_layer3_2_bn1](args = (%self_layer3_2_conv1,), kwargs = {}) | |
%self_layer3_2_relu : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn1,), kwargs = {}) | |
return (self_layer3_2_relu, self_layer3_1_relu_2) | |
---submod_1 graph--- | |
graph(): | |
%self_layer3_2_relu : [#users=1] = placeholder[target=self_layer3_2_relu] | |
%self_layer3_1_relu_2 : [#users=1] = placeholder[target=self_layer3_1_relu_2] | |
%self_layer3_2_conv2 : [#users=1] = call_module[target=self_layer3_2_conv2](args = (%self_layer3_2_relu,), kwargs = {}) | |
%self_layer3_2_bn2 : [#users=1] = call_module[target=self_layer3_2_bn2](args = (%self_layer3_2_conv2,), kwargs = {}) | |
%self_layer3_2_relu_1 : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn2,), kwargs = {}) | |
%self_layer3_2_conv3 : [#users=1] = call_module[target=self_layer3_2_conv3](args = (%self_layer3_2_relu_1,), kwargs = {}) | |
%self_layer3_2_bn3 : [#users=1] = call_module[target=self_layer3_2_bn3](args = (%self_layer3_2_conv3,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_2_bn3, %self_layer3_1_relu_2), kwargs = {}) | |
%self_layer3_2_relu_2 : [#users=2] = call_module[target=self_layer3_2_relu](args = (%iadd,), kwargs = {}) | |
%self_layer3_3_conv1 : [#users=1] = call_module[target=self_layer3_3_conv1](args = (%self_layer3_2_relu_2,), kwargs = {}) | |
%self_layer3_3_bn1 : [#users=1] = call_module[target=self_layer3_3_bn1](args = (%self_layer3_3_conv1,), kwargs = {}) | |
%self_layer3_3_relu : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn1,), kwargs = {}) | |
%self_layer3_3_conv2 : [#users=1] = call_module[target=self_layer3_3_conv2](args = (%self_layer3_3_relu,), kwargs = {}) | |
%self_layer3_3_bn2 : [#users=1] = call_module[target=self_layer3_3_bn2](args = (%self_layer3_3_conv2,), kwargs = {}) | |
%self_layer3_3_relu_1 : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn2,), kwargs = {}) | |
%self_layer3_3_conv3 : [#users=1] = call_module[target=self_layer3_3_conv3](args = (%self_layer3_3_relu_1,), kwargs = {}) | |
%self_layer3_3_bn3 : [#users=1] = call_module[target=self_layer3_3_bn3](args = (%self_layer3_3_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_3_bn3, %self_layer3_2_relu_2), kwargs = {}) | |
%self_layer3_3_relu_2 : [#users=2] = call_module[target=self_layer3_3_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer3_4_conv1 : [#users=1] = call_module[target=self_layer3_4_conv1](args = (%self_layer3_3_relu_2,), kwargs = {}) | |
%self_layer3_4_bn1 : [#users=1] = call_module[target=self_layer3_4_bn1](args = (%self_layer3_4_conv1,), kwargs = {}) | |
%self_layer3_4_relu : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn1,), kwargs = {}) | |
%self_layer3_4_conv2 : [#users=1] = call_module[target=self_layer3_4_conv2](args = (%self_layer3_4_relu,), kwargs = {}) | |
%self_layer3_4_bn2 : [#users=1] = call_module[target=self_layer3_4_bn2](args = (%self_layer3_4_conv2,), kwargs = {}) | |
%self_layer3_4_relu_1 : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn2,), kwargs = {}) | |
%self_layer3_4_conv3 : [#users=1] = call_module[target=self_layer3_4_conv3](args = (%self_layer3_4_relu_1,), kwargs = {}) | |
%self_layer3_4_bn3 : [#users=1] = call_module[target=self_layer3_4_bn3](args = (%self_layer3_4_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_4_bn3, %self_layer3_3_relu_2), kwargs = {}) | |
%self_layer3_4_relu_2 : [#users=2] = call_module[target=self_layer3_4_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer3_5_conv1 : [#users=1] = call_module[target=self_layer3_5_conv1](args = (%self_layer3_4_relu_2,), kwargs = {}) | |
%self_layer3_5_bn1 : [#users=1] = call_module[target=self_layer3_5_bn1](args = (%self_layer3_5_conv1,), kwargs = {}) | |
%self_layer3_5_relu : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn1,), kwargs = {}) | |
%self_layer3_5_conv2 : [#users=1] = call_module[target=self_layer3_5_conv2](args = (%self_layer3_5_relu,), kwargs = {}) | |
%self_layer3_5_bn2 : [#users=1] = call_module[target=self_layer3_5_bn2](args = (%self_layer3_5_conv2,), kwargs = {}) | |
%self_layer3_5_relu_1 : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn2,), kwargs = {}) | |
%self_layer3_5_conv3 : [#users=1] = call_module[target=self_layer3_5_conv3](args = (%self_layer3_5_relu_1,), kwargs = {}) | |
%self_layer3_5_bn3 : [#users=1] = call_module[target=self_layer3_5_bn3](args = (%self_layer3_5_conv3,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_5_bn3, %self_layer3_4_relu_2), kwargs = {}) | |
%self_layer3_5_relu_2 : [#users=2] = call_module[target=self_layer3_5_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer4_0_conv1 : [#users=1] = call_module[target=self_layer4_0_conv1](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_bn1 : [#users=1] = call_module[target=self_layer4_0_bn1](args = (%self_layer4_0_conv1,), kwargs = {}) | |
%self_layer4_0_relu : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn1,), kwargs = {}) | |
%self_layer4_0_conv2 : [#users=1] = call_module[target=self_layer4_0_conv2](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_0_bn2 : [#users=1] = call_module[target=self_layer4_0_bn2](args = (%self_layer4_0_conv2,), kwargs = {}) | |
%self_layer4_0_relu_1 : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn2,), kwargs = {}) | |
return (self_layer4_0_relu_1, self_layer3_5_relu_2) | |
---submod_2 graph--- | |
graph(): | |
%self_layer4_0_relu_1 : [#users=1] = placeholder[target=self_layer4_0_relu_1] | |
%self_layer3_5_relu_2 : [#users=1] = placeholder[target=self_layer3_5_relu_2] | |
%self_layer4_0_conv3 : [#users=1] = call_module[target=self_layer4_0_conv3](args = (%self_layer4_0_relu_1,), kwargs = {}) | |
%self_layer4_0_bn3 : [#users=1] = call_module[target=self_layer4_0_bn3](args = (%self_layer4_0_conv3,), kwargs = {}) | |
%self_layer4_0_downsample_0 : [#users=1] = call_module[target=self_layer4_0_downsample_0](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_downsample_1 : [#users=1] = call_module[target=self_layer4_0_downsample_1](args = (%self_layer4_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_0_bn3, %self_layer4_0_downsample_1), kwargs = {}) | |
%self_layer4_0_relu : [#users=2] = call_module[target=self_layer4_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer4_1_conv1 : [#users=1] = call_module[target=self_layer4_1_conv1](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_1_bn1 : [#users=1] = call_module[target=self_layer4_1_bn1](args = (%self_layer4_1_conv1,), kwargs = {}) | |
%self_layer4_1_relu : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn1,), kwargs = {}) | |
%self_layer4_1_conv2 : [#users=1] = call_module[target=self_layer4_1_conv2](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_1_bn2 : [#users=1] = call_module[target=self_layer4_1_bn2](args = (%self_layer4_1_conv2,), kwargs = {}) | |
%self_layer4_1_relu_1 : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn2,), kwargs = {}) | |
return (self_layer4_1_relu_1, self_layer4_0_relu) | |
---submod_3 graph--- | |
graph(): | |
%self_layer4_1_relu_1 : [#users=1] = placeholder[target=self_layer4_1_relu_1] | |
%self_layer4_0_relu_2 : [#users=1] = placeholder[target=self_layer4_0_relu_2] | |
%self_layer4_1_conv3 : [#users=1] = call_module[target=self_layer4_1_conv3](args = (%self_layer4_1_relu_1,), kwargs = {}) | |
%self_layer4_1_bn3 : [#users=1] = call_module[target=self_layer4_1_bn3](args = (%self_layer4_1_conv3,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_1_bn3, %self_layer4_0_relu_2), kwargs = {}) | |
%self_layer4_1_relu : [#users=2] = call_module[target=self_layer4_1_relu](args = (%iadd,), kwargs = {}) | |
%self_layer4_2_conv1 : [#users=1] = call_module[target=self_layer4_2_conv1](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_2_bn1 : [#users=1] = call_module[target=self_layer4_2_bn1](args = (%self_layer4_2_conv1,), kwargs = {}) | |
%self_layer4_2_relu : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn1,), kwargs = {}) | |
%self_layer4_2_conv2 : [#users=1] = call_module[target=self_layer4_2_conv2](args = (%self_layer4_2_relu,), kwargs = {}) | |
%self_layer4_2_bn2 : [#users=1] = call_module[target=self_layer4_2_bn2](args = (%self_layer4_2_conv2,), kwargs = {}) | |
%self_layer4_2_relu_1 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn2,), kwargs = {}) | |
%self_layer4_2_conv3 : [#users=1] = call_module[target=self_layer4_2_conv3](args = (%self_layer4_2_relu_1,), kwargs = {}) | |
%self_layer4_2_bn3 : [#users=1] = call_module[target=self_layer4_2_bn3](args = (%self_layer4_2_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_2_bn3, %self_layer4_1_relu), kwargs = {}) | |
%self_layer4_2_relu_2 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%iadd_1,), kwargs = {}) | |
%self_avgpool : [#users=1] = call_module[target=self_avgpool](args = (%self_layer4_2_relu_2,), kwargs = {}) | |
%flatten : [#users=1] = call_function[target=torch.flatten](args = (%self_avgpool, 1), kwargs = {}) | |
%self_fc : [#users=1] = call_module[target=self_fc](args = (%flatten,), kwargs = {}) | |
return self_fc | |
--------------- | |
run_node placeholder, x got args tuple() | |
run_node call_module, submod_0 got args tuple(T[torch.Size([32, 3, 224, 224])]) | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 256, 14, 14])], T[torch.Size([32, 1024, 14, 14])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 256, 14, 14])], T[torch.Size([32, 1024, 14, 14])]), 1) | |
run_node call_module, submod_1 got args tuple(T[torch.Size([32, 256, 14, 14])], T[torch.Size([32, 1024, 14, 14])]) | |
still waiting for process to exit | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 1024, 14, 14])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 1024, 14, 14])]), 1) | |
run_node call_module, submod_2 got args tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 1024, 14, 14])]) | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 2048, 7, 7])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 2048, 7, 7])]), 1) | |
run_node call_module, submod_3 got args tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 2048, 7, 7])]) | |
run_node output, output got args tuple(tuple(T[torch.Size([32, 1000])])) | |
---final graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=compiled_submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=compiled_submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=compiled_submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
return (submod_3,) | |
--------------- | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
result {'latency_median': 88.8248291015625, 'latency_stdev': 0.22256090897985134} | |
has been placed into the queue | |
got the result! | |
got exit 0 | |
(already got results) | |
exit code: 0 and result: {'nodes': 2, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'torchdynamo_inductor', 'has_breaks': True, 'static_graph': False, 'result': {'latency_median': 88.8248291015625, 'latency_stdev': 0.22256090897985134}} | |
<RESULT>{"nodes": 2, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "torchdynamo_inductor", "has_breaks": true, "static_graph": false, "result": {"latency_median": 88.8248291015625, "latency_stdev": 0.22256090897985134}}</RESULT> | |
submitit INFO (2022-10-26 02:57:12,347) - Job completed successfully |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead. | |
warnings.warn( | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or `None` for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing `weights=ResNet50_Weights.IMAGENET1K_V1`. You can also use `weights=ResNet50_Weights.DEFAULT` to get the most up-to-date weights. | |
warnings.warn(msg) | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead. | |
warnings.warn( | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or `None` for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing `weights=ResNet50_Weights.IMAGENET1K_V1`. You can also use `weights=ResNet50_Weights.DEFAULT` to get the most up-to-date weights. | |
warnings.warn(msg) | |
[2022-10-26 23:11:52,922] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:11:54,279] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:13:29,380] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 23:13:29,380] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
[2022-10-26 23:13:29,554] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:13:29,806] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:13:30,055] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:14:47,694] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 23:14:47,695] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:208: UserWarning: The parameter 'pretrained' is deprecated since 0.13 and may be removed in the future, please use 'weights' instead. | |
warnings.warn( | |
/data/home/dberard/miniconda/envs/bench-fast/lib/python3.8/site-packages/torchvision/models/_utils.py:223: UserWarning: Arguments other than a weight enum or `None` for 'weights' are deprecated since 0.13 and may be removed in the future. The current behavior is equivalent to passing `weights=ResNet50_Weights.IMAGENET1K_V1`. You can also use `weights=ResNet50_Weights.DEFAULT` to get the most up-to-date weights. | |
warnings.warn(msg) | |
[2022-10-26 23:15:10,239] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:15:11,591] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:15:22,586] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 23:15:22,586] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
[2022-10-26 23:16:43,761] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 23:16:43,761] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager | |
[2022-10-26 23:16:43,938] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:16:44,190] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:16:44,437] torch._dynamo.variables.torch: [WARNING] Profiler will be ignored | |
[2022-10-26 23:18:00,742] torch._dynamo.optimizations.training: [WARNING] Unable to use Aot Autograd because of presence of mutation | |
[2022-10-26 23:18:00,743] torch._inductor.compile_fx: [WARNING] Aot Autograd is not safe to run, so falling back to eager |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
submitit INFO (2022-10-26 23:11:13,091) - Starting with JobEnvironment(job_id=74383, hostname=a100-st-p4d24xlarge-35, local_rank=0(8), node=0(1), global_rank=0(8)) | |
submitit INFO (2022-10-26 23:11:13,092) - Loading pickle: /fsx/users/dberard/scratch-local/bench-fast/benchmark/logs_oct25/74383_submitted.pkl | |
This is node 0 | |
run_once | |
Process group: 8 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-35 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-35:49529:49529 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-35:49529:49529 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-35:49529:49529 [0] NCCL INFO Bootstrap : Using ens32:10.200.70.12<0> | |
a100-st-p4d24xlarge-35:49529:49529 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-35:49529:49529 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-35:49529:49529 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002dd0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002e10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002e50 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002e90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002ed0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002f10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002f50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002f90 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8002fd0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003010 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003050 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003090 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80030d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003110 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003150 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003190 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80031d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003210 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003250 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003290 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80032d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003310 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003350 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003390 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80033d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003410 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003450 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003490 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80034d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 28 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003510 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 29 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003550 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003590 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 08/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80035d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 09/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003610 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 10/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003650 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 11/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003690 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 12/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80036d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 13/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003710 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 14/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003750 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 15/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003790 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 16/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80037d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 17/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003810 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 18/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003850 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 19/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003890 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 20/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80038d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 44 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 21/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003910 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 45 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 22/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003950 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 46 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Channel 23/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003990 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 47 from local rank 0, transport 0 | |
still waiting for process to exit | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af80039d0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 48 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 49 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003a10 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003a50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 50 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003a90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 51 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003ad0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 52 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003b10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 53 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003b50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 54 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003b90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 55 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003bd0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 56 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003c10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 57 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003c50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 58 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003c90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 59 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003cd0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 60 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003d10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 61 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003d50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 62 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003d90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 63 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003dd0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 64 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003e10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 65 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003e50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 66 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003e90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 67 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003ed0 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 68 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003f10 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 69 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003f50 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 70 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003f90 | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy recv connection 71 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer | |
a100-st-p4d24xlarge-35:49529:50077 [0] NCCL INFO New proxy send connection 72 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af8003fd0 | |
a100-st-p4d24xlarge-35:49529:50028 [0] NCCL INFO comm 0x5562b12335c0 rank 0 nranks 8 cudaDev 0 busId 101c0 - Init COMPLETE | |
result {'latency_median': 63.02003288269043, 'latency_stdev': 0.2317810499888692} | |
has been placed into the queue | |
got the result! | |
got exit 0 | |
(already got results) | |
exit code: 0 and result: {'nodes': 1, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'eager', 'has_breaks': False, 'static_graph': False, 'result': {'latency_median': 63.02003288269043, 'latency_stdev': 0.2317810499888692}} | |
<RESULT>{"nodes": 1, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "eager", "has_breaks": false, "static_graph": false, "result": {"latency_median": 63.02003288269043, "latency_stdev": 0.2317810499888692}}</RESULT> | |
run_once | |
Process group: 8 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-35 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-35:50204:50204 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-35:50204:50204 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-35:50204:50204 [0] NCCL INFO Bootstrap : Using ens32:10.200.70.12<0> | |
a100-st-p4d24xlarge-35:50204:50204 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-35:50204:50204 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-35:50204:50204 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
still waiting for process to exit | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002dd0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002e10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002e50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002e90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002ed0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002f10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002f50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002f90 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002fd0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003010 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003050 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003090 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00030d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003110 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003150 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003190 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00031d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003210 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003250 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003290 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00032d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003310 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003350 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003390 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00033d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003410 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003450 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003490 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00034d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 28 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003510 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 29 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003550 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003590 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 08/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00035d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 09/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003610 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 10/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003650 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 11/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003690 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 12/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00036d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 13/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003710 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 14/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003750 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 15/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003790 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 16/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00037d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 17/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003810 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 18/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003850 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 19/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003890 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 20/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00038d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 44 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 21/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003910 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 45 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 22/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003950 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 46 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Channel 23/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003990 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 47 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00039d0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 48 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 49 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003a10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 50 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003a50 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003a90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 51 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003ad0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 52 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003b10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 53 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003b50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 54 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003b90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 55 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003bd0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 56 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003c10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 57 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003c50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 58 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003c90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 59 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003cd0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 60 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003d10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 61 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003d50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 62 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003d90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 63 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003dd0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 64 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003e10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 65 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003e50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 66 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003e90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 67 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003ed0 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 68 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003f10 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 69 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003f50 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 70 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003f90 | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy recv connection 71 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer | |
a100-st-p4d24xlarge-35:50204:51338 [0] NCCL INFO New proxy send connection 72 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003fd0 | |
a100-st-p4d24xlarge-35:50204:51295 [0] NCCL INFO comm 0x5562b16bb6f0 rank 0 nranks 8 cudaDev 0 busId 101c0 - Init COMPLETE | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
result {'latency_median': 73.09516906738281, 'latency_stdev': 0.6081925671589652} | |
has been placed into the queue | |
got the result! | |
got exit 0 | |
(already got results) | |
exit code: 0 and result: {'nodes': 1, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'torchdynamo_inductor', 'has_breaks': False, 'static_graph': False, 'result': {'latency_median': 73.09516906738281, 'latency_stdev': 0.6081925671589652}} | |
<RESULT>{"nodes": 1, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "torchdynamo_inductor", "has_breaks": false, "static_graph": false, "result": {"latency_median": 73.09516906738281, "latency_stdev": 0.6081925671589652}}</RESULT> | |
run_once | |
Process group: 8 tasks, rank: 0 | |
MY HOSTNAME: a100-st-p4d24xlarge-35 | |
FI_PROVIDER : efa | |
LD_LIBRARY_PATH : /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/lib:/opt/amazon/efa/lib:/usr/local/cuda-11.6/lib:/usr/local/cuda-11.6/lib64:/opt/amazon/efa/lib:/opt/amazon/openmpi/lib:/usr/local/cuda/efa/lib:/usr/local/cuda/lib:/usr/local/cuda:/usr/local/cuda/lib64:/usr/local/cuda/extras/CUPTI/lib64:/usr/local/cuda/targets/x86_64-linux/lib:/usr/local/lib:/usr/lib: | |
NCCL_DEBUG : INFO | |
FI_EFA_USE_DEVICE_RDMA : 1 | |
still waiting for process to exit | |
didn't get the result, queue empty | |
a100-st-p4d24xlarge-35:54314:54314 [0] NCCL INFO NCCL_SOCKET_IFNAME set by environment to ens | |
a100-st-p4d24xlarge-35:54314:54314 [0] NCCL INFO NCCL_SOCKET_IFNAME set to ens | |
a100-st-p4d24xlarge-35:54314:54314 [0] NCCL INFO Bootstrap : Using ens32:10.200.70.12<0> | |
a100-st-p4d24xlarge-35:54314:54314 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin_v6 symbol. | |
a100-st-p4d24xlarge-35:54314:54314 [0] NCCL INFO NET/Plugin: Failed to find ncclCollNetPlugin symbol (v4 or v5). | |
a100-st-p4d24xlarge-35:54314:54314 [0] NCCL INFO cudaDriverVersion 11060 | |
NCCL version 2.14.3+cuda11.6 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/OFI Using aws-ofi-nccl 1.4.0aws | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/OFI Running on p4d.24xlarge platform, Setting NCCL_TOPO_FILE environment variable to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/OFI Setting FI_EFA_FORK_SAFE environment variable to 1 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/OFI Selected Provider is efa | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Using network AWS Libfabric | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NCCL_TOPO_FILE set by environment to /fsx/users/dberard/scratch-local/bench-fast/aws-ofi-nccl/share/aws-ofi-nccl/xml/p4d-24xl-topo.xml | |
still waiting for process to exit | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 0 'rdmap16s27' | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 1 'rdmap32s27' | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 2 'rdmap144s27' | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NET/AWS Libfabric : GPU Direct RDMA Enabled for HCA 3 'rdmap160s27' | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 0 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 1 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 2 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 101d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 201d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU 901d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01c0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO GPU Direct RDMA Enabled for GPU a01d0 / HCA 3 (distance 3 <= 4), read 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Setting affinity for GPU 0 to 1f0000,0000001f | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 00/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 01/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 02/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 03/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 04/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 05/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 06/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 07/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 08/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 09/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 10/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 11/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 12/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 13/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 14/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 15/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 16/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 17/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 18/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 19/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 20/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 21/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 22/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 23/24 : 0 1 2 3 4 5 6 7 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Trees [0] 1/-1/-1->0->-1 [1] 1/-1/-1->0->-1 [2] 1/-1/-1->0->-1 [3] 1/-1/-1->0->-1 [4] 1/-1/-1->0->-1 [5] 1/-1/-1->0->-1 [6] 1/-1/-1->0->-1 [7] 1/-1/-1->0->-1 [8] 1/-1/-1->0->-1 [9] 1/-1/-1->0->-1 [10] 1/-1/-1->0->-1 [11] 1/-1/-1->0->-1 [12] 1/-1/-1->0->-1 [13] 1/-1/-1->0->-1 [14] 1/-1/-1->0->-1 [15] 1/-1/-1->0->-1 [16] 1/-1/-1->0->-1 [17] 1/-1/-1->0->-1 [18] 1/-1/-1->0->-1 [19] 1/-1/-1->0->-1 [20] 1/-1/-1->0->-1 [21] 1/-1/-1->0->-1 [22] 1/-1/-1->0->-1 [23] 1/-1/-1->0->-1 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002dd0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 0 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002e10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 1 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 2 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002e50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 3 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002e90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 4 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002ed0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 5 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002f10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 6 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002f50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 7 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002f90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 8 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0002fd0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003010 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 9 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003050 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 10 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003090 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 11 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00030d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 12 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003110 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 13 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003150 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 14 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003190 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 15 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00031d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 16 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003210 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 17 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003250 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 18 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003290 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 19 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00032d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 20 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003310 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 21 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003350 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 22 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003390 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 23 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 00/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00033d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 24 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 01/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003410 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 25 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 02/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003450 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 26 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 03/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003490 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 27 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 04/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00034d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 28 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 05/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003510 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 29 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 06/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003550 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 30 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 07/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003590 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 31 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 08/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00035d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 32 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 09/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003610 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 33 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 10/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003650 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 34 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 11/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003690 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 35 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 12/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00036d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 36 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 13/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003710 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 37 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 14/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003750 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 38 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 15/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003790 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 39 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 16/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00037d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 40 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 17/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003810 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 41 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 18/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003850 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 42 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 19/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003890 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 43 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 20/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00038d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 44 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 21/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003910 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 45 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 22/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003950 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 46 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Channel 23/0 : 0[101c0] -> 1[101d0] via P2P/IPC/read | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003990 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 47 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connected all rings | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af00039d0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 48 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003a10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 49 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 50 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003a50 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003a90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 51 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003ad0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 52 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003b10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 53 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003b50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 54 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003b90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 55 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003bd0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 56 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003c10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 57 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003c50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 58 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003c90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 59 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003cd0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 60 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003d10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 61 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003d50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 62 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003d90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 63 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003dd0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 64 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003e10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 65 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003e50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 66 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003e90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 67 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003ed0 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 68 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003f10 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 69 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003f50 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 70 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003f90 | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy recv connection 71 from local rank 0, transport 0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connected all trees | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO NCCL_ALGO set by environment to ring | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO threadThresholds 8/8/64 | 64/8/64 | 512 | 512 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO 24 coll channels, 32 p2p channels, 32 p2p channels per peer | |
a100-st-p4d24xlarge-35:54314:55424 [0] NCCL INFO New proxy send connection 72 from local rank 0, transport 2 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO Connection to proxy localRank 0 -> connection 0x7f4af0003fd0 | |
a100-st-p4d24xlarge-35:54314:55382 [0] NCCL INFO comm 0x5562b1c1a0a0 rank 0 nranks 8 cudaDev 0 busId 101c0 - Init COMPLETE | |
didn't get the result, queue empty | |
DDPOptimizer used bucket cap 26214400 and produced the following buckets: | |
Index Size (b) Param Names | |
------- ---------- --------------------------------- | |
0 8196000 self_fc_weight | |
self_fc_bias | |
1 31502336 self_layer4_2_bn3_weight | |
self_layer4_2_bn3_bias | |
self_layer4_2_conv3_weight | |
self_layer4_2_bn2_weight | |
self_layer4_2_bn2_bias | |
self_layer4_2_conv2_weight | |
self_layer4_2_bn1_weight | |
self_layer4_2_bn1_bias | |
self_layer4_2_conv1_weight | |
self_layer4_1_bn3_weight | |
self_layer4_1_bn3_bias | |
self_layer4_1_conv3_weight | |
self_layer4_1_bn2_weight | |
self_layer4_1_bn2_bias | |
self_layer4_1_conv2_weight | |
2 26255360 self_layer4_1_bn1_weight | |
self_layer4_1_bn1_bias | |
self_layer4_1_conv1_weight | |
self_layer4_0_downsample_1_weight | |
self_layer4_0_downsample_1_bias | |
self_layer4_0_downsample_0_weight | |
self_layer4_0_bn3_weight | |
self_layer4_0_bn3_bias | |
self_layer4_0_conv3_weight | |
self_layer4_0_bn2_weight | |
self_layer4_0_bn2_bias | |
self_layer4_0_conv2_weight | |
3 26550272 self_layer4_0_bn1_weight | |
self_layer4_0_bn1_bias | |
self_layer4_0_conv1_weight | |
self_layer3_5_bn3_weight | |
self_layer3_5_bn3_bias | |
self_layer3_5_conv3_weight | |
self_layer3_5_bn2_weight | |
self_layer3_5_bn2_bias | |
self_layer3_5_conv2_weight | |
self_layer3_5_bn1_weight | |
self_layer3_5_bn1_bias | |
self_layer3_5_conv1_weight | |
self_layer3_4_bn3_weight | |
self_layer3_4_bn3_bias | |
self_layer3_4_conv3_weight | |
self_layer3_4_bn2_weight | |
self_layer3_4_bn2_bias | |
self_layer3_4_conv2_weight | |
self_layer3_4_bn1_weight | |
self_layer3_4_bn1_bias | |
self_layer3_4_conv1_weight | |
self_layer3_3_bn3_weight | |
self_layer3_3_bn3_bias | |
self_layer3_3_conv3_weight | |
self_layer3_3_bn2_weight | |
self_layer3_3_bn2_bias | |
self_layer3_3_conv2_weight | |
self_layer3_3_bn1_weight | |
self_layer3_3_bn1_bias | |
self_layer3_3_conv1_weight | |
self_layer3_2_bn3_weight | |
self_layer3_2_bn3_bias | |
self_layer3_2_conv3_weight | |
self_layer3_2_bn2_weight | |
self_layer3_2_bn2_bias | |
self_layer3_2_conv2_weight | |
self_layer3_2_bn1_weight | |
self_layer3_2_bn1_bias | |
self_layer3_2_conv1_weight | |
self_layer3_1_bn3_weight | |
self_layer3_1_bn3_bias | |
self_layer3_1_conv3_weight | |
self_layer3_1_bn2_weight | |
self_layer3_1_bn2_bias | |
self_layer3_1_conv2_weight | |
self_layer3_1_bn1_weight | |
self_layer3_1_bn1_bias | |
self_layer3_1_conv1_weight | |
self_layer3_0_downsample_1_weight | |
self_layer3_0_downsample_1_bias | |
self_layer3_0_downsample_0_weight | |
4 9724160 self_layer3_0_bn3_weight | |
self_layer3_0_bn3_bias | |
self_layer3_0_conv3_weight | |
self_layer3_0_bn2_weight | |
self_layer3_0_bn2_bias | |
self_layer3_0_conv2_weight | |
self_layer3_0_bn1_weight | |
self_layer3_0_bn1_bias | |
self_layer3_0_conv1_weight | |
self_layer2_3_bn3_weight | |
self_layer2_3_bn3_bias | |
self_layer2_3_conv3_weight | |
self_layer2_3_bn2_weight | |
self_layer2_3_bn2_bias | |
self_layer2_3_conv2_weight | |
self_layer2_3_bn1_weight | |
self_layer2_3_bn1_bias | |
self_layer2_3_conv1_weight | |
self_layer2_2_bn3_weight | |
self_layer2_2_bn3_bias | |
self_layer2_2_conv3_weight | |
self_layer2_2_bn2_weight | |
self_layer2_2_bn2_bias | |
self_layer2_2_conv2_weight | |
self_layer2_2_bn1_weight | |
self_layer2_2_bn1_bias | |
self_layer2_2_conv1_weight | |
self_layer2_1_bn3_weight | |
self_layer2_1_bn3_bias | |
self_layer2_1_conv3_weight | |
self_layer2_1_bn2_weight | |
self_layer2_1_bn2_bias | |
self_layer2_1_conv2_weight | |
self_layer2_1_bn1_weight | |
self_layer2_1_bn1_bias | |
self_layer2_1_conv1_weight | |
self_layer2_0_downsample_1_weight | |
self_layer2_0_downsample_1_bias | |
self_layer2_0_downsample_0_weight | |
self_layer2_0_bn3_weight | |
self_layer2_0_bn3_bias | |
self_layer2_0_conv3_weight | |
self_layer2_0_bn2_weight | |
self_layer2_0_bn2_bias | |
self_layer2_0_conv2_weight | |
self_layer2_0_bn1_weight | |
self_layer2_0_bn1_bias | |
self_layer2_0_conv1_weight | |
self_layer1_2_bn3_weight | |
self_layer1_2_bn3_bias | |
self_layer1_2_conv3_weight | |
self_layer1_2_bn2_weight | |
self_layer1_2_bn2_bias | |
self_layer1_2_conv2_weight | |
self_layer1_2_bn1_weight | |
self_layer1_2_bn1_bias | |
self_layer1_2_conv1_weight | |
self_layer1_1_bn3_weight | |
self_layer1_1_bn3_bias | |
self_layer1_1_conv3_weight | |
self_layer1_1_bn2_weight | |
self_layer1_1_bn2_bias | |
self_layer1_1_conv2_weight | |
self_layer1_1_bn1_weight | |
self_layer1_1_bn1_bias | |
self_layer1_1_conv1_weight | |
self_layer1_0_downsample_1_weight | |
self_layer1_0_downsample_1_bias | |
self_layer1_0_downsample_0_weight | |
self_layer1_0_bn3_weight | |
self_layer1_0_bn3_bias | |
self_layer1_0_conv3_weight | |
self_layer1_0_bn2_weight | |
self_layer1_0_bn2_bias | |
self_layer1_0_conv2_weight | |
self_layer1_0_bn1_weight | |
self_layer1_0_bn1_bias | |
self_layer1_0_conv1_weight | |
self_bn1_weight | |
self_bn1_bias | |
self_conv1_weight | |
---orig graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%self_conv1 : [#users=1] = call_module[target=self_conv1](args = (%x,), kwargs = {}) | |
%self_bn1 : [#users=1] = call_module[target=self_bn1](args = (%self_conv1,), kwargs = {}) | |
%self_relu : [#users=1] = call_module[target=self_relu](args = (%self_bn1,), kwargs = {}) | |
%self_maxpool : [#users=2] = call_module[target=self_maxpool](args = (%self_relu,), kwargs = {}) | |
%self_layer1_0_conv1 : [#users=1] = call_module[target=self_layer1_0_conv1](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_bn1 : [#users=1] = call_module[target=self_layer1_0_bn1](args = (%self_layer1_0_conv1,), kwargs = {}) | |
%self_layer1_0_relu : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn1,), kwargs = {}) | |
%self_layer1_0_conv2 : [#users=1] = call_module[target=self_layer1_0_conv2](args = (%self_layer1_0_relu,), kwargs = {}) | |
%self_layer1_0_bn2 : [#users=1] = call_module[target=self_layer1_0_bn2](args = (%self_layer1_0_conv2,), kwargs = {}) | |
%self_layer1_0_relu_1 : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn2,), kwargs = {}) | |
%self_layer1_0_conv3 : [#users=1] = call_module[target=self_layer1_0_conv3](args = (%self_layer1_0_relu_1,), kwargs = {}) | |
%self_layer1_0_bn3 : [#users=1] = call_module[target=self_layer1_0_bn3](args = (%self_layer1_0_conv3,), kwargs = {}) | |
%self_layer1_0_downsample_0 : [#users=1] = call_module[target=self_layer1_0_downsample_0](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_downsample_1 : [#users=1] = call_module[target=self_layer1_0_downsample_1](args = (%self_layer1_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_0_bn3, %self_layer1_0_downsample_1), kwargs = {}) | |
%self_layer1_0_relu_2 : [#users=2] = call_module[target=self_layer1_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer1_1_conv1 : [#users=1] = call_module[target=self_layer1_1_conv1](args = (%self_layer1_0_relu_2,), kwargs = {}) | |
%self_layer1_1_bn1 : [#users=1] = call_module[target=self_layer1_1_bn1](args = (%self_layer1_1_conv1,), kwargs = {}) | |
%self_layer1_1_relu : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn1,), kwargs = {}) | |
%self_layer1_1_conv2 : [#users=1] = call_module[target=self_layer1_1_conv2](args = (%self_layer1_1_relu,), kwargs = {}) | |
%self_layer1_1_bn2 : [#users=1] = call_module[target=self_layer1_1_bn2](args = (%self_layer1_1_conv2,), kwargs = {}) | |
%self_layer1_1_relu_1 : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn2,), kwargs = {}) | |
%self_layer1_1_conv3 : [#users=1] = call_module[target=self_layer1_1_conv3](args = (%self_layer1_1_relu_1,), kwargs = {}) | |
%self_layer1_1_bn3 : [#users=1] = call_module[target=self_layer1_1_bn3](args = (%self_layer1_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_1_bn3, %self_layer1_0_relu_2), kwargs = {}) | |
%self_layer1_1_relu_2 : [#users=2] = call_module[target=self_layer1_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer1_2_conv1 : [#users=1] = call_module[target=self_layer1_2_conv1](args = (%self_layer1_1_relu_2,), kwargs = {}) | |
%self_layer1_2_bn1 : [#users=1] = call_module[target=self_layer1_2_bn1](args = (%self_layer1_2_conv1,), kwargs = {}) | |
%self_layer1_2_relu : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn1,), kwargs = {}) | |
%self_layer1_2_conv2 : [#users=1] = call_module[target=self_layer1_2_conv2](args = (%self_layer1_2_relu,), kwargs = {}) | |
%self_layer1_2_bn2 : [#users=1] = call_module[target=self_layer1_2_bn2](args = (%self_layer1_2_conv2,), kwargs = {}) | |
%self_layer1_2_relu_1 : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn2,), kwargs = {}) | |
%self_layer1_2_conv3 : [#users=1] = call_module[target=self_layer1_2_conv3](args = (%self_layer1_2_relu_1,), kwargs = {}) | |
%self_layer1_2_bn3 : [#users=1] = call_module[target=self_layer1_2_bn3](args = (%self_layer1_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_2_bn3, %self_layer1_1_relu_2), kwargs = {}) | |
%self_layer1_2_relu_2 : [#users=2] = call_module[target=self_layer1_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer2_0_conv1 : [#users=1] = call_module[target=self_layer2_0_conv1](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_bn1 : [#users=1] = call_module[target=self_layer2_0_bn1](args = (%self_layer2_0_conv1,), kwargs = {}) | |
%self_layer2_0_relu : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn1,), kwargs = {}) | |
%self_layer2_0_conv2 : [#users=1] = call_module[target=self_layer2_0_conv2](args = (%self_layer2_0_relu,), kwargs = {}) | |
%self_layer2_0_bn2 : [#users=1] = call_module[target=self_layer2_0_bn2](args = (%self_layer2_0_conv2,), kwargs = {}) | |
%self_layer2_0_relu_1 : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn2,), kwargs = {}) | |
%self_layer2_0_conv3 : [#users=1] = call_module[target=self_layer2_0_conv3](args = (%self_layer2_0_relu_1,), kwargs = {}) | |
%self_layer2_0_bn3 : [#users=1] = call_module[target=self_layer2_0_bn3](args = (%self_layer2_0_conv3,), kwargs = {}) | |
%self_layer2_0_downsample_0 : [#users=1] = call_module[target=self_layer2_0_downsample_0](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_downsample_1 : [#users=1] = call_module[target=self_layer2_0_downsample_1](args = (%self_layer2_0_downsample_0,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_0_bn3, %self_layer2_0_downsample_1), kwargs = {}) | |
%self_layer2_0_relu_2 : [#users=2] = call_module[target=self_layer2_0_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer2_1_conv1 : [#users=1] = call_module[target=self_layer2_1_conv1](args = (%self_layer2_0_relu_2,), kwargs = {}) | |
%self_layer2_1_bn1 : [#users=1] = call_module[target=self_layer2_1_bn1](args = (%self_layer2_1_conv1,), kwargs = {}) | |
%self_layer2_1_relu : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn1,), kwargs = {}) | |
%self_layer2_1_conv2 : [#users=1] = call_module[target=self_layer2_1_conv2](args = (%self_layer2_1_relu,), kwargs = {}) | |
%self_layer2_1_bn2 : [#users=1] = call_module[target=self_layer2_1_bn2](args = (%self_layer2_1_conv2,), kwargs = {}) | |
%self_layer2_1_relu_1 : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn2,), kwargs = {}) | |
%self_layer2_1_conv3 : [#users=1] = call_module[target=self_layer2_1_conv3](args = (%self_layer2_1_relu_1,), kwargs = {}) | |
%self_layer2_1_bn3 : [#users=1] = call_module[target=self_layer2_1_bn3](args = (%self_layer2_1_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_1_bn3, %self_layer2_0_relu_2), kwargs = {}) | |
%self_layer2_1_relu_2 : [#users=2] = call_module[target=self_layer2_1_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer2_2_conv1 : [#users=1] = call_module[target=self_layer2_2_conv1](args = (%self_layer2_1_relu_2,), kwargs = {}) | |
%self_layer2_2_bn1 : [#users=1] = call_module[target=self_layer2_2_bn1](args = (%self_layer2_2_conv1,), kwargs = {}) | |
%self_layer2_2_relu : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn1,), kwargs = {}) | |
%self_layer2_2_conv2 : [#users=1] = call_module[target=self_layer2_2_conv2](args = (%self_layer2_2_relu,), kwargs = {}) | |
%self_layer2_2_bn2 : [#users=1] = call_module[target=self_layer2_2_bn2](args = (%self_layer2_2_conv2,), kwargs = {}) | |
%self_layer2_2_relu_1 : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn2,), kwargs = {}) | |
%self_layer2_2_conv3 : [#users=1] = call_module[target=self_layer2_2_conv3](args = (%self_layer2_2_relu_1,), kwargs = {}) | |
%self_layer2_2_bn3 : [#users=1] = call_module[target=self_layer2_2_bn3](args = (%self_layer2_2_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_2_bn3, %self_layer2_1_relu_2), kwargs = {}) | |
%self_layer2_2_relu_2 : [#users=2] = call_module[target=self_layer2_2_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer2_3_conv1 : [#users=1] = call_module[target=self_layer2_3_conv1](args = (%self_layer2_2_relu_2,), kwargs = {}) | |
%self_layer2_3_bn1 : [#users=1] = call_module[target=self_layer2_3_bn1](args = (%self_layer2_3_conv1,), kwargs = {}) | |
%self_layer2_3_relu : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn1,), kwargs = {}) | |
%self_layer2_3_conv2 : [#users=1] = call_module[target=self_layer2_3_conv2](args = (%self_layer2_3_relu,), kwargs = {}) | |
%self_layer2_3_bn2 : [#users=1] = call_module[target=self_layer2_3_bn2](args = (%self_layer2_3_conv2,), kwargs = {}) | |
%self_layer2_3_relu_1 : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn2,), kwargs = {}) | |
%self_layer2_3_conv3 : [#users=1] = call_module[target=self_layer2_3_conv3](args = (%self_layer2_3_relu_1,), kwargs = {}) | |
%self_layer2_3_bn3 : [#users=1] = call_module[target=self_layer2_3_bn3](args = (%self_layer2_3_conv3,), kwargs = {}) | |
%iadd_6 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_3_bn3, %self_layer2_2_relu_2), kwargs = {}) | |
%self_layer2_3_relu_2 : [#users=2] = call_module[target=self_layer2_3_relu](args = (%iadd_6,), kwargs = {}) | |
%self_layer3_0_conv1 : [#users=1] = call_module[target=self_layer3_0_conv1](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_bn1 : [#users=1] = call_module[target=self_layer3_0_bn1](args = (%self_layer3_0_conv1,), kwargs = {}) | |
%self_layer3_0_relu : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn1,), kwargs = {}) | |
%self_layer3_0_conv2 : [#users=1] = call_module[target=self_layer3_0_conv2](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_0_bn2 : [#users=1] = call_module[target=self_layer3_0_bn2](args = (%self_layer3_0_conv2,), kwargs = {}) | |
%self_layer3_0_relu_1 : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn2,), kwargs = {}) | |
%self_layer3_0_conv3 : [#users=1] = call_module[target=self_layer3_0_conv3](args = (%self_layer3_0_relu_1,), kwargs = {}) | |
%self_layer3_0_bn3 : [#users=1] = call_module[target=self_layer3_0_bn3](args = (%self_layer3_0_conv3,), kwargs = {}) | |
%self_layer3_0_downsample_0 : [#users=1] = call_module[target=self_layer3_0_downsample_0](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_downsample_1 : [#users=1] = call_module[target=self_layer3_0_downsample_1](args = (%self_layer3_0_downsample_0,), kwargs = {}) | |
%iadd_7 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_0_bn3, %self_layer3_0_downsample_1), kwargs = {}) | |
%self_layer3_0_relu_2 : [#users=2] = call_module[target=self_layer3_0_relu](args = (%iadd_7,), kwargs = {}) | |
%self_layer3_1_conv1 : [#users=1] = call_module[target=self_layer3_1_conv1](args = (%self_layer3_0_relu_2,), kwargs = {}) | |
%self_layer3_1_bn1 : [#users=1] = call_module[target=self_layer3_1_bn1](args = (%self_layer3_1_conv1,), kwargs = {}) | |
%self_layer3_1_relu : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn1,), kwargs = {}) | |
%self_layer3_1_conv2 : [#users=1] = call_module[target=self_layer3_1_conv2](args = (%self_layer3_1_relu,), kwargs = {}) | |
%self_layer3_1_bn2 : [#users=1] = call_module[target=self_layer3_1_bn2](args = (%self_layer3_1_conv2,), kwargs = {}) | |
%self_layer3_1_relu_1 : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn2,), kwargs = {}) | |
%self_layer3_1_conv3 : [#users=1] = call_module[target=self_layer3_1_conv3](args = (%self_layer3_1_relu_1,), kwargs = {}) | |
%self_layer3_1_bn3 : [#users=1] = call_module[target=self_layer3_1_bn3](args = (%self_layer3_1_conv3,), kwargs = {}) | |
%iadd_8 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_1_bn3, %self_layer3_0_relu_2), kwargs = {}) | |
%self_layer3_1_relu_2 : [#users=2] = call_module[target=self_layer3_1_relu](args = (%iadd_8,), kwargs = {}) | |
%self_layer3_2_conv1 : [#users=1] = call_module[target=self_layer3_2_conv1](args = (%self_layer3_1_relu_2,), kwargs = {}) | |
%self_layer3_2_bn1 : [#users=1] = call_module[target=self_layer3_2_bn1](args = (%self_layer3_2_conv1,), kwargs = {}) | |
%self_layer3_2_relu : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn1,), kwargs = {}) | |
%self_layer3_2_conv2 : [#users=1] = call_module[target=self_layer3_2_conv2](args = (%self_layer3_2_relu,), kwargs = {}) | |
%self_layer3_2_bn2 : [#users=1] = call_module[target=self_layer3_2_bn2](args = (%self_layer3_2_conv2,), kwargs = {}) | |
%self_layer3_2_relu_1 : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn2,), kwargs = {}) | |
%self_layer3_2_conv3 : [#users=1] = call_module[target=self_layer3_2_conv3](args = (%self_layer3_2_relu_1,), kwargs = {}) | |
%self_layer3_2_bn3 : [#users=1] = call_module[target=self_layer3_2_bn3](args = (%self_layer3_2_conv3,), kwargs = {}) | |
%iadd_9 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_2_bn3, %self_layer3_1_relu_2), kwargs = {}) | |
%self_layer3_2_relu_2 : [#users=2] = call_module[target=self_layer3_2_relu](args = (%iadd_9,), kwargs = {}) | |
%self_layer3_3_conv1 : [#users=1] = call_module[target=self_layer3_3_conv1](args = (%self_layer3_2_relu_2,), kwargs = {}) | |
%self_layer3_3_bn1 : [#users=1] = call_module[target=self_layer3_3_bn1](args = (%self_layer3_3_conv1,), kwargs = {}) | |
%self_layer3_3_relu : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn1,), kwargs = {}) | |
%self_layer3_3_conv2 : [#users=1] = call_module[target=self_layer3_3_conv2](args = (%self_layer3_3_relu,), kwargs = {}) | |
%self_layer3_3_bn2 : [#users=1] = call_module[target=self_layer3_3_bn2](args = (%self_layer3_3_conv2,), kwargs = {}) | |
%self_layer3_3_relu_1 : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn2,), kwargs = {}) | |
%self_layer3_3_conv3 : [#users=1] = call_module[target=self_layer3_3_conv3](args = (%self_layer3_3_relu_1,), kwargs = {}) | |
%self_layer3_3_bn3 : [#users=1] = call_module[target=self_layer3_3_bn3](args = (%self_layer3_3_conv3,), kwargs = {}) | |
%iadd_10 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_3_bn3, %self_layer3_2_relu_2), kwargs = {}) | |
%self_layer3_3_relu_2 : [#users=2] = call_module[target=self_layer3_3_relu](args = (%iadd_10,), kwargs = {}) | |
%self_layer3_4_conv1 : [#users=1] = call_module[target=self_layer3_4_conv1](args = (%self_layer3_3_relu_2,), kwargs = {}) | |
%self_layer3_4_bn1 : [#users=1] = call_module[target=self_layer3_4_bn1](args = (%self_layer3_4_conv1,), kwargs = {}) | |
%self_layer3_4_relu : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn1,), kwargs = {}) | |
%self_layer3_4_conv2 : [#users=1] = call_module[target=self_layer3_4_conv2](args = (%self_layer3_4_relu,), kwargs = {}) | |
%self_layer3_4_bn2 : [#users=1] = call_module[target=self_layer3_4_bn2](args = (%self_layer3_4_conv2,), kwargs = {}) | |
%self_layer3_4_relu_1 : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn2,), kwargs = {}) | |
%self_layer3_4_conv3 : [#users=1] = call_module[target=self_layer3_4_conv3](args = (%self_layer3_4_relu_1,), kwargs = {}) | |
%self_layer3_4_bn3 : [#users=1] = call_module[target=self_layer3_4_bn3](args = (%self_layer3_4_conv3,), kwargs = {}) | |
%iadd_11 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_4_bn3, %self_layer3_3_relu_2), kwargs = {}) | |
%self_layer3_4_relu_2 : [#users=2] = call_module[target=self_layer3_4_relu](args = (%iadd_11,), kwargs = {}) | |
%self_layer3_5_conv1 : [#users=1] = call_module[target=self_layer3_5_conv1](args = (%self_layer3_4_relu_2,), kwargs = {}) | |
%self_layer3_5_bn1 : [#users=1] = call_module[target=self_layer3_5_bn1](args = (%self_layer3_5_conv1,), kwargs = {}) | |
%self_layer3_5_relu : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn1,), kwargs = {}) | |
%self_layer3_5_conv2 : [#users=1] = call_module[target=self_layer3_5_conv2](args = (%self_layer3_5_relu,), kwargs = {}) | |
%self_layer3_5_bn2 : [#users=1] = call_module[target=self_layer3_5_bn2](args = (%self_layer3_5_conv2,), kwargs = {}) | |
%self_layer3_5_relu_1 : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn2,), kwargs = {}) | |
%self_layer3_5_conv3 : [#users=1] = call_module[target=self_layer3_5_conv3](args = (%self_layer3_5_relu_1,), kwargs = {}) | |
%self_layer3_5_bn3 : [#users=1] = call_module[target=self_layer3_5_bn3](args = (%self_layer3_5_conv3,), kwargs = {}) | |
%iadd_12 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_5_bn3, %self_layer3_4_relu_2), kwargs = {}) | |
%self_layer3_5_relu_2 : [#users=2] = call_module[target=self_layer3_5_relu](args = (%iadd_12,), kwargs = {}) | |
%self_layer4_0_conv1 : [#users=1] = call_module[target=self_layer4_0_conv1](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_bn1 : [#users=1] = call_module[target=self_layer4_0_bn1](args = (%self_layer4_0_conv1,), kwargs = {}) | |
%self_layer4_0_relu : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn1,), kwargs = {}) | |
%self_layer4_0_conv2 : [#users=1] = call_module[target=self_layer4_0_conv2](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_0_bn2 : [#users=1] = call_module[target=self_layer4_0_bn2](args = (%self_layer4_0_conv2,), kwargs = {}) | |
%self_layer4_0_relu_1 : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn2,), kwargs = {}) | |
%self_layer4_0_conv3 : [#users=1] = call_module[target=self_layer4_0_conv3](args = (%self_layer4_0_relu_1,), kwargs = {}) | |
%self_layer4_0_bn3 : [#users=1] = call_module[target=self_layer4_0_bn3](args = (%self_layer4_0_conv3,), kwargs = {}) | |
%self_layer4_0_downsample_0 : [#users=1] = call_module[target=self_layer4_0_downsample_0](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_downsample_1 : [#users=1] = call_module[target=self_layer4_0_downsample_1](args = (%self_layer4_0_downsample_0,), kwargs = {}) | |
%iadd_13 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_0_bn3, %self_layer4_0_downsample_1), kwargs = {}) | |
%self_layer4_0_relu_2 : [#users=2] = call_module[target=self_layer4_0_relu](args = (%iadd_13,), kwargs = {}) | |
%self_layer4_1_conv1 : [#users=1] = call_module[target=self_layer4_1_conv1](args = (%self_layer4_0_relu_2,), kwargs = {}) | |
%self_layer4_1_bn1 : [#users=1] = call_module[target=self_layer4_1_bn1](args = (%self_layer4_1_conv1,), kwargs = {}) | |
%self_layer4_1_relu : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn1,), kwargs = {}) | |
%self_layer4_1_conv2 : [#users=1] = call_module[target=self_layer4_1_conv2](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_1_bn2 : [#users=1] = call_module[target=self_layer4_1_bn2](args = (%self_layer4_1_conv2,), kwargs = {}) | |
%self_layer4_1_relu_1 : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn2,), kwargs = {}) | |
%self_layer4_1_conv3 : [#users=1] = call_module[target=self_layer4_1_conv3](args = (%self_layer4_1_relu_1,), kwargs = {}) | |
%self_layer4_1_bn3 : [#users=1] = call_module[target=self_layer4_1_bn3](args = (%self_layer4_1_conv3,), kwargs = {}) | |
%iadd_14 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_1_bn3, %self_layer4_0_relu_2), kwargs = {}) | |
%self_layer4_1_relu_2 : [#users=2] = call_module[target=self_layer4_1_relu](args = (%iadd_14,), kwargs = {}) | |
%self_layer4_2_conv1 : [#users=1] = call_module[target=self_layer4_2_conv1](args = (%self_layer4_1_relu_2,), kwargs = {}) | |
%self_layer4_2_bn1 : [#users=1] = call_module[target=self_layer4_2_bn1](args = (%self_layer4_2_conv1,), kwargs = {}) | |
%self_layer4_2_relu : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn1,), kwargs = {}) | |
%self_layer4_2_conv2 : [#users=1] = call_module[target=self_layer4_2_conv2](args = (%self_layer4_2_relu,), kwargs = {}) | |
%self_layer4_2_bn2 : [#users=1] = call_module[target=self_layer4_2_bn2](args = (%self_layer4_2_conv2,), kwargs = {}) | |
%self_layer4_2_relu_1 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn2,), kwargs = {}) | |
%self_layer4_2_conv3 : [#users=1] = call_module[target=self_layer4_2_conv3](args = (%self_layer4_2_relu_1,), kwargs = {}) | |
%self_layer4_2_bn3 : [#users=1] = call_module[target=self_layer4_2_bn3](args = (%self_layer4_2_conv3,), kwargs = {}) | |
%iadd_15 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_2_bn3, %self_layer4_1_relu_2), kwargs = {}) | |
%self_layer4_2_relu_2 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%iadd_15,), kwargs = {}) | |
%self_avgpool : [#users=1] = call_module[target=self_avgpool](args = (%self_layer4_2_relu_2,), kwargs = {}) | |
%flatten : [#users=1] = call_function[target=torch.flatten](args = (%self_avgpool, 1), kwargs = {}) | |
%self_fc : [#users=1] = call_module[target=self_fc](args = (%flatten,), kwargs = {}) | |
return (self_fc,) | |
---split graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
%submod_4 : [#users=1] = call_module[target=submod_4](args = (%submod_3,), kwargs = {}) | |
return (submod_4,) | |
--- graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
%submod_4 : [#users=1] = call_module[target=submod_4](args = (%submod_3,), kwargs = {}) | |
return (submod_4,) | |
---submod_0 graph--- | |
graph(): | |
%x : [#users=1] = placeholder[target=x] | |
%self_conv1 : [#users=1] = call_module[target=self_conv1](args = (%x,), kwargs = {}) | |
%self_bn1 : [#users=1] = call_module[target=self_bn1](args = (%self_conv1,), kwargs = {}) | |
%self_relu : [#users=1] = call_module[target=self_relu](args = (%self_bn1,), kwargs = {}) | |
%self_maxpool : [#users=2] = call_module[target=self_maxpool](args = (%self_relu,), kwargs = {}) | |
%self_layer1_0_conv1 : [#users=1] = call_module[target=self_layer1_0_conv1](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_bn1 : [#users=1] = call_module[target=self_layer1_0_bn1](args = (%self_layer1_0_conv1,), kwargs = {}) | |
%self_layer1_0_relu : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn1,), kwargs = {}) | |
%self_layer1_0_conv2 : [#users=1] = call_module[target=self_layer1_0_conv2](args = (%self_layer1_0_relu,), kwargs = {}) | |
%self_layer1_0_bn2 : [#users=1] = call_module[target=self_layer1_0_bn2](args = (%self_layer1_0_conv2,), kwargs = {}) | |
%self_layer1_0_relu_1 : [#users=1] = call_module[target=self_layer1_0_relu](args = (%self_layer1_0_bn2,), kwargs = {}) | |
%self_layer1_0_conv3 : [#users=1] = call_module[target=self_layer1_0_conv3](args = (%self_layer1_0_relu_1,), kwargs = {}) | |
%self_layer1_0_bn3 : [#users=1] = call_module[target=self_layer1_0_bn3](args = (%self_layer1_0_conv3,), kwargs = {}) | |
%self_layer1_0_downsample_0 : [#users=1] = call_module[target=self_layer1_0_downsample_0](args = (%self_maxpool,), kwargs = {}) | |
%self_layer1_0_downsample_1 : [#users=1] = call_module[target=self_layer1_0_downsample_1](args = (%self_layer1_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_0_bn3, %self_layer1_0_downsample_1), kwargs = {}) | |
%self_layer1_0_relu_2 : [#users=2] = call_module[target=self_layer1_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer1_1_conv1 : [#users=1] = call_module[target=self_layer1_1_conv1](args = (%self_layer1_0_relu_2,), kwargs = {}) | |
%self_layer1_1_bn1 : [#users=1] = call_module[target=self_layer1_1_bn1](args = (%self_layer1_1_conv1,), kwargs = {}) | |
%self_layer1_1_relu : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn1,), kwargs = {}) | |
%self_layer1_1_conv2 : [#users=1] = call_module[target=self_layer1_1_conv2](args = (%self_layer1_1_relu,), kwargs = {}) | |
%self_layer1_1_bn2 : [#users=1] = call_module[target=self_layer1_1_bn2](args = (%self_layer1_1_conv2,), kwargs = {}) | |
%self_layer1_1_relu_1 : [#users=1] = call_module[target=self_layer1_1_relu](args = (%self_layer1_1_bn2,), kwargs = {}) | |
%self_layer1_1_conv3 : [#users=1] = call_module[target=self_layer1_1_conv3](args = (%self_layer1_1_relu_1,), kwargs = {}) | |
%self_layer1_1_bn3 : [#users=1] = call_module[target=self_layer1_1_bn3](args = (%self_layer1_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_1_bn3, %self_layer1_0_relu_2), kwargs = {}) | |
%self_layer1_1_relu_2 : [#users=2] = call_module[target=self_layer1_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer1_2_conv1 : [#users=1] = call_module[target=self_layer1_2_conv1](args = (%self_layer1_1_relu_2,), kwargs = {}) | |
%self_layer1_2_bn1 : [#users=1] = call_module[target=self_layer1_2_bn1](args = (%self_layer1_2_conv1,), kwargs = {}) | |
%self_layer1_2_relu : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn1,), kwargs = {}) | |
%self_layer1_2_conv2 : [#users=1] = call_module[target=self_layer1_2_conv2](args = (%self_layer1_2_relu,), kwargs = {}) | |
%self_layer1_2_bn2 : [#users=1] = call_module[target=self_layer1_2_bn2](args = (%self_layer1_2_conv2,), kwargs = {}) | |
%self_layer1_2_relu_1 : [#users=1] = call_module[target=self_layer1_2_relu](args = (%self_layer1_2_bn2,), kwargs = {}) | |
%self_layer1_2_conv3 : [#users=1] = call_module[target=self_layer1_2_conv3](args = (%self_layer1_2_relu_1,), kwargs = {}) | |
%self_layer1_2_bn3 : [#users=1] = call_module[target=self_layer1_2_bn3](args = (%self_layer1_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer1_2_bn3, %self_layer1_1_relu_2), kwargs = {}) | |
%self_layer1_2_relu_2 : [#users=2] = call_module[target=self_layer1_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer2_0_conv1 : [#users=1] = call_module[target=self_layer2_0_conv1](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_bn1 : [#users=1] = call_module[target=self_layer2_0_bn1](args = (%self_layer2_0_conv1,), kwargs = {}) | |
%self_layer2_0_relu : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn1,), kwargs = {}) | |
%self_layer2_0_conv2 : [#users=1] = call_module[target=self_layer2_0_conv2](args = (%self_layer2_0_relu,), kwargs = {}) | |
%self_layer2_0_bn2 : [#users=1] = call_module[target=self_layer2_0_bn2](args = (%self_layer2_0_conv2,), kwargs = {}) | |
%self_layer2_0_relu_1 : [#users=1] = call_module[target=self_layer2_0_relu](args = (%self_layer2_0_bn2,), kwargs = {}) | |
%self_layer2_0_conv3 : [#users=1] = call_module[target=self_layer2_0_conv3](args = (%self_layer2_0_relu_1,), kwargs = {}) | |
%self_layer2_0_bn3 : [#users=1] = call_module[target=self_layer2_0_bn3](args = (%self_layer2_0_conv3,), kwargs = {}) | |
%self_layer2_0_downsample_0 : [#users=1] = call_module[target=self_layer2_0_downsample_0](args = (%self_layer1_2_relu_2,), kwargs = {}) | |
%self_layer2_0_downsample_1 : [#users=1] = call_module[target=self_layer2_0_downsample_1](args = (%self_layer2_0_downsample_0,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_0_bn3, %self_layer2_0_downsample_1), kwargs = {}) | |
%self_layer2_0_relu_2 : [#users=2] = call_module[target=self_layer2_0_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer2_1_conv1 : [#users=1] = call_module[target=self_layer2_1_conv1](args = (%self_layer2_0_relu_2,), kwargs = {}) | |
%self_layer2_1_bn1 : [#users=1] = call_module[target=self_layer2_1_bn1](args = (%self_layer2_1_conv1,), kwargs = {}) | |
%self_layer2_1_relu : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn1,), kwargs = {}) | |
%self_layer2_1_conv2 : [#users=1] = call_module[target=self_layer2_1_conv2](args = (%self_layer2_1_relu,), kwargs = {}) | |
%self_layer2_1_bn2 : [#users=1] = call_module[target=self_layer2_1_bn2](args = (%self_layer2_1_conv2,), kwargs = {}) | |
%self_layer2_1_relu_1 : [#users=1] = call_module[target=self_layer2_1_relu](args = (%self_layer2_1_bn2,), kwargs = {}) | |
%self_layer2_1_conv3 : [#users=1] = call_module[target=self_layer2_1_conv3](args = (%self_layer2_1_relu_1,), kwargs = {}) | |
%self_layer2_1_bn3 : [#users=1] = call_module[target=self_layer2_1_bn3](args = (%self_layer2_1_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_1_bn3, %self_layer2_0_relu_2), kwargs = {}) | |
%self_layer2_1_relu_2 : [#users=2] = call_module[target=self_layer2_1_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer2_2_conv1 : [#users=1] = call_module[target=self_layer2_2_conv1](args = (%self_layer2_1_relu_2,), kwargs = {}) | |
%self_layer2_2_bn1 : [#users=1] = call_module[target=self_layer2_2_bn1](args = (%self_layer2_2_conv1,), kwargs = {}) | |
%self_layer2_2_relu : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn1,), kwargs = {}) | |
%self_layer2_2_conv2 : [#users=1] = call_module[target=self_layer2_2_conv2](args = (%self_layer2_2_relu,), kwargs = {}) | |
%self_layer2_2_bn2 : [#users=1] = call_module[target=self_layer2_2_bn2](args = (%self_layer2_2_conv2,), kwargs = {}) | |
%self_layer2_2_relu_1 : [#users=1] = call_module[target=self_layer2_2_relu](args = (%self_layer2_2_bn2,), kwargs = {}) | |
%self_layer2_2_conv3 : [#users=1] = call_module[target=self_layer2_2_conv3](args = (%self_layer2_2_relu_1,), kwargs = {}) | |
%self_layer2_2_bn3 : [#users=1] = call_module[target=self_layer2_2_bn3](args = (%self_layer2_2_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_2_bn3, %self_layer2_1_relu_2), kwargs = {}) | |
%self_layer2_2_relu_2 : [#users=2] = call_module[target=self_layer2_2_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer2_3_conv1 : [#users=1] = call_module[target=self_layer2_3_conv1](args = (%self_layer2_2_relu_2,), kwargs = {}) | |
%self_layer2_3_bn1 : [#users=1] = call_module[target=self_layer2_3_bn1](args = (%self_layer2_3_conv1,), kwargs = {}) | |
%self_layer2_3_relu : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn1,), kwargs = {}) | |
%self_layer2_3_conv2 : [#users=1] = call_module[target=self_layer2_3_conv2](args = (%self_layer2_3_relu,), kwargs = {}) | |
%self_layer2_3_bn2 : [#users=1] = call_module[target=self_layer2_3_bn2](args = (%self_layer2_3_conv2,), kwargs = {}) | |
%self_layer2_3_relu_1 : [#users=1] = call_module[target=self_layer2_3_relu](args = (%self_layer2_3_bn2,), kwargs = {}) | |
%self_layer2_3_conv3 : [#users=1] = call_module[target=self_layer2_3_conv3](args = (%self_layer2_3_relu_1,), kwargs = {}) | |
%self_layer2_3_bn3 : [#users=1] = call_module[target=self_layer2_3_bn3](args = (%self_layer2_3_conv3,), kwargs = {}) | |
%iadd_6 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer2_3_bn3, %self_layer2_2_relu_2), kwargs = {}) | |
%self_layer2_3_relu_2 : [#users=2] = call_module[target=self_layer2_3_relu](args = (%iadd_6,), kwargs = {}) | |
%self_layer3_0_conv1 : [#users=1] = call_module[target=self_layer3_0_conv1](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_bn1 : [#users=1] = call_module[target=self_layer3_0_bn1](args = (%self_layer3_0_conv1,), kwargs = {}) | |
%self_layer3_0_relu : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn1,), kwargs = {}) | |
%self_layer3_0_conv2 : [#users=1] = call_module[target=self_layer3_0_conv2](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_0_bn2 : [#users=1] = call_module[target=self_layer3_0_bn2](args = (%self_layer3_0_conv2,), kwargs = {}) | |
%self_layer3_0_relu_1 : [#users=1] = call_module[target=self_layer3_0_relu](args = (%self_layer3_0_bn2,), kwargs = {}) | |
%self_layer3_0_conv3 : [#users=1] = call_module[target=self_layer3_0_conv3](args = (%self_layer3_0_relu_1,), kwargs = {}) | |
%self_layer3_0_bn3 : [#users=1] = call_module[target=self_layer3_0_bn3](args = (%self_layer3_0_conv3,), kwargs = {}) | |
return (self_layer2_3_relu_2, self_layer3_0_bn3) | |
---submod_1 graph--- | |
graph(): | |
%self_layer2_3_relu_2 : [#users=1] = placeholder[target=self_layer2_3_relu_2] | |
%self_layer3_0_bn3 : [#users=1] = placeholder[target=self_layer3_0_bn3] | |
%self_layer3_0_downsample_0 : [#users=1] = call_module[target=self_layer3_0_downsample_0](args = (%self_layer2_3_relu_2,), kwargs = {}) | |
%self_layer3_0_downsample_1 : [#users=1] = call_module[target=self_layer3_0_downsample_1](args = (%self_layer3_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_0_bn3, %self_layer3_0_downsample_1), kwargs = {}) | |
%self_layer3_0_relu : [#users=2] = call_module[target=self_layer3_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer3_1_conv1 : [#users=1] = call_module[target=self_layer3_1_conv1](args = (%self_layer3_0_relu,), kwargs = {}) | |
%self_layer3_1_bn1 : [#users=1] = call_module[target=self_layer3_1_bn1](args = (%self_layer3_1_conv1,), kwargs = {}) | |
%self_layer3_1_relu : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn1,), kwargs = {}) | |
%self_layer3_1_conv2 : [#users=1] = call_module[target=self_layer3_1_conv2](args = (%self_layer3_1_relu,), kwargs = {}) | |
%self_layer3_1_bn2 : [#users=1] = call_module[target=self_layer3_1_bn2](args = (%self_layer3_1_conv2,), kwargs = {}) | |
%self_layer3_1_relu_1 : [#users=1] = call_module[target=self_layer3_1_relu](args = (%self_layer3_1_bn2,), kwargs = {}) | |
%self_layer3_1_conv3 : [#users=1] = call_module[target=self_layer3_1_conv3](args = (%self_layer3_1_relu_1,), kwargs = {}) | |
%self_layer3_1_bn3 : [#users=1] = call_module[target=self_layer3_1_bn3](args = (%self_layer3_1_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_1_bn3, %self_layer3_0_relu), kwargs = {}) | |
%self_layer3_1_relu_2 : [#users=2] = call_module[target=self_layer3_1_relu](args = (%iadd_1,), kwargs = {}) | |
%self_layer3_2_conv1 : [#users=1] = call_module[target=self_layer3_2_conv1](args = (%self_layer3_1_relu_2,), kwargs = {}) | |
%self_layer3_2_bn1 : [#users=1] = call_module[target=self_layer3_2_bn1](args = (%self_layer3_2_conv1,), kwargs = {}) | |
%self_layer3_2_relu : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn1,), kwargs = {}) | |
%self_layer3_2_conv2 : [#users=1] = call_module[target=self_layer3_2_conv2](args = (%self_layer3_2_relu,), kwargs = {}) | |
%self_layer3_2_bn2 : [#users=1] = call_module[target=self_layer3_2_bn2](args = (%self_layer3_2_conv2,), kwargs = {}) | |
%self_layer3_2_relu_1 : [#users=1] = call_module[target=self_layer3_2_relu](args = (%self_layer3_2_bn2,), kwargs = {}) | |
%self_layer3_2_conv3 : [#users=1] = call_module[target=self_layer3_2_conv3](args = (%self_layer3_2_relu_1,), kwargs = {}) | |
%self_layer3_2_bn3 : [#users=1] = call_module[target=self_layer3_2_bn3](args = (%self_layer3_2_conv3,), kwargs = {}) | |
%iadd_2 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_2_bn3, %self_layer3_1_relu_2), kwargs = {}) | |
%self_layer3_2_relu_2 : [#users=2] = call_module[target=self_layer3_2_relu](args = (%iadd_2,), kwargs = {}) | |
%self_layer3_3_conv1 : [#users=1] = call_module[target=self_layer3_3_conv1](args = (%self_layer3_2_relu_2,), kwargs = {}) | |
%self_layer3_3_bn1 : [#users=1] = call_module[target=self_layer3_3_bn1](args = (%self_layer3_3_conv1,), kwargs = {}) | |
%self_layer3_3_relu : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn1,), kwargs = {}) | |
%self_layer3_3_conv2 : [#users=1] = call_module[target=self_layer3_3_conv2](args = (%self_layer3_3_relu,), kwargs = {}) | |
%self_layer3_3_bn2 : [#users=1] = call_module[target=self_layer3_3_bn2](args = (%self_layer3_3_conv2,), kwargs = {}) | |
%self_layer3_3_relu_1 : [#users=1] = call_module[target=self_layer3_3_relu](args = (%self_layer3_3_bn2,), kwargs = {}) | |
%self_layer3_3_conv3 : [#users=1] = call_module[target=self_layer3_3_conv3](args = (%self_layer3_3_relu_1,), kwargs = {}) | |
%self_layer3_3_bn3 : [#users=1] = call_module[target=self_layer3_3_bn3](args = (%self_layer3_3_conv3,), kwargs = {}) | |
%iadd_3 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_3_bn3, %self_layer3_2_relu_2), kwargs = {}) | |
%self_layer3_3_relu_2 : [#users=2] = call_module[target=self_layer3_3_relu](args = (%iadd_3,), kwargs = {}) | |
%self_layer3_4_conv1 : [#users=1] = call_module[target=self_layer3_4_conv1](args = (%self_layer3_3_relu_2,), kwargs = {}) | |
%self_layer3_4_bn1 : [#users=1] = call_module[target=self_layer3_4_bn1](args = (%self_layer3_4_conv1,), kwargs = {}) | |
%self_layer3_4_relu : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn1,), kwargs = {}) | |
%self_layer3_4_conv2 : [#users=1] = call_module[target=self_layer3_4_conv2](args = (%self_layer3_4_relu,), kwargs = {}) | |
%self_layer3_4_bn2 : [#users=1] = call_module[target=self_layer3_4_bn2](args = (%self_layer3_4_conv2,), kwargs = {}) | |
%self_layer3_4_relu_1 : [#users=1] = call_module[target=self_layer3_4_relu](args = (%self_layer3_4_bn2,), kwargs = {}) | |
%self_layer3_4_conv3 : [#users=1] = call_module[target=self_layer3_4_conv3](args = (%self_layer3_4_relu_1,), kwargs = {}) | |
%self_layer3_4_bn3 : [#users=1] = call_module[target=self_layer3_4_bn3](args = (%self_layer3_4_conv3,), kwargs = {}) | |
%iadd_4 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_4_bn3, %self_layer3_3_relu_2), kwargs = {}) | |
%self_layer3_4_relu_2 : [#users=2] = call_module[target=self_layer3_4_relu](args = (%iadd_4,), kwargs = {}) | |
%self_layer3_5_conv1 : [#users=1] = call_module[target=self_layer3_5_conv1](args = (%self_layer3_4_relu_2,), kwargs = {}) | |
%self_layer3_5_bn1 : [#users=1] = call_module[target=self_layer3_5_bn1](args = (%self_layer3_5_conv1,), kwargs = {}) | |
%self_layer3_5_relu : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn1,), kwargs = {}) | |
%self_layer3_5_conv2 : [#users=1] = call_module[target=self_layer3_5_conv2](args = (%self_layer3_5_relu,), kwargs = {}) | |
%self_layer3_5_bn2 : [#users=1] = call_module[target=self_layer3_5_bn2](args = (%self_layer3_5_conv2,), kwargs = {}) | |
%self_layer3_5_relu_1 : [#users=1] = call_module[target=self_layer3_5_relu](args = (%self_layer3_5_bn2,), kwargs = {}) | |
%self_layer3_5_conv3 : [#users=1] = call_module[target=self_layer3_5_conv3](args = (%self_layer3_5_relu_1,), kwargs = {}) | |
%self_layer3_5_bn3 : [#users=1] = call_module[target=self_layer3_5_bn3](args = (%self_layer3_5_conv3,), kwargs = {}) | |
%iadd_5 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer3_5_bn3, %self_layer3_4_relu_2), kwargs = {}) | |
%self_layer3_5_relu_2 : [#users=2] = call_module[target=self_layer3_5_relu](args = (%iadd_5,), kwargs = {}) | |
%self_layer4_0_conv1 : [#users=1] = call_module[target=self_layer4_0_conv1](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_bn1 : [#users=1] = call_module[target=self_layer4_0_bn1](args = (%self_layer4_0_conv1,), kwargs = {}) | |
%self_layer4_0_relu : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn1,), kwargs = {}) | |
return (self_layer4_0_relu, self_layer3_5_relu_2) | |
---submod_2 graph--- | |
graph(): | |
%self_layer4_0_relu : [#users=1] = placeholder[target=self_layer4_0_relu] | |
%self_layer3_5_relu_2 : [#users=1] = placeholder[target=self_layer3_5_relu_2] | |
%self_layer4_0_conv2 : [#users=1] = call_module[target=self_layer4_0_conv2](args = (%self_layer4_0_relu,), kwargs = {}) | |
%self_layer4_0_bn2 : [#users=1] = call_module[target=self_layer4_0_bn2](args = (%self_layer4_0_conv2,), kwargs = {}) | |
%self_layer4_0_relu_1 : [#users=1] = call_module[target=self_layer4_0_relu](args = (%self_layer4_0_bn2,), kwargs = {}) | |
%self_layer4_0_conv3 : [#users=1] = call_module[target=self_layer4_0_conv3](args = (%self_layer4_0_relu_1,), kwargs = {}) | |
%self_layer4_0_bn3 : [#users=1] = call_module[target=self_layer4_0_bn3](args = (%self_layer4_0_conv3,), kwargs = {}) | |
%self_layer4_0_downsample_0 : [#users=1] = call_module[target=self_layer4_0_downsample_0](args = (%self_layer3_5_relu_2,), kwargs = {}) | |
%self_layer4_0_downsample_1 : [#users=1] = call_module[target=self_layer4_0_downsample_1](args = (%self_layer4_0_downsample_0,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_0_bn3, %self_layer4_0_downsample_1), kwargs = {}) | |
%self_layer4_0_relu_2 : [#users=2] = call_module[target=self_layer4_0_relu](args = (%iadd,), kwargs = {}) | |
%self_layer4_1_conv1 : [#users=1] = call_module[target=self_layer4_1_conv1](args = (%self_layer4_0_relu_2,), kwargs = {}) | |
%self_layer4_1_bn1 : [#users=1] = call_module[target=self_layer4_1_bn1](args = (%self_layer4_1_conv1,), kwargs = {}) | |
%self_layer4_1_relu : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn1,), kwargs = {}) | |
return (self_layer4_1_relu, self_layer4_0_relu_2) | |
---submod_3 graph--- | |
graph(): | |
%self_layer4_1_relu : [#users=1] = placeholder[target=self_layer4_1_relu] | |
%self_layer4_0_relu_2 : [#users=1] = placeholder[target=self_layer4_0_relu_2] | |
%self_layer4_1_conv2 : [#users=1] = call_module[target=self_layer4_1_conv2](args = (%self_layer4_1_relu,), kwargs = {}) | |
%self_layer4_1_bn2 : [#users=1] = call_module[target=self_layer4_1_bn2](args = (%self_layer4_1_conv2,), kwargs = {}) | |
%self_layer4_1_relu_1 : [#users=1] = call_module[target=self_layer4_1_relu](args = (%self_layer4_1_bn2,), kwargs = {}) | |
%self_layer4_1_conv3 : [#users=1] = call_module[target=self_layer4_1_conv3](args = (%self_layer4_1_relu_1,), kwargs = {}) | |
%self_layer4_1_bn3 : [#users=1] = call_module[target=self_layer4_1_bn3](args = (%self_layer4_1_conv3,), kwargs = {}) | |
%iadd : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_1_bn3, %self_layer4_0_relu_2), kwargs = {}) | |
%self_layer4_1_relu_2 : [#users=2] = call_module[target=self_layer4_1_relu](args = (%iadd,), kwargs = {}) | |
%self_layer4_2_conv1 : [#users=1] = call_module[target=self_layer4_2_conv1](args = (%self_layer4_1_relu_2,), kwargs = {}) | |
%self_layer4_2_bn1 : [#users=1] = call_module[target=self_layer4_2_bn1](args = (%self_layer4_2_conv1,), kwargs = {}) | |
%self_layer4_2_relu : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn1,), kwargs = {}) | |
%self_layer4_2_conv2 : [#users=1] = call_module[target=self_layer4_2_conv2](args = (%self_layer4_2_relu,), kwargs = {}) | |
%self_layer4_2_bn2 : [#users=1] = call_module[target=self_layer4_2_bn2](args = (%self_layer4_2_conv2,), kwargs = {}) | |
%self_layer4_2_relu_1 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%self_layer4_2_bn2,), kwargs = {}) | |
%self_layer4_2_conv3 : [#users=1] = call_module[target=self_layer4_2_conv3](args = (%self_layer4_2_relu_1,), kwargs = {}) | |
%self_layer4_2_bn3 : [#users=1] = call_module[target=self_layer4_2_bn3](args = (%self_layer4_2_conv3,), kwargs = {}) | |
%iadd_1 : [#users=1] = call_function[target=operator.iadd](args = (%self_layer4_2_bn3, %self_layer4_1_relu_2), kwargs = {}) | |
%self_layer4_2_relu_2 : [#users=1] = call_module[target=self_layer4_2_relu](args = (%iadd_1,), kwargs = {}) | |
%self_avgpool : [#users=1] = call_module[target=self_avgpool](args = (%self_layer4_2_relu_2,), kwargs = {}) | |
%flatten : [#users=1] = call_function[target=torch.flatten](args = (%self_avgpool, 1), kwargs = {}) | |
return flatten | |
---submod_4 graph--- | |
graph(): | |
%flatten : [#users=1] = placeholder[target=flatten] | |
%self_fc : [#users=1] = call_module[target=self_fc](args = (%flatten,), kwargs = {}) | |
return self_fc | |
--------------- | |
run_node placeholder, x got args tuple() | |
run_node call_module, submod_0 got args tuple(T[torch.Size([32, 3, 224, 224])]) | |
still waiting for process to exit | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 28, 28])], T[torch.Size([32, 1024, 14, 14])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 28, 28])], T[torch.Size([32, 1024, 14, 14])]), 1) | |
run_node call_module, submod_1 got args tuple(T[torch.Size([32, 512, 28, 28])], T[torch.Size([32, 1024, 14, 14])]) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 14, 14])], T[torch.Size([32, 1024, 14, 14])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 14, 14])], T[torch.Size([32, 1024, 14, 14])]), 1) | |
run_node call_module, submod_2 got args tuple(T[torch.Size([32, 512, 14, 14])], T[torch.Size([32, 1024, 14, 14])]) | |
didn't get the result, queue empty | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 2048, 7, 7])]), 0) | |
run_node call_function, <built-in function getitem> got args tuple(tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 2048, 7, 7])]), 1) | |
run_node call_module, submod_3 got args tuple(T[torch.Size([32, 512, 7, 7])], T[torch.Size([32, 2048, 7, 7])]) | |
run_node call_module, submod_4 got args tuple(T[torch.Size([32, 2048])]) | |
run_node output, output got args tuple(tuple(T[torch.Size([32, 1000])])) | |
---final graph--- | |
graph(): | |
%x : torch.Tensor [#users=1] = placeholder[target=x] | |
%submod_0 : [#users=2] = call_module[target=compiled_submod_0](args = (%x,), kwargs = {}) | |
%getitem : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 0), kwargs = {}) | |
%getitem_1 : [#users=1] = call_function[target=operator.getitem](args = (%submod_0, 1), kwargs = {}) | |
%submod_1 : [#users=2] = call_module[target=compiled_submod_1](args = (%getitem, %getitem_1), kwargs = {}) | |
%getitem_2 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 0), kwargs = {}) | |
%getitem_3 : [#users=1] = call_function[target=operator.getitem](args = (%submod_1, 1), kwargs = {}) | |
%submod_2 : [#users=2] = call_module[target=compiled_submod_2](args = (%getitem_2, %getitem_3), kwargs = {}) | |
%getitem_4 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 0), kwargs = {}) | |
%getitem_5 : [#users=1] = call_function[target=operator.getitem](args = (%submod_2, 1), kwargs = {}) | |
%submod_3 : [#users=1] = call_module[target=compiled_submod_3](args = (%getitem_4, %getitem_5), kwargs = {}) | |
%submod_4 : [#users=1] = call_module[target=compiled_submod_4](args = (%submod_3,), kwargs = {}) | |
return (submod_4,) | |
--------------- | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
still waiting for process to exit | |
didn't get the result, queue empty | |
result {'latency_median': 73.5912971496582, 'latency_stdev': 0.5808285154412532} | |
has been placed into the queue | |
got exit 0 | |
got the result! | |
exit code: 0 and result: {'nodes': 1, 'model_name': 'torchbenchmark.models.resnet50.Model', 'backend': 'torchdynamo_inductor', 'has_breaks': True, 'static_graph': False, 'result': {'latency_median': 73.5912971496582, 'latency_stdev': 0.5808285154412532}} | |
<RESULT>{"nodes": 1, "model_name": "torchbenchmark.models.resnet50.Model", "backend": "torchdynamo_inductor", "has_breaks": true, "static_graph": false, "result": {"latency_median": 73.5912971496582, "latency_stdev": 0.5808285154412532}}</RESULT> | |
submitit INFO (2022-10-26 23:18:06,460) - Job completed successfully |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
These are resnet50 results.
The failing logs come from https://github.com/pytorch/pytorch/tree/davidberard98/repro-dynamo-ddp-resnet-issue
The success logs come from viable/strict as of Oct 26, 2022 afternoon.
The logs contain results from 3 runs:
The logs contain DDPOptimizer(debug=True) i.e. debug turned on for DDPOptimizer. In the failing logs, we get an error about mutated views coming from inductor.
But we can see by comparing the fail and success logs that the graph splits are the same (you can check with vimdiff)