Created
March 29, 2023 17:09
-
-
Save gengwg/3520df2ebf8872e5ee4818367e3a4122 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
# ./deviceQuery | |
./deviceQuery Starting... | |
CUDA Device Query (Runtime API) version (CUDART static linking) | |
Detected 8 CUDA Capable device(s) | |
Device 0: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 7 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 1: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 15 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 2: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 71 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 3: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 78 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 4: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 135 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 5: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 144 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 6: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 183 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
Device 7: "NVIDIA A100-SXM4-80GB" | |
CUDA Driver Version / Runtime Version 11.4 / 11.4 | |
CUDA Capability Major/Minor version number: 8.0 | |
Total amount of global memory: 81251 MBytes (85198045184 bytes) | |
(108) Multiprocessors, (064) CUDA Cores/MP: 6912 CUDA Cores | |
GPU Max Clock rate: 1410 MHz (1.41 GHz) | |
Memory Clock rate: 1593 Mhz | |
Memory Bus Width: 5120-bit | |
L2 Cache Size: 41943040 bytes | |
Maximum Texture Dimension Size (x,y,z) 1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384) | |
Maximum Layered 1D Texture Size, (num) layers 1D=(32768), 2048 layers | |
Maximum Layered 2D Texture Size, (num) layers 2D=(32768, 32768), 2048 layers | |
Total amount of constant memory: 65536 bytes | |
Total amount of shared memory per block: 49152 bytes | |
Total shared memory per multiprocessor: 167936 bytes | |
Total number of registers available per block: 65536 | |
Warp size: 32 | |
Maximum number of threads per multiprocessor: 2048 | |
Maximum number of threads per block: 1024 | |
Max dimension size of a thread block (x,y,z): (1024, 1024, 64) | |
Max dimension size of a grid size (x,y,z): (2147483647, 65535, 65535) | |
Maximum memory pitch: 2147483647 bytes | |
Texture alignment: 512 bytes | |
Concurrent copy and kernel execution: Yes with 3 copy engine(s) | |
Run time limit on kernels: No | |
Integrated GPU sharing Host Memory: No | |
Support host page-locked memory mapping: Yes | |
Alignment requirement for Surfaces: Yes | |
Device has ECC support: Enabled | |
Device supports Unified Addressing (UVA): Yes | |
Device supports Managed Memory: Yes | |
Device supports Compute Preemption: Yes | |
Supports Cooperative Kernel Launch: Yes | |
Supports MultiDevice Co-op Kernel Launch: Yes | |
Device PCI Domain ID / Bus ID / location ID: 0 / 189 / 0 | |
Compute Mode: | |
< Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) > | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU0) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU1) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU2) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU3) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU4) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU5) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU6) -> NVIDIA A100-SXM4-80GB (GPU7) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU0) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU1) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU2) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU3) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU4) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU5) : Yes | |
> Peer access from NVIDIA A100-SXM4-80GB (GPU7) -> NVIDIA A100-SXM4-80GB (GPU6) : Yes | |
deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 11.4, CUDA Runtime Version = 11.4, NumDevs = 8 | |
Result = PASS |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment