Created
September 20, 2025 05:01
-
-
Save AmosLewis/5310028e88359f84da4a19f0949879fa to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
root@smci350-zts-gtu-c8-25:/mlperf/harness# /opt/rocm/bin/rocminfo | |
ROCk module version 6.14.14 is loaded | |
===================== | |
HSA System Attributes | |
===================== | |
Runtime Version: 1.18 | |
Runtime Ext Version: 1.11 | |
System Timestamp Freq.: 1000.000000MHz | |
Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) | |
Machine Model: LARGE | |
System Endianness: LITTLE | |
Mwaitx: DISABLED | |
XNACK enabled: NO | |
DMAbuf Support: YES | |
VMM Support: YES | |
========== | |
HSA Agents | |
========== | |
******* | |
Agent 1 | |
******* | |
Name: AMD EPYC 9575F 64-Core Processor | |
Uuid: CPU-XX | |
Marketing Name: AMD EPYC 9575F 64-Core Processor | |
Vendor Name: CPU | |
Feature: None specified | |
Profile: FULL_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 0(0x0) | |
Queue Min Size: 0(0x0) | |
Queue Max Size: 0(0x0) | |
Queue Type: MULTI | |
Node: 0 | |
Device Type: CPU | |
Cache Info: | |
L1: 49152(0xc000) KB | |
Chip ID: 0(0x0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 64(0x40) | |
Max Clock Freq. (MHz): 3300 | |
BDFID: 0 | |
Internal Node ID: 0 | |
Compute Unit: 128 | |
SIMDs per CU: 0 | |
Shader Engines: 0 | |
Shader Arrs. per Eng.: 0 | |
WatchPts on Addr. Ranges:1 | |
Memory Properties: | |
Features: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 1584921340(0x5e77fafc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 1584921340(0x5e77fafc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED | |
Size: 1584921340(0x5e77fafc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
Pool 4 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 1584921340(0x5e77fafc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
ISA Info: | |
******* | |
Agent 2 | |
******* | |
Name: AMD EPYC 9575F 64-Core Processor | |
Uuid: CPU-XX | |
Marketing Name: AMD EPYC 9575F 64-Core Processor | |
Vendor Name: CPU | |
Feature: None specified | |
Profile: FULL_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 0(0x0) | |
Queue Min Size: 0(0x0) | |
Queue Max Size: 0(0x0) | |
Queue Type: MULTI | |
Node: 1 | |
Device Type: CPU | |
Cache Info: | |
L1: 49152(0xc000) KB | |
Chip ID: 0(0x0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 64(0x40) | |
Max Clock Freq. (MHz): 3300 | |
BDFID: 0 | |
Internal Node ID: 1 | |
Compute Unit: 128 | |
SIMDs per CU: 0 | |
Shader Engines: 0 | |
Shader Arrs. per Eng.: 0 | |
WatchPts on Addr. Ranges:1 | |
Memory Properties: | |
Features: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 1585342172(0x5e7e66dc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 1585342172(0x5e7e66dc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED | |
Size: 1585342172(0x5e7e66dc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
Pool 4 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 1585342172(0x5e7e66dc) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:4KB | |
Alloc Alignment: 4KB | |
Accessible by all: TRUE | |
ISA Info: | |
******* | |
Agent 3 | |
******* | |
Name: gfx950 | |
Uuid: GPU-94503405327fb12f | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 2 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 29952 | |
Internal Node ID: 2 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 4 | |
******* | |
Name: gfx950 | |
Uuid: GPU-a60286fd5b49e048 | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 3 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 1280 | |
Internal Node ID: 3 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 5 | |
******* | |
Name: gfx950 | |
Uuid: GPU-3407ac3e8b8847f4 | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 4 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 25856 | |
Internal Node ID: 4 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 6 | |
******* | |
Name: gfx950 | |
Uuid: GPU-8cc58eddfea518e5 | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 5 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 5376 | |
Internal Node ID: 5 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 7 | |
******* | |
Name: gfx950 | |
Uuid: GPU-21cde581a488eb81 | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 6 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 62720 | |
Internal Node ID: 6 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 8 | |
******* | |
Name: gfx950 | |
Uuid: GPU-15015787f6cc7b2a | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 7 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 34048 | |
Internal Node ID: 7 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 9 | |
******* | |
Name: gfx950 | |
Uuid: GPU-47c0bcdf4b861d6f | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 8 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 58624 | |
Internal Node ID: 8 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
******* | |
Agent 10 | |
******* | |
Name: gfx950 | |
Uuid: GPU-febddeaa8b851d07 | |
Marketing Name: | |
Vendor Name: AMD | |
Feature: KERNEL_DISPATCH | |
Profile: BASE_PROFILE | |
Float Round Mode: NEAR | |
Max Queue Number: 128(0x80) | |
Queue Min Size: 64(0x40) | |
Queue Max Size: 131072(0x20000) | |
Queue Type: MULTI | |
Node: 9 | |
Device Type: GPU | |
Cache Info: | |
L1: 32(0x20) KB | |
L2: 4096(0x1000) KB | |
L3: 262144(0x40000) KB | |
Chip ID: 30112(0x75a0) | |
ASIC Revision: 0(0x0) | |
Cacheline Size: 128(0x80) | |
Max Clock Freq. (MHz): 2200 | |
BDFID: 38144 | |
Internal Node ID: 9 | |
Compute Unit: 256 | |
SIMDs per CU: 4 | |
Shader Engines: 32 | |
Shader Arrs. per Eng.: 1 | |
WatchPts on Addr. Ranges:4 | |
Coherent Host Access: FALSE | |
Memory Properties: | |
Features: KERNEL_DISPATCH | |
Fast F16 Operation: TRUE | |
Wavefront Size: 64(0x40) | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Max Waves Per CU: 32(0x20) | |
Max Work-item Per CU: 2048(0x800) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
Max fbarriers/Workgrp: 32 | |
Packet Processor uCode:: 30 | |
SDMA engine uCode:: 12 | |
IOMMU Support:: None | |
Pool Info: | |
Pool 1 | |
Segment: GLOBAL; FLAGS: COARSE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 2 | |
Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 3 | |
Segment: GLOBAL; FLAGS: FINE GRAINED | |
Size: 301973504(0x11ffc000) KB | |
Allocatable: TRUE | |
Alloc Granule: 4KB | |
Alloc Recommended Granule:2048KB | |
Alloc Alignment: 4KB | |
Accessible by all: FALSE | |
Pool 4 | |
Segment: GROUP | |
Size: 160(0xa0) KB | |
Allocatable: FALSE | |
Alloc Granule: 0KB | |
Alloc Recommended Granule:0KB | |
Alloc Alignment: 0KB | |
Accessible by all: FALSE | |
ISA Info: | |
ISA 1 | |
Name: amdgcn-amd-amdhsa--gfx950:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
ISA 2 | |
Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- | |
Machine Models: HSA_MACHINE_MODEL_LARGE | |
Profiles: HSA_PROFILE_BASE | |
Default Rounding Mode: NEAR | |
Default Rounding Mode: NEAR | |
Fast f16: TRUE | |
Workgroup Max Size: 1024(0x400) | |
Workgroup Max Size per Dimension: | |
x 1024(0x400) | |
y 1024(0x400) | |
z 1024(0x400) | |
Grid Max Size: 4294967295(0xffffffff) | |
Grid Max Size per Dimension: | |
x 2147483647(0x7fffffff) | |
y 65535(0xffff) | |
z 65535(0xffff) | |
FBarrier Max Size: 32 | |
*** Done *** |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment