Experimental effort to run ROCm + PyTorch on a gfx1201 GPU (AMD Radeon RX 9070 XT /
AI PRO R9700, RDNA4, PCI 0x1002:0x7551) through a shared, firmware-light lite::
ROCr backend that programs the GPU's compute queue directly from userspace — on three
OSes that lack the usual ROCm/KFD kernel path:
- macOS (Apple Silicon) — AMD eGPU over Thunderbolt, via a DriverKit DEXT (no kernel driver).
- Linux (x86) —
amdgpu_liteminimal kernel shim + a userspace bring-up. - Windows — userspace
D3DKMTEscapebackend over the productionamdgpu_wddmKMD; compute shader dispatch working on gfx1201.
