UPDATE: A faster (20x) approach for running Stable Diffusion using MLIR/Vulkan/IREE is available on Windows:
```
conda create --name sd39 python=3.9 -y
conda activate sd39
pip install diffusers==0.3.0
pip install transformers
pip install onnxruntime
pip install onnx
```
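If you want to sanity-check the environment before moving on, a quick import test (a minimal sketch; the versions printed depend on what pip resolved) is:

```python
# Verify the core packages import cleanly and print their versions.
import diffusers
import onnx
import transformers

print("diffusers:", diffusers.__version__)
print("transformers:", transformers.__version__)
print("onnx:", onnx.__version__)
```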
You can download the nightly onnxruntime-directml release from the link below. Run `python --version` to find out which `.whl` file to download:
- If you are on Python 3.7, download the file that ends with `-cp37-cp37m-win_amd64.whl`.
- If you are on Python 3.8, download the file that ends with `-cp38-cp38-win_amd64.whl`.
- and so on for other versions; the snippet below shows a quick way to print your interpreter's tag.
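If you are unsure which tag matches your interpreter, a quick standard-library check (a minimal sketch, not part of the original instructions) is:

```python
# Print the CPython wheel tag of the current interpreter,
# e.g. "cp39" for Python 3.9.
import sys

print(f"cp{sys.version_info.major}{sys.version_info.minor}")
```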
Then install the downloaded wheel (adjust the filename to match your download):

```
pip install ort_nightly_directml-1.13.0.dev20220908001-cp39-cp39-win_amd64.whl --force-reinstall
```
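After installing, you can confirm that the DirectML execution provider is available; `get_available_providers()` is part of the public onnxruntime API:

```python
# "DmlExecutionProvider" should appear in this list for the
# onnxruntime-directml nightly build.
import onnxruntime as ort

print(ort.get_available_providers())
```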
This approach is faster than downloading the ONNX model files.
- Download `diffusers/scripts/convert_stable_diffusion_checkpoint_to_onnx.py` to your working directory. You can use the command below to download the script.

```
wget https://raw.githubusercontent.com/huggingface/diffusers/main/scripts/convert_stable_diffusion_checkpoint_to_onnx.py
```
- Run `huggingface-cli.exe login` and provide your Hugging Face access token.
- Convert the model using the command below. The models are stored in the `stable_diffusion_onnx` folder.
```
python convert_stable_diffusion_checkpoint_to_onnx.py --model_path="CompVis/stable-diffusion-v1-4" --output_path="./stable_diffusion_onnx"
```
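To sanity-check the export, you can list the output folder. The exact subfolder names depend on the diffusers version, so treat this as a quick inspection rather than a guaranteed layout:

```python
# The conversion script writes one subfolder per model component
# (text encoder, UNet, VAE decoder, etc.).
import os

print(sorted(os.listdir("./stable_diffusion_onnx")))
```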
Here is example Python code for the Stable Diffusion pipeline using Hugging Face diffusers:
```python
from diffusers import StableDiffusionOnnxPipeline

# Load the converted ONNX model and run it on the DirectML execution provider.
pipe = StableDiffusionOnnxPipeline.from_pretrained("./stable_diffusion_onnx", provider="DmlExecutionProvider")

prompt = "a photo of an astronaut riding a horse on mars"
image = pipe(prompt).images[0]
image.save("astronaut_rides_horse.png")
```
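The pipeline call also accepts the usual generation knobs. The parameters below follow the standard diffusers pipeline signature; the values are illustrative defaults, not tuned settings:

```python
# Fewer denoising steps run faster at some cost in quality;
# guidance_scale controls how strongly the image follows the prompt.
image = pipe(
    prompt,
    num_inference_steps=50,
    guidance_scale=7.5,
).images[0]
```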
FYI, @harishanand95 is documenting how to use IREE (https://iree-org.github.io/iree/) through the Vulkan API to run Stable Diffusion text->image. We expect to release the instructions next week. In our tests, this alternative toolchain runs >10x faster than ONNX RT->DirectML for text->image, and Nod.ai is also working to support img->img soon. We think the performance difference is partly explained by MLIR and IREE being a compiler toolchain, whereas ORT is more of an interpreter.

If you're interested in learning more and supporting this new code path, please email me at claforte at my employer's domain, or send me a Discord friend invite at claforte (my number is #7115). BTW, I'm also trying to get authorization to reward the most helpful open-source developers with a few Navi2 and Navi3 GPUs (soon after they are officially released). :-)