Created
September 26, 2024 16:55
-
-
Save csullivan/2229d8e341d430bac1792279d8872a87 to your computer and use it in GitHub Desktop.
Performance comparison: 5% gain using wgmma with LHS in registers vs shared. [1] https://github.com/csullivan/wgmma-intrin
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment