TL;DR:
- Compile times are 13-16% shorter if you have several backends (always on Windows), slight but consistent improvements if you use only a single one (default on Mac)
- Runtime performance of compute passes gets worse by 10-30% (with a 40% worse outlier for a lot of dispatches in a single pass)
- Runtime performance of render passes sometimes 17-34% better and sometimes 10% worse 🤷
- Stuck doc-gen issue is fixed!
wgpu-info
Windows binary size goes down, Mac binary sizes goes up unless you add more backends