(Training material on PyTorch CPU performance optimization)
- Part I: Memory Formats and Channels Last Optimization
- Part II: Parallelization Techniques
- Part IV: BFloat16 Kernel Optimization
A Chinese version of this chapter is also available: link.
This section covers the following topics: