Azhng/brainstorm.md

Created August 12, 2018 17:35

Scope

A single, narrowly scoped topic within GPGPU computing (implemented using OpenCL or CUDA)

Potential topics

OpenCL: Introduction
- Command Queue model
- Host memory model
  - OpenCL native memory types
  - shared virtual memory
- Device memory model
  - global memory
  - local memory
  - constant memory
  - private memory
- Kernel execution domain
  - work item
  - work group
  - events
  - out-of-order command queue
  - using events to implement kernel dependencies
- Synchronization techniques
  - barriers
  - fences
  - atomics
  - memory ordering
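
To make the work-item / work-group relationship from the kernel execution domain concrete, here is a minimal host-side sketch in plain C. It assumes a 1-D NDRange, a zero global offset, and a global size that is a multiple of the work-group size. The type `work_item_pos` and both function names are hypothetical, introduced only for illustration; inside a real kernel the same values come from `get_global_id`, `get_group_id`, `get_local_id`, and `get_local_size`.

```c
#include <stddef.h>

/* Hypothetical helper type: position of one work-item in a 1-D NDRange. */
typedef struct {
    size_t group_id; /* which work-group the work-item belongs to */
    size_t local_id; /* index of the work-item inside its work-group */
} work_item_pos;

/* Split a global ID into (group, local), assuming a zero global offset. */
static work_item_pos split_global_id(size_t global_id, size_t local_size)
{
    work_item_pos pos;
    pos.group_id = global_id / local_size;
    pos.local_id = global_id % local_size;
    return pos;
}

/* Inverse mapping: group_id * local_size + local_id, again with zero offset. */
static size_t make_global_id(work_item_pos pos, size_t local_size)
{
    return pos.group_id * local_size + pos.local_id;
}
```

For example, with a work-group size of 64, global ID 70 falls in work-group 1 at local ID 6.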
OpenCL: Implementation on different hardware
- Multi-core CPU
  - multiplexing work groups onto a single physical CPU core
  - vectorization
- GPU
  - GPU threading
  - Overview of the AMD Sea Islands ISA
  - SIMD unit mapping to work items & work groups
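
One way to picture the multi-core CPU mapping above is a loop that runs every work-item of a work-group back to back on one thread. The sketch below is plain C rather than a real OpenCL runtime; `kernel_fn`, `scale_by_two`, `run_work_group`, and `run_ndrange` are hypothetical names, and the kernel is assumed to contain no barriers (a barrier would force the runtime to split the loop at that point).

```c
#include <stddef.h>

/* Hypothetical kernel signature: one call handles one work-item. */
typedef void (*kernel_fn)(size_t global_id, const float *in, float *out);

/* Example kernel body: each work-item doubles one element. */
static void scale_by_two(size_t global_id, const float *in, float *out)
{
    out[global_id] = 2.0f * in[global_id];
}

/* Run all work-items of one work-group serially on the calling thread. */
static void run_work_group(kernel_fn kernel, size_t group_id, size_t local_size,
                           const float *in, float *out)
{
    for (size_t local_id = 0; local_id < local_size; ++local_id) {
        kernel(group_id * local_size + local_id, in, out);
    }
}

/* Run a whole 1-D NDRange; here the groups simply execute one after another. */
static void run_ndrange(kernel_fn kernel, size_t num_groups, size_t local_size,
                        const float *in, float *out)
{
    for (size_t group_id = 0; group_id < num_groups; ++group_id) {
        run_work_group(kernel, group_id, local_size, in, out);
    }
}
```

In an actual CPU implementation the outer loop would typically be distributed across worker threads, one work-group at a time per core, and the inner work-item loop is the natural candidate for vectorization.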
OpenCL: live demo - 95% efficiency improvement
- Image clustering example from the book
  - naive CPU implementation
  - naive GPU implementation
  - GPU implementation with coalesced memory access
  - GPU implementation with vectorization
  - GPU implementation with local memory (programmable scratchpad memory)
  - GPU implementation with constant memory
- FFT implementation
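
As a starting point for the naive CPU baseline in the clustering demo, the sketch below shows a nearest-centroid assignment step in plain C. It is an assumption that the book's example works this way: the outline does not say which clustering algorithm is used, so this takes a k-means-style assignment over grayscale pixel values. The function name `assign_clusters` and its parameters are hypothetical.

```c
#include <stddef.h>

/*
 * Assign each grayscale pixel to its nearest centroid (squared distance).
 * Assumes num_clusters >= 1. Ties go to the lower-numbered cluster.
 */
static void assign_clusters(const float *pixels, size_t num_pixels,
                            const float *centroids, size_t num_clusters,
                            int *labels)
{
    for (size_t i = 0; i < num_pixels; ++i) {
        size_t best = 0;
        float diff = pixels[i] - centroids[0];
        float best_dist = diff * diff;

        for (size_t c = 1; c < num_clusters; ++c) {
            diff = pixels[i] - centroids[c];
            float dist = diff * diff;
            if (dist < best_dist) {
                best_dist = dist;
                best = c;
            }
        }
        labels[i] = (int)best;
    }
}
```

The outer loop over pixels is the part a GPU version would likely turn into one work-item per pixel, and the small read-only `centroids` array is the kind of data that might be a fit for constant memory.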
