Chapter 4: Computer Architecture and Scheduling

4.1 Architecture of a modern GPU

GPU layout

4.2 Block scheduling

block assignment

4.3 Synchronization and transparent scalability

synchronization

4.4 Warps and SIMD hardware

warp layout
processing blocks

4.5 Control divergence

divergences

4.6 Warp scheduling and latency tolerance