The target audience possesses basic to advanced knowledge of parallel algorithms and graphics APIs.
This tutorial intends to attract viewers with a strong interest for understanding and optimizing for the underlying mechanisms of parallel execution on GPU hardware.
Senior developers get a chance to acquaint themselves with recent CUDA features and their impact on kernel design.
Furthermore, the audience is introduced to task-based applications of CUDA beyond the classic many-kernel programming pattern.