Bookmarks
03 CUDA Fundamental Optimization Part 1
Detailed lecture on foundational CUDA performance techniques—memory coalescing, occupancy, and kernel launch parameters—illustrated through hands-on code profiling and optimization steps.
Refterm Lecture Part 5 - Parsing with SIMD
Technical lecture showing how to accelerate text parsing by leveraging SIMD instructions, delving into low-level CPU mechanics, data alignment, and practical code optimization strategies.
04 CUDA Fundamental Optimization Part 2
Lecture on CUDA fundamental optimizations provides specialized technical guidance for high-performance GPU computing.