Bookmarks

Implementation of a simple microprocessor using Verilog

I am trying to build a simple microprocessor in Verilog as a way to learn Verilog and assembly at the same time. I am not sure whether my implementation reflects how microprocessors actually work ...

learn-fpga/FemtoRV/TUTORIALS/FROM_BLINKER_TO_RISCV/README.md at master · BrunoLevy/learn-fpga · GitHub

Learning FPGA, yosys, nextpnr, and RISC-V.

What’s the (floating) Point of all these data types? A (not so) brief overview of the history and usage of datatypes within the wide world of computation

This presentation delves into the fascinating and sometimes aggravating world of numerical data types, exploring the evolution, strengths, and weaknesses of decimal, fixed point, floating point, and shared exponent formats over the past 70 years.

Tenstorrent first thoughts

I've looked into alternative AI accelerators to continue my saga of running GGML on lower power-consumption hardware. The most promising - and the only one that ever replied to my emails - was Tenstorrent. This post is me thinking hard about whether buying their hardware for development is a good investment.

How to Think About TPUs

All about how TPUs work, how they're networked together to enable multi-chip training and inference, and how they limit the performance of our favorite algorithms. While this may seem a little dry, it's super important for actually making models efficient.

Tenstorrent Wormhole Series Part 1: Physicalities

A company called Tenstorrent designs and sells PCIe cards for AI acceleration. At the time of writing, they've recently started shipping their Wormhole n150s and Wormhole n300s cards.

Automating GPU Kernel Generation with DeepSeek-R1 and Inference Time Scaling

As AI models extend their capabilities to solve more sophisticated challenges, a new scaling law known as test-time scaling or inference-time scaling is emerging. Also known as AI reasoning or long…

FPGAs for Software Engineers 0: The Basics

A brief introduction to FPGAs, Verilog and simulation

Richard Hamming - Wikipedia

Richard Wesley Hamming (February 11, 1915 – January 7, 1998) was an American mathematician whose work had many implications for computer engineering and telecommunications.

How Many Computers Are In Your Computer?

Any ‘computer’ is made up of hundreds of separate computers plugged together, any of which can be hacked. I list some of these parts.

Haskell as fast as C: working at a high altitude for low level performance

After the last post about high performance, high level programming, Slava Pestov, of Factor fame, wondered whether it was generally true that “if you want good performance you have to write C…

A Beginner's Guide to Vectorization By Hand: Part 3

We're continuing our expedition into the world of manual vectorization. In this part, we explain the most common technique for vectorizing conditional code, usually referred to as if-conversion.
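
A minimal sketch of if-conversion (my example, not the article's): evaluate both sides of the branch unconditionally, build a mask from the comparison, and blend the results per lane. Assumes SSE4.1 and n divisible by 4.

```c
#include <smmintrin.h>  /* SSE4.1: _mm_blendv_ps */

/* Branchy scalar original: if (a[i] > 0) out[i] = a[i] * 2; else out[i] = a[i]; */
void scale_positive(const float *a, float *out, int n)
{
    const __m128 two  = _mm_set1_ps(2.0f);
    const __m128 zero = _mm_setzero_ps();
    for (int i = 0; i < n; i += 4) {
        __m128 v       = _mm_loadu_ps(a + i);
        __m128 mask    = _mm_cmpgt_ps(v, zero);  /* all-ones lanes where a[i] > 0 */
        __m128 doubled = _mm_mul_ps(v, two);     /* compute the "then" side for every lane */
        _mm_storeu_ps(out + i, _mm_blendv_ps(v, doubled, mask));  /* select per lane */
    }
}
```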

Algorithms for Modern Hardware

Its intended audience is everyone from performance engineers and practical algorithm researchers to undergraduate computer science students who have just finished an advanced algorithms course and want to learn more practical ways to speed up a program than by going from O(n log n) to O(n log log n).

How LLVM Optimizes a Function

In some compilers the IR format remains fixed throughout the optimization pipeline; in others, the format or semantics change.

A Beginner's Guide to Vectorization By Hand: Part 1

The CPU vendors have been trying for a long time to exploit as much parallelism as they can, and the introduction of vector instructions is one way to go.

How Target-Independent is Your IR?

An esoteric exploration of the target independence of compiler IRs.

Nine Rules for SIMD Acceleration of Your Rust Code (Part 1)

General Lessons from Boosting Data Ingestion in the range-set-blaze Crate by 7x

Putting the “You” in CPU

Curious exactly what happens when you run a program on your computer? Learn how multiprocessing works, what system calls really are, how computers manage memory with hardware interrupts, and how Linux loads executables.

bytecode interpreters for tiny computers

I've previously come to the conclusion that there's little reason for using bytecode in the modern world, except in order to get more compact code, for which it can be very effective.

Fast Multidimensional Matrix Multiplication on CPU from Scratch

Numpy can multiply two 1024x1024 matrices on a 4-core Intel CPU in ~8 ms. This is incredibly fast, considering this boils down to 18 FLOPs / core / cycle, with...
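
The arithmetic behind that figure (the ~3.7 GHz clock is my assumption; the post has the exact numbers):

```latex
\frac{2 \cdot 1024^3\ \text{FLOPs}}{8\,\text{ms} \times 4\ \text{cores} \times 3.7\,\text{GHz}} \approx 18\ \text{FLOPs per core per cycle}
```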

Efficient n-states on x86 systems

The text discusses how to efficiently handle control flow in x86 systems when a flag can have multiple states beyond true and false. It explains how to use condition codes, such as testing for zero and parity, to minimize the number of instructions needed for these tests. Additionally, it touches on the challenges and limitations of using inline assembly for optimization in C programming.
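
A sketch of the trick (the state encoding is my illustration; the article works directly in assembly): choose the states' values so a single TEST exposes all three via the flags. x86 sets PF when the low byte has an even number of set bits, so 0, 1, and 3 are distinguishable from one test.

```c
#include <stdio.h>

/* Encode three states as 0, 1, 3 so one "test al, al" separates them:
 *   value 0 -> ZF=1           (jz)
 *   value 1 -> ZF=0, PF=0     (odd popcount:  jnp)
 *   value 3 -> ZF=0, PF=1     (even popcount: jp)
 */
static const char *classify(unsigned char v)
{
    if (v == 0)
        return "state A";         /* test v,v ; jz  */
    if (__builtin_parity(v))      /* odd number of set bits -> PF clear */
        return "state B";         /* test v,v ; jnp */
    return "state C";             /* test v,v ; jp  */
}

int main(void)
{
    printf("%s %s %s\n", classify(0), classify(1), classify(3));
    return 0;
}
```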

Program tuning as a resource allocation problem

Program tuning involves balancing simplicity and performance while sharing cache resources among various subsystems. Optimizing one function can impact others, making it a global resource allocation problem that requires careful consideration of algorithms and their resource footprints. Better tools and metrics are needed to manage and analyze cache resource consumption effectively.

Optimizing subroutines in assembly language

Optimizing subroutines in assembly language involves various techniques such as using inline assembly in a C++ compiler, separating code using MMX registers from code using ST registers, and understanding different register sizes and memory operands. It is important to consider the use of instruction prefixes, intrinsic functions for vector operations, and accessing class and structure members efficiently. Additionally, preventing false dependences, aligning loop and subroutine entries, and optimizing instruction sizes can improve performance. However, it is crucial to note that these optimizations are processor-specific and may vary depending on the target platform.

Brian Robert Callahan

This blog post starts a series on creating programs that demystify how programs work. The first program is a disassembler that reads bytecode and converts it into assembly language, while a future post will cover creating an assembler. The disassembler uses a table of mnemonics and instruction sizes to print out the corresponding assembly instructions from bytecode.
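
The core of such a disassembler is just the table walk. A sketch under an invented 3-instruction ISA (opcodes, mnemonics, and sizes below are hypothetical, not the post's):

```c
#include <stdio.h>

/* Each table entry: mnemonic plus total instruction size in bytes. */
struct insn { const char *mnemonic; int size; };

static const struct insn table[256] = {
    [0x01] = { "push", 2 },   /* push imm8  */
    [0x02] = { "add",  1 },   /* add        */
    [0x03] = { "jmp",  3 },   /* jmp imm16  */
};

void disassemble(const unsigned char *code, int len)
{
    for (int pc = 0; pc < len; ) {
        const struct insn *in = &table[code[pc]];
        if (!in->mnemonic) {                       /* unknown byte: dump it */
            printf("%04x: db 0x%02x\n", pc, code[pc]);
            pc++;
            continue;
        }
        printf("%04x: %s", pc, in->mnemonic);
        for (int i = 1; i < in->size; i++)         /* print operand bytes */
            printf(" 0x%02x", code[pc + i]);
        printf("\n");
        pc += in->size;                            /* advance by table size */
    }
}

int main(void)
{
    unsigned char prog[] = { 0x01, 0x2a, 0x02, 0x03, 0x00, 0x10 };
    disassemble(prog, sizeof prog);
    return 0;
}
```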

Recent presentations and papers

Andi Kleen's work focuses on improving Linux performance through various techniques like hardware monitoring and profiling. He has presented on topics such as lock elision, multi-core scalability, and error handling in the Linux kernel. His contributions include discussions on modern CPU performance, tools for Linux development, and enhancements for energy efficiency.

How long does it take to make a context switch?

Context switching times vary significantly across different Intel CPU models, with more expensive CPUs generally performing better. The performance can be greatly affected by cache usage and thread migration between cores, leading to increased costs when tasks are switched. Optimizing the number of threads to match the number of hardware threads can improve CPU efficiency and reduce context switching overhead.

Tiled Matrix Multiplication

Tiled matrix multiplication is an efficient algorithm used on GPUs that reduces memory access by utilizing shared memory. By organizing threads into blocks, each thread can perform calculations more quickly and with fewer memory accesses. This method is important for improving performance in tasks like graphics rendering and machine learning.
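
The same idea in CPU terms (a sketch of mine, not the article's GPU code): block the loops so a tile of each matrix stays resident in cache while it is reused, which is the cache analogue of staging tiles in GPU shared memory. Assumes C starts zeroed.

```c
#define N    512
#define TILE 32   /* three TILE x TILE float blocks = 12 KB, fits in L1 */

/* C += A * B, all N x N row-major. */
void matmul_tiled(const float A[N][N], const float B[N][N], float C[N][N])
{
    for (int ii = 0; ii < N; ii += TILE)
        for (int kk = 0; kk < N; kk += TILE)
            for (int jj = 0; jj < N; jj += TILE)
                /* work entirely inside the three cached tiles */
                for (int i = ii; i < ii + TILE; i++)
                    for (int k = kk; k < kk + TILE; k++) {
                        float a = A[i][k];          /* reused across the j loop */
                        for (int j = jj; j < jj + TILE; j++)
                            C[i][j] += a * B[k][j];
                    }
}
```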

Compiler Backend

The QBE compiler backend is designed to be a compact yet high-performance C embeddable backend that prioritizes correctness, simplicity, and user-friendliness. It compiles on various x64 operating systems and boasts features like IEEE floating point support, SSA-based intermediate language, and quick compilation times. While currently limited to x64 platforms, plans include ARM support and further enhancements. The backend has been successfully utilized in various projects, showcasing its adaptability and effectiveness in compiler development.

1024cores

Dmitry Vyukov shares information on synchronization algorithms, multicore design patterns, and high-performance computing on his website, 1024cores.net. He focuses on shared-memory systems and does not cover topics like clusters or GPUs. New content is added regularly, and readers can subscribe for updates.

Pointers Are Complicated, or: What's in a Byte?

The document explains the complexities of pointers in low-level programming languages like C++ and Rust, debunking the misconception that pointers are simple integers. It delves into examples showing how assumptions about pointers can lead to undefined behavior and how pointer arithmetic can be tricky. The text proposes a model where a pointer is a pair of an allocation ID and an offset, rather than just an integer. Additionally, it discusses the challenges of representing bytes in memory, especially when dealing with uninitialized memory and the need for a more nuanced byte representation to ensure program correctness.
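
The post's central example, transcribed to C (comments mine): forming a one-past-the-end pointer is legal, but writing through it is undefined behavior even if it happens to equal &y[0] as an integer, which is exactly what the allocation-ID-plus-offset model captures.

```c
#include <stdio.h>
#include <stdlib.h>

int main(void)
{
    int *x = malloc(8 * sizeof(int));
    int *y = malloc(8 * sizeof(int));
    y[0] = 42;

    int i = 8;           /* imagine this arrives from an opaque computation */
    int *x_ptr = x + i;  /* one past the end of x: legal to *form* ...      */
    (void)x_ptr;
    /* *x_ptr = 23;         ... but NOT to write through, even if x_ptr
                            equals y numerically: the compiler may assume
                            it cannot alias y and still print 42. */

    printf("%d\n", y[0]);
    free(x);
    free(y);
    return 0;
}
```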

How To Build a User-Level CPU Profiler

The text discusses how the pprof tool simplifies CPU profiling for C++ and Go programs by utilizing hardware timers and the operating system. Profiling information is gathered through hardware interrupts, providing insights into a program's performance and resource usage. By moving profiling logic to user-level timers, programs can customize and enhance profiling capabilities without kernel changes.
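
A minimal sketch of the mechanism pprof builds on (not pprof's code): ask the kernel for a profiling timer and count SIGPROF deliveries. A real profiler would record the interrupted program counter from the signal context rather than a bare count.

```c
#include <signal.h>
#include <stdio.h>
#include <sys/time.h>

static volatile sig_atomic_t samples;

static void on_prof(int sig) { (void)sig; samples++; }

int main(void)
{
    struct sigaction sa = { .sa_handler = on_prof };
    sigaction(SIGPROF, &sa, NULL);

    struct itimerval it = {
        .it_interval = { .tv_sec = 0, .tv_usec = 10000 },  /* fire at 100 Hz */
        .it_value    = { .tv_sec = 0, .tv_usec = 10000 },  /* of CPU time    */
    };
    setitimer(ITIMER_PROF, &it, NULL);

    volatile double x = 0;                   /* busy work to profile */
    for (long i = 0; i < 200000000L; i++)
        x += i * 0.5;

    printf("collected %d samples\n", (int)samples);
    return 0;
}
```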

Programming Language Memory Models (Memory Models, Part 2)

Modern programming languages use atomic variables and operations to help synchronize threads and prevent data races. This ensures that programs run correctly by allowing proper communication between threads without inconsistent memory access. All major languages, like C++, Java, and Rust, support sequentially consistent atomics to simplify the development of multithreaded programs.
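
The classic message-passing litmus test from the post, written here in C11 (my transcription). Because done is atomic and sequentially consistent by default, a reader that observes done == 1 is guaranteed to also observe x == 1.

```c
#include <stdatomic.h>
#include <stdio.h>
#include <threads.h>   /* C11 threads; pthreads works the same way */

int x;                 /* plain data, published via the flag */
atomic_int done;

int writer(void *arg)
{
    (void)arg;
    x = 1;
    atomic_store(&done, 1);   /* publish the message */
    return 0;
}

int main(void)
{
    thrd_t t;
    thrd_create(&t, writer, NULL);
    while (atomic_load(&done) == 0)   /* wait for the flag */
        ;
    printf("x = %d\n", x);            /* prints 1, never 0 */
    thrd_join(t, NULL);
    return 0;
}
```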

Hardware Memory Models (Memory Models, Part 1)

This text discusses hardware memory models, focusing on how different processors handle memory operations and maintain order. It explains the concept of sequential consistency, where operations are executed in a predictable order, and contrasts it with more relaxed models like those used in ARM and POWER architectures. The author highlights the importance of synchronization to avoid data races in concurrent programming.

Tiny Tapeout

Tiny Tapeout is a project that helps people easily and affordably create their own chip designs. It offers resources for beginners and advanced users, along with a special price for submissions. Join the community to learn and share your designs before the deadline on September 6th.

What Every Computer Scientist Should Know About Floating-Point Arithmetic

The text discusses the challenges and considerations of floating-point arithmetic in computer science. It emphasizes the importance of rounding in floating-point calculations and the implications of different precision levels. Additionally, it highlights the need for careful implementation to ensure correctness and accuracy in programs that rely on floating-point arithmetic.
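
The rounding issue in one small example (mine, not the paper's): 0.1, 0.2, and 0.3 are not exactly representable in binary floating point, so each literal is rounded, and the sum rounds differently than the literal 0.3.

```c
#include <stdio.h>

int main(void)
{
    double a = 0.1 + 0.2;
    printf("%.17g\n", a);       /* 0.30000000000000004 */
    printf("%d\n", a == 0.3);   /* 0: compare with a tolerance instead */
    return 0;
}
```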

But how, exactly, do databases use mmap?

Databases use memory-mapped files, via mmap, to handle data on disk larger than available memory. Examples include SQLite, LevelDB, Lucene, LMDB, and MongoDB. By understanding how mmap is used, we can grasp how databases efficiently read and write data on disk.
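
The core pattern in miniature (a sketch; the file name is hypothetical): map the file and read it as ordinary memory, letting the kernel page data in on demand, so the file may exceed RAM.

```c
#include <fcntl.h>
#include <stdio.h>
#include <sys/mman.h>
#include <sys/stat.h>
#include <unistd.h>

int main(void)
{
    int fd = open("data.db", O_RDONLY);   /* hypothetical database file */
    if (fd < 0) return 1;
    struct stat st;
    fstat(fd, &st);

    const unsigned char *p =
        mmap(NULL, st.st_size, PROT_READ, MAP_PRIVATE, fd, 0);
    if (p == MAP_FAILED) return 1;

    unsigned long sum = 0;                /* touching pages faults them in */
    for (off_t i = 0; i < st.st_size; i++)
        sum += p[i];
    printf("checksum: %lu\n", sum);

    munmap((void *)p, st.st_size);
    close(fd);
    return 0;
}
```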

spikedoanz/from-bits-to-intelligence: machine learning stack in under 100,000 lines of code

The text discusses building a machine learning stack in under 100,000 lines of code with hardware, software, tensors, and machine learning components. It outlines the required components like a CPU, GPU, storage, C compiler, Python runtime, operating system, and more. The goal is to simplify the machine learning stack while providing detailed steps for implementation in different programming languages.

Comparing SIMD on x86-64 and arm64

The text compares SIMD implementations using SSE on x86-64 and Neon on arm64 processors, including emulating SSE on arm64 with Neon. It explores vectorized code performance using intrinsics, auto-vectorization, and ISPC, highlighting the efficiency of SSE and Neon implementations. The study shows how optimizing for SIMD instructions significantly boosts performance over scalar implementations in ray-box intersection tests.
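
A flavor of how the two instruction sets mirror each other (my example; the article benchmarks ray-box intersection): the same four-wide float add, once per ISA. Assumes n is a multiple of 4.

```c
#include <stddef.h>

#if defined(__SSE__)
#include <xmmintrin.h>
/* x86-64: four floats per iteration with SSE */
void add4(const float *a, const float *b, float *out, size_t n)
{
    for (size_t i = 0; i < n; i += 4)
        _mm_storeu_ps(out + i,
            _mm_add_ps(_mm_loadu_ps(a + i), _mm_loadu_ps(b + i)));
}
#elif defined(__ARM_NEON)
#include <arm_neon.h>
/* arm64: the same four-wide add with Neon */
void add4(const float *a, const float *b, float *out, size_t n)
{
    for (size_t i = 0; i < n; i += 4)
        vst1q_f32(out + i, vaddq_f32(vld1q_f32(a + i), vld1q_f32(b + i)));
}
#endif
```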

Compiler Optimizations Are Hard Because They Forget

Compiler optimizations involve breaking down complex changes into smaller, more manageable steps to improve code efficiency. However, as more optimizations are added, the potential for errors and missed opportunities increases, making it challenging to maintain optimal performance. Compilers struggle with balancing aggressive optimizations while preserving correct program behavior, highlighting the complexity and difficulties inherent in optimizing compilers.

A new JIT engine for PHP-8.4/9

A new JIT engine for PHP is being developed, improving performance and simplifying development. The engine will be included in the next major PHP version, potentially PHP 9.0. The new JIT engine generates a single Intermediate Representation (IR), eliminating the need to support assembler code for different CPUs.

Introduction 2016 NUMA Deep Dive Series

The 2016 NUMA Deep Dive Series by staroceans.org explores various aspects of computer architecture, focusing on NUMA systems and their optimization for performance. The series covers topics such as system architecture, cache coherency, memory optimization, and VMkernel constructs to help readers understand and improve their host design and management. The series aims to provide valuable insights for configuring and deploying dual socket systems using Intel Xeon processors, with a focus on enhancing overall platform performance.

von Neumann architecture - Wikipedia

The von Neumann architecture is a computer design with a processing unit, control unit, memory, and input/output mechanisms. It allows for instructions and data operations to be stored in memory, advancing computer technology from fixed-function machines like the ENIAC. This architecture was influenced by the work of Alan Turing and John von Neumann and has been widely used in the development of modern computers.

Compiling tree transforms to operate on packed representations

The article explains how tree traversals in programming can be optimized by compiling them to work on serialized tree structures without using pointers. This approach can make programs run significantly faster on current x86 architectures. The authors developed a prototype compiler for a functional language that generates efficient code for traversing trees using packed data representations.
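
The flavor of a packed representation (my sketch; the paper's compiler generates such code automatically): a binary tree serialized in pre-order into a flat buffer, traversed by a single left-to-right cursor scan instead of pointer chasing.

```c
#include <stdio.h>

/* Layout: LEAF tag followed by an int payload, or NODE tag followed by
 * the packed left subtree, then the packed right subtree. */
enum { LEAF = 0, NODE = 1 };

static int sum_packed(const unsigned char **cur)
{
    if (*(*cur)++ == LEAF) {
        int v = *(const int *)*cur;   /* payload (alignment ignored in sketch) */
        *cur += sizeof(int);
        return v;
    }
    int left = sum_packed(cur);       /* cursor now points at the right subtree */
    return left + sum_packed(cur);
}

int main(void)
{
    /* (Node (Leaf 1) (Node (Leaf 2) (Leaf 3))) */
    unsigned char buf[64], *w = buf;
    *w++ = NODE; *w++ = LEAF; *(int *)w = 1; w += sizeof(int);
    *w++ = NODE; *w++ = LEAF; *(int *)w = 2; w += sizeof(int);
    *w++ = LEAF; *(int *)w = 3; w += sizeof(int);

    const unsigned char *cur = buf;
    printf("%d\n", sum_packed(&cur));  /* prints 6 */
    return 0;
}
```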

Infographics: Operation Costs in CPU Clock Cycles

The text discusses the operation costs in CPU clock cycles for different types of operations, including simple operations, floating-point operations, and vector operations. It highlights that memory involvement can significantly impact operation costs, with some operations taking as little as 1 CPU cycle. Different CPU architectures and types of operations can result in varying costs, with some operations requiring specialized CPU support to work efficiently.

KHM+15

The text discusses a formal C memory model that supports integer-pointer casts, essential for low-level C programming. It proposes a quasi-concrete memory model that allows standard compiler optimizations while fully supporting integer-pointer casts. This model helps verify programs and optimizations that are challenging to validate with integer-pointer casts.

Xv6, a simple Unix-like teaching operating system

Xv6 is a teaching operating system developed by MIT for their operating systems course. It is based on Unix V6, written in ANSI C, and runs on Intel x86 machines. The xv6 source code is available on GitHub and is used in lectures to teach operating system concepts.

C Is Not a Low-level Language

C is often considered a low-level language, but this article argues that it is not. The author explains that vulnerabilities like Spectre and Meltdown occurred because processor architects were trying to build fast processors that exposed the same abstract machine as a PDP-11, which C programmers believe is close to the underlying hardware. However, the reality is that C code runs on a complex compiler that performs intricate transformations to achieve the desired performance. The article also discusses how C's memory model and optimizations make it difficult to understand and can lead to undefined behavior. The author suggests that instead of trying to make C code fast, it may be time to explore programming models on processors designed for speed.

In-depth analysis on Valorant’s Guarded Regions

The text discusses how Valorant's anti-cheat system, Vanguard, uses innovative techniques to protect against memory manipulation by whitelisting threads and creating shadow regions. These methods involve cloning and modifying the game's paging tables to allow access to hidden memory without affecting performance. By implementing these advanced security measures, Vanguard effectively prevents cheats from bypassing its guarded regions.

Exploit Development: No Code Execution? No Problem! Living The Age of VBS, HVCI, and Kernel CFG

The text discusses various techniques used in exploit development, particularly focusing on targeting the Windows kernel. It mentions concepts like Hypervisor-Protected Code Integrity (HVCI) and how exploits can manipulate memory to execute attacker-controlled code in kernel mode. The text also delves into details like leaking kernel-mode memory, constructing ROP chains on the kernel-mode stack, and utilizing functions like NtQuerySystemInformation to escalate privileges and perform malicious actions in the system.

Udacity CS344: Intro to Parallel Programming

Intro to Parallel Programming is a free online course by NVIDIA and Udacity teaching parallel computing with CUDA. It's for developers, scientists, engineers, and students looking to learn about GPU programming and optimization. The course is self-paced, requires C programming knowledge, and offers approximately 21 hours of content.

When FFI Function Calls Beat Native C

David Yu performed a benchmark comparing different Foreign Function Interfaces (FFI) for function calls. LuaJIT's FFI was found to be faster than native C function calls due to efficient dynamic function call handling. Direct function calls, like those used by LuaJIT, can outperform indirect calls routed through a Procedure Linkage Table (PLT).
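
A sketch of the distinction (my example, glibc-specific library name): a normal call into a shared library goes through the PLT, an extra indirect jump through a lazily-bound table, whereas calling through a pointer resolved once with dlsym skips that stub, much as LuaJIT's FFI calls a cached address directly. Link with -ldl.

```c
#include <dlfcn.h>
#include <stdio.h>

int main(void)
{
    void *libm = dlopen("libm.so.6", RTLD_NOW);   /* glibc's math library */
    if (!libm) return 1;

    /* resolve once, then call through the pointer: no PLT stub involved */
    double (*p_sin)(double) = (double (*)(double))dlsym(libm, "sin");
    printf("%f\n", p_sin(1.0));

    dlclose(libm);
    return 0;
}
```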

What Is The Minimal Set Of Optimizations Needed For Zero-Cost Abstraction?

Rust and C++ offer "zero-cost abstractions" where high-level code compiles to low-level code without added runtime overhead, but enabling necessary compiler optimizations can slow down compilation and impact debugging. The challenge is to find the minimal set of optimizations that maintain zero-cost abstractions while improving build speed and debug information quality. Balancing fast debuggable builds with zero-cost abstractions is crucial for performance and developer experience in languages like Rust and C++.

Using ASCII waveforms to test hardware designs

Using expect tests automates the validation of code output, detecting errors efficiently. Jane Street uses Hardcaml in OCaml for hardware development, simplifying testbench creation. Waveform expect tests help visualize hardware behavior, improving development workflows.

Your ABI is Probably Wrong

The text discusses how most ABIs have a design flaw that harms performance by passing large structures inefficiently. Different ABIs handle passing large structures differently, but they all repeat the same mistakes. A correctly-specified ABI should pass large structures by immutable reference to avoid unnecessary copies.
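
The issue in miniature (my example): under most C ABIs the by-value version forces the caller to copy all 128 bytes onto the stack at every call, while the by-reference version passes a single pointer in a register. C has no immutable references, so a const pointer is the closest spelling of what the article recommends.

```c
/* A 128-byte structure. */
struct big { long v[16]; };

long sum_by_value(struct big b)          /* caller copies 128 bytes */
{
    long s = 0;
    for (int i = 0; i < 16; i++) s += b.v[i];
    return s;
}

long sum_by_ref(const struct big *b)     /* caller passes one pointer */
{
    long s = 0;
    for (int i = 0; i < 16; i++) s += b->v[i];
    return s;
}
```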

BSTJ 57: 6. July-August 1978: The UNIX Time-Sharing System. (Ritchie, D.M.; Thompson, K.)

The UNIX Time-Sharing System is a versatile operating system with unique features. It runs on Digital Equipment Corporation computers and emphasizes simplicity and ease of use. UNIX has been widely adopted for research, education, and document preparation purposes.

Writing CUDA Kernels for PyTorch

The text shows the thread distribution on different streaming multiprocessors (SM) in CUDA. Threads are organized into warps, lanes, and specific thread numbers within each SM. This information is crucial for optimizing CUDA kernels in PyTorch.

An Introduction to Assembly Programming with RISC-V

A free book introducing assembly programming with RISC-V, published by riscv-programming.org (ISBN 978-65-00-15811-3).

Chapter 2 Basics of SIMD Programming

The text explains how to organize data for SIMD operations and provides examples of SIMD-Ready Vectors. It also discusses the relationship between vectors and scalars in SIMD programming. Built-in functions for VMX instructions and SIMD operation principles are outlined in the text.

Matrix Multiplication on CPU

Marek Kolodziej's article on matrix multiplication on CPU, published at marek.ai.

How to Optimize a CUDA Matmul Kernel for cuBLAS-like Performance: a Worklog

A worklog by Simon Boehm on optimizing a CUDA matmul kernel to approach cuBLAS performance, published at siboehm.com.

Where do LLMs spend their FLOPS?

LLMs (large language models) spend their FLOPS (floating point operations) on various tasks, including computing QKV (query, key, value) matrices, attention output matrices, and running the feed-forward network (FFN). The attention mechanism plays a crucial role in LLMs, even though the FLOPS required for attention calculations are relatively small. The KV cache, which stores information for each token, requires significant memory but is necessary for generating sequences. Different architectural choices, such as grouped query attention and sliding window attention, can affect the size and efficiency of the KV cache. Increasing the number of layers in an LLM linearly scales the FLOPS and parameters, while increasing the model width quadratically scales the model size. Wider models parallelize better, while deeper models increase inference time linearly.
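
Two standard rules of thumb behind this accounting (common approximations, not figures quoted from the post):

```latex
\text{forward FLOPs per token} \approx 2\,N_{\text{params}}, \qquad
\text{KV cache bytes per token} = 2 \cdot n_{\text{layers}} \cdot n_{\text{kv\_heads}} \cdot d_{\text{head}} \cdot b_{\text{bytes/param}}
```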

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Recent research is leading to a new era of 1-bit Large Language Models (LLMs), such as BitNet, introducing a variant called BitNet b1.58 where every parameter is ternary {-1, 0, 1}. This model matches the performance of full-precision Transformer LLMs while being more cost-effective in terms of latency, memory, throughput, and energy consumption. The 1.58-bit LLM sets a new standard for training high-performance and cost-effective models, paving the way for new computation methods and specialized hardware designed for 1-bit LLMs.
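
Why ternary weights invite new computation methods, in one small sketch (my illustration, not BitNet's kernel): with weights restricted to {-1, 0, +1}, a dot product needs no multiplications at all; each weight either adds, subtracts, or skips an activation.

```c
/* Ternary-weight dot product: multiply-free. */
float ternary_dot(const signed char *w, const float *x, int n)
{
    float acc = 0.0f;
    for (int i = 0; i < n; i++) {
        if (w[i] == 1)       acc += x[i];
        else if (w[i] == -1) acc -= x[i];
        /* w[i] == 0: contributes nothing */
    }
    return acc;
}
```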
