Bookmarks

Deriving Muon

CUDA Mode Keynote | Andrej Karpathy | Eureka Labs

Luminal - Search-Based Deep Learning Compilers

Fine-tune LLMs - Line by line code example

A hands-on Jupyter notebook that walks line by line through LoRA fine-tuning of a large language model with Hugging Face PEFT.

George Hotz | Programming | rewriting linearizer (tinygrad) | Day In The Life Of A Software Engineer

George Hotz refactors tinygrad’s linearizer, exposing low-level tensor compiler optimizations that map high-level ops to efficient GPU kernels.

The Most Important Algorithm in Machine Learning

How To Scale

An overview of gradient descent optimization algorithms

ai

An overview of gradient descent optimization algorithms

Tensor2Tensor Intro
