Bookmarks

The Unreasonable Effectiveness of JPEG: A Signal Processing Approach

LoRA explained (and a bit about precision and quantization)

Concise primer on LoRA and QLoRA, showing how low-rank adapters enable parameter-efficient fine-tuning of Transformer models under quantization.

neural video codecs: the future of video compression

ai

1-bit Model

2309.10668

Pruning vs Quantization: Which is Better?

Subcategories