Bookmarks

POMDPs for Dummies

Richard S. Sutton, Turing Award Winner | Approximately Correct

AI Olympics (multi-agent reinforcement learning)

World Models

by Marcus Hutter and David Quarel and Elliot Catt

Genie 2: A large-scale foundation world model

TS_Tutorial

Indices and tables

Self-Rewarding Language Models

Some Core Principles of Large Language Model (LLM) Tuning

Subcategories