index

Short notes

Confidence regulation in LLMs
Direct logit attribution
Reinforcement learning from human feedback / LLMs as policy
Activation steering

Long notes

Maximum update parametrization (Adam)
Rotary positional encoding

Claude

On the storage cost of latent video representations

Tutorials

Introduction to frame-autoregressive video models

Resources

Recommended materials
My theses