Short notes
Confidence regulation in LLMs
Direct logit attribution
Reinforcement learning from human feedback / LLMs as policy
Activation steering
Long notes
Maximum update parametrization (Adam)
Rotary positional encoding
Claude
On the storage cost of latent video representations
Resources
Recommended materials
Introduction to frame-autoregressive video models
My theses