Short notes
Confidence regulation in LLMs
Direct logit attribution
Reinforcement learning from human feedback / LLMs as policy
Activation steering
Long notes
Maximum update parametrization (Adam)
Rotary positional encoding
Resources
Recommended materials
My theses