- Predictions, Patterns, and Actions
- Karpathy’s YouTube Playlist
- Transformers from Scratch
- GPT in 60 Lines
- CUDA MatMul Kernel
- Autonomous AI Agents
- ML Interviews Book
- Novice’s Guide to LLM Training
- Transformer Math
- FlashAttention Explainer
- Transformer Inference Arithmetic
- Kernel Cookbook
- Horace He Fast DL
- Long Term Memory
- Introduction to Kalman Filters for Programmers
- Google’s Grokking Explainer
- Understand Autodiff in 30 Lines
- Little Book of Deep Learning
- The Bitter Lesson
- The Bitter Lesson 2
- Swyx’s AI Notes
- Keeping up with AGI
- Kipply’s July-August Reading List
- Handmade Transformer
- Distributed and Efficient Finetuning
- stateof.ai
- Machine Learning Flashcards
- Kinetics-700
- DALL-E 3
- Llama V2
- A16Z Infra Llama Chat
- HF Model
- Finetuning Tutorial
- How is Llama Possible
- SAIL Courses
- Stanford Smallville, Interactive Simulacra
- OpenAI Cookbook
- [Hardwae Resourcesre Resources](https://docs.google.com/document/d/1NCxdf9hTB1hFeRTvXOV0Pce4Yh7AFl2JuQhMyOQ4EjE/edit?usp=sharing)
- RLHF Lit Review
- Kipply’s September-October Reading List
- Tracing Emergent Abilities
- Matrix Calculus for Deep Learning
- https://github.com/stas00/ml-engineering
- https://blog.briankitano.com/llama-from-scratch/