
tanishqkumar/beyond-nanogpt

Annotated from-scratch implementations of modern deep learning techniques spanning LLMs, vision transformers, diffusion models, and RL algorithms.

Velocity (7d): +3.2 ★/day · Trend: steady

This educational repository aims to bridge the gap between basic nanoGPT-style implementations and research-level deep learning. It provides annotated, from-scratch implementations of nearly 100 modern ML techniques: KV caching and speculative decoding for LLMs; vision transformers; attention variants such as linear attention; generative models such as diffusion and flow matching; and landmark RL algorithms such as PPO, A3C, and AlphaZero. The code is heavily commented to explain the subtleties that papers and production code typically gloss over.
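To give a flavor of one listed technique, here is a minimal sketch of KV caching for single-head attention. It is not taken from the repository: the names `KVCache`, `attend`, `matvec`, `full_recompute`, and `demo` are hypothetical, and it uses plain Python lists rather than a tensor library. The idea shown is that during autoregressive decoding, each token's key and value are computed once and stored, so every step only projects the newest token instead of re-projecting the whole prefix.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def attend(q, keys, values):
    """Softmax attention of one query over lists of key and value vectors."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(dim)]


class KVCache:
    """Hypothetical cache: stores the key/value of every token seen so far."""

    def __init__(self):
        self.keys = []
        self.values = []

    def step(self, x, Wq, Wk, Wv):
        # Only the newest token is projected; earlier K/V are reused.
        self.keys.append(matvec(Wk, x))
        self.values.append(matvec(Wv, x))
        return attend(matvec(Wq, x), self.keys, self.values)


def full_recompute(xs, Wq, Wk, Wv):
    """Reference: re-project K and V for the whole prefix, attend from the last token."""
    keys = [matvec(Wk, x) for x in xs]
    values = [matvec(Wv, x) for x in xs]
    return attend(matvec(Wq, xs[-1]), keys, values)


def demo(seed=0, dim=4, steps=5):
    """Return the largest gap between cached and recomputed outputs."""
    rng = random.Random(seed)

    def rand_mat():
        return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(dim)]

    Wq, Wk, Wv = rand_mat(), rand_mat(), rand_mat()
    xs = [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(steps)]
    cache = KVCache()
    max_err = 0.0
    for t, x in enumerate(xs):
        cached = cache.step(x, Wq, Wk, Wv)
        reference = full_recompute(xs[: t + 1], Wq, Wk, Wv)
        max_err = max(max_err, max(abs(a - b) for a, b in zip(cached, reference)))
    return max_err
```

Calling `demo()` compares the cached path against full recomputation at every decoding step; the two should agree, since the last token only ever attends to itself and earlier tokens. The saving is in work per step: the cached path does one key and one value projection, while recomputation does one per prefix token.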
