ML Frameworks

heavyweights · velocity + momentum

karpathy/nanochat

+230 ★/day→steady

Karpathy's minimal LLM training harness turns a $43K 2019 training run into a sub-$100 afternoon project.

★ 54.7k Python Language Models · explained

rasbt/LLMs-from-scratch

+92 ★/day→steady

A step-by-step PyTorch walkthrough that trains a small-but-real LLM on an ordinary laptop, without relying on external LLM libraries.

★ 96.8k Jupyter Notebook Language Models · explained

maderix/ANE

+67 ★/day→steady

A weekend research hack that reverse-engineers private APIs to run backpropagation on the Apple Neural Engine, which Apple reserves for inference only.

★ 6.7k Objective-C ML Frameworks · explained

unslothai/unsloth

+72 ★/day→steady

Unsloth Studio wraps training, inference, and RL into a single web UI with aggressive memory optimizations.

★ 66k Python Inference · Serving · explained

hiyouga/LlamaFactory

+65 ★/day→steady

One framework claims to handle 100+ LLMs and VLMs through a zero-code CLI and a web UI, backed by enough quantization methods to make a compression engineer weep.

★ 72k Python ML Frameworks · explained

exo-explore/exo

+63 ★/day→steady

exo automatically clusters your Apple devices to run frontier models that won't fit on one machine, using Thunderbolt like a datacenter backplane.

★ 45.2k Python Inference · Serving · explained

jingyaogong/minimind-o

+47 ★/day→steady

MiniMind-O is a from-scratch Omni implementation small enough to train in ~2 hours on a single RTX 3090, designed for developers who want to understand the full pipeline rather than download a black box.

★ 1.8k Python Language Models · explained

angelos-p/llm-from-scratch

+47 ★/day→steady

A stripped-down workshop that trades GPT-2 scale for the clarity of writing every transformer component yourself.

★ 3k Learning · explained

huggingface/transformers

+58 ★/day→steady

Hugging Face's Transformers library became the de facto standard for model definitions by being the boring part everyone agrees on.

★ 161.4k Python Language Models · explained

huggingface/open-r1

+52 ★/day→steady

A community effort to reverse-engineer and openly reproduce the training pipeline behind DeepSeek-R1, DeepSeek's reasoning model.

★ 26k Python Language Models · explained

Lightricks/LTX-2

+46 ★/day→steady

Lightricks open-sources the full inference stack and LoRA trainer for their DiT-based audio-video model, complete with camera-control LoRAs and HDR output pipelines.

★ 7.2k Python Image · Video · Audio · explained

lucas-maes/le-wm

+43 ★/day→steady

LeWorldModel cuts JEPA training from six loss hyperparameters to one, then plans 48× faster than foundation-model competitors.

★ 3.7k Python Agents · explained

tensorflow/tensorflow

+51 ★/day→steady

Google's ML framework wants to be the entire pipeline, not just the model.

★ 195.6k C++ ML Frameworks · explained

Orchestra-Research/AI-Research-SKILLs

+43 ★/day→steady

A structured prompt library that teaches Claude Code, Codex, or Gemini how to run the full ML research lifecycle, from literature review to LaTeX.

★ 9.4k TeX Agents · explained

karpathy/nanoGPT

+47 ★/day→steady

A deliberately minimal GPT-2 implementation that taught a generation how transformers work, now officially succeeded by nanochat.

★ 59.3k Python Language Models · explained

ultralytics/ultralytics

+43 ★/day→steady

Ultralytics turned the classic YOLO object detector into a unified computer-vision Swiss Army knife you can train via CLI or Python.

★ 58.1k Python Computer Vision · explained

GeeeekExplorer/nano-vllm

+38 ★/day→steady

A from-scratch inference engine that trades the kitchen sink for a readable codebase without tanking throughput.

★ 13.9k Python Inference · Serving · explained

sapientinc/HRM

+37 ★/day→steady

HRM replaces Chain-of-Thought with a brain-inspired recurrent architecture that plans slowly and computes fast, all in one forward pass.

★ 12.5k Python Language Models · explained

karpathy/llm.c

+38 ★/day→steady

A from-scratch LLM trainer that ditches 245MB of PyTorch dependencies for raw C/CUDA, and somehow runs slightly faster.

★ 30.1k Cuda Language Models · explained

facebookresearch/dinov3

+35 ★/day→steady

DINOv3 is a family of self-supervised vision backbones designed to produce high-quality dense features for everything from semantic segmentation to satellite canopy mapping, often beating task-specialized models out of the box.

★ 10.6k Jupyter Notebook Computer Vision · explained
