ML Frameworks

newcomers · velocity + momentum
01
karpathy/nanochat
+230 ★/day · steady

Karpathy's minimal LLM training harness turns a $43K 2019 training run into a sub-$100 afternoon project.

54.7k Python Language Models · explained
02
rasbt/LLMs-from-scratch
+92 ★/day · steady

A step-by-step PyTorch walkthrough that trains a small-but-real LLM on ordinary laptops, no external libraries allowed.

96.8k Jupyter Notebook Language Models · explained
03
maderix/ANE
+67 ★/day · steady

A weekend research hack that reverse-engineers private APIs to run backpropagation on the Neural Engine, which Apple reserves for inference only.

6.7k Objective-C ML Frameworks · explained
04
unslothai/unsloth
+72 ★/day · steady

Unsloth Studio wraps training, inference, and RL into a single web UI with aggressive memory optimizations.

66k Python Inference · Serving · explained
05
exo-explore/exo
+63 ★/day · steady

exo automatically clusters your Apple devices to run frontier models that won't fit on one machine, using Thunderbolt like a datacenter backplane.

45.2k Python Inference · Serving · explained
06
hiyouga/LlamaFactory
+65 ★/day · steady

One framework claims to handle 100+ LLMs and VLMs with a zero-code CLI and a web UI, backed by enough quantization methods to make a compression engineer weep.

72k Python ML Frameworks · explained
07
jingyaogong/minimind-o
+47 ★/day · steady

MiniMind-O is a from-scratch Omni implementation small enough to train in ~2 hours on a single RTX 3090, designed for developers who want to understand the full pipeline rather than download a black box.

1.8k Python Language Models · explained
08
angelos-p/llm-from-scratch
+47 ★/day · steady

A stripped-down workshop that trades GPT-2 scale for the clarity of writing every transformer component yourself.

3k Learning · explained
09
huggingface/open-r1
+52 ★/day · steady

A community effort to reverse-engineer and openly reproduce the training pipeline behind DeepSeek's famous reasoning model.

26k Python Language Models · explained
10
huggingface/transformers
+58 ★/day · steady

Hugging Face's Transformers library became the de facto standard for model definitions by being the boring part everyone agrees on.

161.4k Python Language Models · explained
11
Lightricks/LTX-2
+46 ★/day · steady

Lightricks open-sources the full inference stack and LoRA trainer for their DiT-based audio-video model, complete with camera-control LoRAs and HDR output pipelines.

7.2k Python Image · Video · Audio · explained
12
lucas-maes/le-wm
+43 ★/day · steady

LeWorldModel cuts JEPA training from six loss hyperparameters to one, then plans 48× faster than foundation-model competitors.

3.7k Python Agents · explained
13

A structured prompt library that teaches Claude Code, Codex, or Gemini how to run the full ML research lifecycle — from literature review to LaTeX.

9.4k TeX Agents · explained
14
karpathy/nanoGPT
+47 ★/day · steady

A deliberately minimal GPT-2 implementation that taught a generation how transformers work, now officially succeeded by nanochat.

59.3k Python Language Models · explained
16
ultralytics/ultralytics
+43 ★/day · steady

Ultralytics turned the classic object detector into a unified computer-vision Swiss Army knife you can train via CLI or Python.

58.1k Python Computer Vision · explained
18
sapientinc/HRM
+37 ★/day · steady

HRM replaces Chain-of-Thought with a brain-inspired recurrent architecture that plans slowly and computes fast, all in one forward pass.

12.5k Python Language Models · explained
19
karpathy/llm.c
+38 ★/day · steady

A from-scratch LLM trainer that ditches 245MB of PyTorch dependencies for raw C/CUDA, and somehow runs slightly faster.

30.1k Cuda Language Models · explained
20
facebookresearch/dinov3
+35 ★/day · steady

DINOv3 is a family of self-supervised vision backbones designed to produce high-quality dense features for everything from semantic segmentation to satellite canopy mapping, often beating task-specialized models out of the box.

10.6k Jupyter Notebook Computer Vision · explained