ML Frameworks

underdogs breaking out

+14% /wk +13 ★/day↗accelerating

It split off from `verl` to give diffusion, video, and omni-modality models an RL post-training framework that doesn't treat them like chatbots.

★ 651 Python ML Frameworks · explained

KEV0143/Comparative-analysis-of-hourly-load-forecasting-using-PatchTST-TFT-NHiTS-and-CatBoost

+13% /wk +28 ★/day↗accelerating

Pits PatchTST, TFT, and N-HiTS against CatBoost to help energy markets pick a forecasting model.

★ 1.5k Python Domain Apps · explained

google-research/tabfm

+12% /wk +37 ★/day↘cooling

TabFM exists so you can run classification and regression on messy, mixed-type tables without retraining a model on your data.

★ 2.1k Python Language Models · explained

NVIDIA-NeMo/Automodel

+12% /wk +13 ★/day↗accelerating

NeMo AutoModel automates the busywork of wiring HuggingFace LLMs and VLMs into PyTorch-native distributed training so you can fine-tune or pretrain without hand-rolling parallelism code.

★ 770 Python ML Frameworks · explained

kvcache-ai/ktransformers

+8.7% /wk +237 ★/day↗accelerating

KTransformers makes CPU-GPU heterogeneous inference and fine-tuning for massive MoE models almost practical on consumer hardware.

★ 19k Python Inference · Serving · explained

ForceInjection/AI-fundamentals

+7.9% /wk +22 ★/day↗accelerating

Curated technical deep-dives covering everything from NVLink signal integrity to Kubernetes GPU scheduling and Huawei NPU porting.

★ 2k HTML Learning · explained

google-ai-edge/LiteRT

+6.4% /wk +29 ★/day↗accelerating

Google's edge ML runtime grows up, adds async NPU support, and finally admits PyTorch exists.

★ 3.2k C++ Inference · Serving · explained

redai-infra/Relax

+6.1% /wk +4.7 ★/day→steady

This engine decouples RL training and inference into independent GPU services, letting text, vision, and audio models post-train asynchronously at scale.

★ 538 Python ML Frameworks · explained

mlc-ai/modern-gpu-programming-for-mlsys

+5.7% /wk +8.7 ★/day→steady

This book teaches modern GPU kernel programming as a hardware-first progression, using the Blackwell architecture and a Python IR DSL called TIRx to move from concepts to production-grade kernels.

★ 1.1k HTML Learning · explained

CalvinXKY/InfraTech

+5.5% /wk +25 ★/day↗accelerating

A Chinese-language notebook curriculum that teaches LLM inference engineering by rebuilding vLLM and SGLang internals in Python.

★ 3.1k Jupyter Notebook Inference · Serving · explained

datawhalechina/every-embodied

+4.6% /wk +19 ★/day↗accelerating

Chinese open-source community Datawhale built a from-zero embodied-AI course that gets you from `print('hello')` to fine-tuning SmolVLA and Pi0.

★ 2.9k Python Learning · explained

jingyaogong/minimind-o

+4.6% /wk +14 ★/day↗accelerating

MiniMind-O packs listen-see-speak intelligence into a 0.1B-parameter model you can retrain from the first line of code on a single desktop GPU.

★ 2.2k Python Language Models · explained

Tencent-Hunyuan/UniRL

+3.5% /wk +4.3 ★/day→steady

One framework to run RL post-training on diffusion, autoregressive, and hybrid models without rewriting the orchestration layer each time.

★ 854 Python ML Frameworks · explained

FareedKhan-dev/train-llm-from-scratch

+3.1% /wk +39 ★/day↗accelerating

A PyTorch implementation of "Attention Is All You Need" that scales from 13M to multi-billion parameter models.

★ 8.7k Python Language Models · explained

google/magika

+2.6% /wk +65 ★/day↗accelerating

A tiny deep-learning model that guesses what a file actually contains, not just what its extension claims.

★ 17.7k Python Other AI · explained

Orchestra-Research/AI-Research-SKILLs

+2.4% /wk +38 ★/day↗accelerating

A library of 98 structured skill packs that turns coding agents into end-to-end AI researchers, from distributed training to LaTeX submission.

★ 11.1k TeX Agents · explained

fla-org/flash-linear-attention

+2.3% /wk +18 ★/day↗accelerating

It corrals the latest subquadratic sequence-model research into hardware-efficient, training-ready PyTorch layers verified across NVIDIA, AMD, and Intel GPUs.

★ 5.4k Python ML Frameworks · explained

RLinf/RLinf

+2.3% /wk +14 ★/day↗accelerating

RLinf exists because fine-tuning policies for robots and agents still requires rewriting your training stack for every new simulator, world model, or hardware rig.

★ 4.3k Python Agents · explained

Blaizzy/mlx-vlm

+2.2% /wk +16 ★/day↗accelerating

MLX-VLM crams speculative decoding, continuous batching, and KV cache quantization into a Mac-native toolkit for running multimodal models locally.

★ 5.3k Python Image · Video · Audio · explained

radixark/miles

+2.1% /wk +5.4 ★/day↘cooling

A production-hardened fork of slime that keeps massive MoE models from collapsing by obsessing over bit-wise alignment between rollout and training.

★ 1.8k Python ML Frameworks · explained

loading more…