Language Models

big names on the move

+225 ★/day↗accelerating

Kronos recasts noisy, multi-dimensional candlestick data as hierarchical discrete tokens so an autoregressive Transformer can forecast financial markets like a language model.

★ 33.8k Python Language Models · explained

Zackriya-Solutions/meetily

+169 ★/day↘cooling

Meetily transcribes and summarizes meetings entirely on-device, because "we don't log your calls" is a promise best kept by physics.

★ 26.6k Rust Language Models · explained Feature

TauricResearch/TradingAgents

+139 ★/day↘cooling

A research framework that assigns LLMs to trading-floor roles—analyst, researcher, trader, risk manager—to debate and execute simulated stock decisions.

★ 94.5k Python Agents · explained Feature

anthropics/claude-cookbooks

+110 ★/day↘cooling

Official Jupyter notebooks demonstrating how to wire Claude into production tasks like RAG, SQL queries, and multimodal pipelines.

★ 49.9k Jupyter Notebook Learning · explained

ggml-org/llama.cpp

+104 ★/day↘cooling

It exists to run large language models on virtually any hardware—from Apple Silicon to RISC-V to your browser—with zero external dependencies and minimal setup.

★ 121.6k C++ Inference · Serving · explained

BerriAI/litellm

+103 ★/day↗accelerating

LiteLLM exists because swapping from GPT-4o to Claude shouldn't require rewriting your request plumbing.

★ 54.7k Python LLMOps · Eval · explained

lyogavin/airllm

+98 ★/day↘cooling

AirLLM slices giant transformers into layer shards so they fit in consumer VRAM without quantization or distillation.

★ 24k Jupyter Notebook Inference · Serving · explained

google/langextract

+90 ★/day↗accelerating

LangExtract exists because asking an LLM to pull names and dates out of a report is easy; proving exactly which sentence each came from is the hard part.

★ 37.8k Python Data Tooling · explained

jingyaogong/minimind

+72 ★/day↗accelerating

MiniMind is an educational training ground that rebuilds every stage of a modern language model—from tokenizer to RLHF—in raw PyTorch so you can see the gears turning instead of just calling high-level APIs.

★ 53.8k Python Language Models · explained

rasbt/LLMs-from-scratch

+70 ★/day↗accelerating

It teaches how LLMs work by implementing tokenization, attention, pretraining, and finetuning in pure PyTorch, one notebook at a time.

★ 99.8k Jupyter Notebook Language Models · explained

ollama/ollama

+69 ★/day→steady

It exists so you can download, run, and chat with open-weight LLMs locally through one CLI and REST API, keeping inference on your own silicon.

★ 176.9k Go Inference · Serving · explained

openai/whisper

+52 ★/day↘cooling

Whisper gives developers a single, general-purpose speech model that handles transcription, translation, and language identification by treating tasks as tokens to predict.

★ 105.6k Python Image · Video · Audio · explained

microsoft/graphrag

+49 ★/day↗accelerating

GraphRAG exists to give LLMs a structured memory layer for reasoning over messy, private narrative text.

★ 34.8k Python RAG · Search · explained

p-e-w/heretic

+40 ★/day↘cooling

It automates the removal of transformer safety alignment so you don't have to hand-tune abliteration parameters or pay for expensive post-training.

★ 26.7k Python Language Models · explained

sgl-project/sglang

+40 ★/day↗accelerating

SGLang exists to push low-latency, high-throughput inference for LLMs and multimodal models from a single GPU up to massive clusters.

★ 30.7k Python Inference · Serving · explained

agentscope-ai/agentscope

+38 ★/day↗accelerating

A Python framework for building production multi-agent systems that leans on LLM reasoning instead of rigid prompt choreography.

★ 28.3k Python Agents · explained

huggingface/transformers

+38 ★/day↗accelerating

It centralizes model definitions so the same architecture works across PyTorch, JAX, vLLM, and llama.cpp without rewrites.

★ 163k Python Language Models · explained

karpathy/nanoGPT

+34 ★/day↗accelerating

A rewrite of minGPT that prioritizes working, hackable training code over educational scaffolding.

★ 61.5k Python Language Models · explained

karpathy/nanochat

+34 ★/day↗accelerating

nanochat is a minimal, hackable harness that lets you train and chat with a GPT-2-class LLM on a single GPU node for under $100—no hyperparameter spreadsheets required.

★ 56.6k Python Language Models · explained

mudler/LocalAI

+28 ★/day↗accelerating

LocalAI wraps 36+ inference engines behind one OpenAI-compatible API and pulls them on demand, so you can run LLMs, vision, voice, and video on anything from a CPU to a Jetson.

★ 47.9k Go Inference · Serving · explained
