Language Models

Language Models

newcomers · gaining speed
02
lyogavin/airllm
+162 ★/dayaccelerating

AirLLM slices giant transformers into layer shards so they fit in consumer VRAM without quantization or distillation.

19.8k Jupyter Notebook Inference · Serving · explained
03
Sumanth077/Hands-On-AI-Engineering
+106 ★/dayaccelerating

Twenty-five bite-sized projects showing how to wire up LLMs, RAG, and agents into things that actually do work.

2k Python Learning · explained
04
TauricResearch/TradingAgents
+359 ★/dayaccelerating

It takes a village of agents to buy a stock—analysts, debaters, risk managers, and a portfolio manager who actually says no.

85k Python Agents · explained
05
NVlabs/Eagle
+61 ★/dayaccelerating

Eagle is less a single model than NVIDIA's internal R&D pipeline for multimodal AI, now open-sourced with three generations of VLMs and a grounding specialist.

2.4k Python Language Models · explained
06
ggml-org/llama.cpp
+222 ★/dayaccelerating

A dependency-free C/C++ inference engine that squeezes large language models onto laptops, phones, and browsers through aggressive quantization and hand-rolled kernels.

116k C++ Inference · Serving · explained
07
agentscope-ai/agentscope
+87 ★/dayaccelerating

AgentScope 2.0 bets that modern LLMs need less hand-holding, not more orchestration.

26.7k Python Agents · explained
08
tile-ai/TileRT
+36 ★/dayaccelerating

TileRT squeezes millisecond-level latency out of hundred-billion-parameter models by decomposing operators into tile-level tasks and overlapping compute, I/O, and communication across 8 GPUs.

1.3k Python Inference · Serving · explained
09
BerriAI/litellm
+115 ★/dayaccelerating

LiteLLM is the adapter layer that stops your codebase from fracturing across a dozen provider SDKs.

50k Python LLMOps · Eval · explained
10
openai/whisper
+158 ★/dayaccelerating

OpenAI's Whisper replaces the usual Rube Goldberg pipeline of speech-processing tools with a single Transformer trained to do it all.

102.4k Python Image · Video · Audio · explained
11
Project-N-E-K-O/N.E.K.O
+21 ★/dayaccelerating

An AI companion platform that remembers, feels, and stares at your screen—now with a Steam release and a 1000-year SSL certificate.

1.3k Python Agents · explained
12
Exorust/TorchLeet
+21 ★/dayaccelerating

A notebook-based workout plan for PyTorch fluency, from linear regression up to building LLM components from scratch.

2.2k Jupyter Notebook Learning · explained
13
tmylla/Awesome-LLM4Cybersecurity
+14 ★/dayaccelerating

A living literature review that tracks whether researchers are using language models to attack, defend, or just benchmark each other.

1.6k Learning · explained
14
mengxi-ream/read-frog
+44 ★/dayaccelerating

Read Frog overlays AI translations, explanations, and text-to-speech onto any webpage so you can learn while you browse.

7.7k TypeScript Domain Apps · explained
15
agentscope-ai/agentscope-java
+36 ★/dayaccelerating

AgentScope Java wraps ReAct agents in the kind of runtime controls, sandboxes, and observability that enterprise deployments actually need.

3.7k Java Agents · explained
16
NVIDIA-NeMo/Nemotron
+19 ★/dayaccelerating

Complete, reproducible pipelines from raw data to deployment-ready Nemotron models, with a modular CLI that lets you remix stages like LEGO bricks.

1.4k Jupyter Notebook Language Models · explained
17
ml-explore/mlx-lm
+32 ★/dayaccelerating

A purpose-built inference and fine-tuning stack that treats M-series chips as first-class citizens instead of afterthoughts.

5.7k Python Language Models · explained
18
google-ai-edge/LiteRT-LM
+32 ★/dayaccelerating

A C++ inference engine built to run Gemma, Llama, and friends on everything from Raspberry Pi to Pixel Watch—because the cloud is sometimes just too far away.

5.5k C++ Inference · Serving · explained
19
liyupi/ai-guide
+63 ★/dayaccelerating

Curated tutorials, tool reviews, and monetization playbooks for coding with AI—written by one prolific developer and open to all.

15.5k JavaScript Learning · explained
20
Osilly/Vision-R1
+9.3 ★/dayaccelerating

Vision-R1 applies DeepSeek-R1's reinforcement-learning recipe to multimodal models, with a staged training trick that gradually loosens the leash on reasoning length.

1.3k Python Language Models · explained
loading more…

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.