LLMOps · Eval

newcomers · velocity + momentum
01
garrytan/gstack
+1225 ★/day · steady

Garry Tan open-sourced the exact Claude Code prompts he uses to ship 810× faster while running Y Combinator full-time.

108.1k TypeScript Coding Assistants · explained
02
JuliusBrussee/caveman
+1079 ★/day · steady

Claude Code skill makes the agent talk like a caveman and claims ~65% fewer output tokens, with benchmarks to back it up.

69.8k JavaScript Coding Assistants · explained
03
karpathy/autoresearch
+918 ★/day · steady

An AI agent that edits, trains, and evaluates LLM code overnight so you don't have to.

85.5k Python Agents · explained
04
santifer/career-ops
+775 ★/day · steady

A developer turned months of manual applications into an AI agent pipeline that evaluates, scores, and tailors CVs for each listing.

49.8k JavaScript Agents · explained
05
paperclipai/paperclip
+713 ★/day · steady

Paperclip is an open-source control plane that turns a swarm of AI agents into something resembling an actual company.

69.5k TypeScript Agents · explained
06
NousResearch/hermes-agent
+581 ★/day · steady

Hermes Agent is a self-improving AI assistant that creates skills from experience, persists knowledge across sessions, and runs anywhere from a $5 VPS to serverless cloud.

186k Python Agents · explained
07
rtk-ai/rtk
+438 ★/day · steady

rtk sits between your AI agent and the shell, compressing command output before it ever hits context.

59.8k Rust Coding Assistants · explained
08

A free, from-scratch curriculum that makes you build backprop before you touch PyTorch, then ships every lesson as a reusable prompt or agent.

29.9k Python Learning · explained
09
koala73/worldmonitor
+373 ★/day · steady

A single TypeScript codebase that turns 500+ news feeds and 65+ data APIs into a geopolitical intelligence dashboard with local AI.

56k TypeScript LLMOps · Eval · explained
10
farion1231/cc-switch
+306 ★/day · steady

Desktop app that corrals Claude Code, Codex, Gemini CLI and half a dozen other AI agents into one switchboard.

94.2k Rust Coding Assistants · explained
11

Claude Code skills that run your research through a full academic pipeline—research, writing, staged integrity checks, and multi-perspective review—while keeping a human in the driver's seat.

28.6k Python Coding Assistants · explained
12
ZhuLinsen/daily_stock_analysis
+277 ★/day · steady

A Python system that scrapes market data, asks an LLM for a verdict, and pushes buy/sell/emoji dashboards to Slack or WeChat on a cron job.

41.2k Python Agents · explained
14
NVIDIA/NemoClaw
+249 ★/day · steady

A reference stack that cages always-on agents like Hermes and OpenClaw inside hardened OpenShell sandboxes with managed inference and network policy.

21k TypeScript Agents · explained
15
AlexsJones/llmfit
+245 ★/day · steady

A Rust TUI that scores hundreds of models against your actual hardware so you stop downloading 70B weights onto a laptop.

27.6k Rust LLMOps · Eval · explained
16
mvanhorn/last30days-skill
+231 ★/day · steady

A skill that turns Claude, Cursor, or any agent host into a social-native research analyst, scoring signals by upvotes, likes, and actual money wagered.

31.2k Python Coding Assistants · explained
17

A crowdsourced library of copy-paste academic writing prompts, gathered from researchers at MSRA, ByteDance Seed, and top Chinese universities.

27.6k Learning · explained
18
THU-MAIC/OpenMAIC
+207 ★/day · steady

OpenMAIC turns any topic into a multi-agent classroom where AI teachers lecture, peers debate, and everyone draws on the whiteboard.

18.4k TypeScript Agents · explained
19
datawhalechina/hello-agents
+209 ★/day · steady

Datawhale's open-source curriculum wants to turn LLM users into agent builders, not just prompt engineers.

57.3k Python Learning · explained
20
microsoft/SkillOpt
+173 ★/day · steady

Microsoft's SkillOpt treats a markdown skill document as the trainable parameter of a frozen LLM agent, complete with epochs, batching, and validation gates.

5.3k Python Agents · explained