Agents

heavyweights · gaining speed

+0.1 ★/day→steady

An educational reinforcement learning project where an agent learns to buy and sell a single stock through rewards and penalties, not explicit instructions.

★ 518 Jupyter Notebook Domain Apps · explained

vladfi1/phillip

+0.2 ★/day→steady

Phillip is a Super Smash Bros. Melee AI trained with deep reinforcement learning to brawl in Dolphin emulator—though its creator has since moved on to imitation learning.

★ 584 Python Agents · explained

miyosuda/async_deep_reinforce

+0.2 ★/day→steady

A straightforward TensorFlow implementation of A3C that trains Pong agents for 26 hours and actually shows its work.

★ 588 Python Agents · explained

BotLibre/BotLibre

+0.2 ★/day→steady

A Java-based bot platform that predates the LLM era and still runs on JUnit and Objective-C.

★ 635 Java Agents · explained

microsoft/psi

+0.2 ★/day→steady

A C# toolkit for wiring up sensors, AI models, and actuators when latency actually matters.

★ 570 C# Agents · explained

hi-abhi/tensorflow-value-iteration-networks

+0.2 ★/day→steady

A TensorFlow port of NIPS 2016's Best Paper, embedding value iteration directly inside a neural network for grid-world navigation.

★ 550 Python ML Frameworks · explained

davechurchill/commandcenter

+0.2 ★/day→steady

A teaching-oriented AI bot that abstracts away the differences between Brood War and SC2 so you can focus on strategy, not API archaeology.

★ 540 C++ Agents · explained

ardamavi/Game-Bot

+0.2 ★/day→steady

A Keras/TensorFlow project that learns your keyboard and mouse habits by watching you play, then attempts to mimic them.

★ 550 Python Agents · explained

KaiyangZhou/pytorch-vsumm-reinforce

+0.2 ★/day→steady

A PyTorch reimplementation of an AAAI 2018 paper that frames video summarization as a reinforcement learning problem, rewarding diversity and representativeness instead of ground-truth labels.

★ 504 Python Computer Vision · explained

glample/Arnold

+0.2 ★/day→steady

A PyTorch DOOM bot that won the ViZDoom AI Competition by learning to frag with deep reinforcement learning.

★ 536 Python Agents · explained

xwhan/DeepPath

+0.2 ★/day→steady

A 2017 RL framework that treats multi-hop KG reasoning as pathfinding with embedding-based states and a reward function that cares about accuracy, diversity, and efficiency—not just getting there.

★ 563 Python Agents · explained

MycroftAI/adapt

+0.2 ★/day→steady

A lightweight, rule-based intent parser for voice assistants that trades ML complexity for explicit control.

★ 721 Python Agents · explained

theopenconversationkit/tock

+0.2 ★/day→steady

Tock is an open-source conversational AI platform for teams who want to build bots without surrendering their data to a SaaS black box.

★ 606 Kotlin Chat Assistants · explained

inoryy/reaver

+0.2 ★/day→steady

Reaver squeezed 1.5x sampling speed from single-machine setups by ditching MPI for lock-free shared memory, then the author walked away.

★ 561 Python Agents · explained

CR-Gjx/LeakGAN

+0.2 ★/day→steady

A 2018 AAAI paper that fixes the classic GAN problem for text generation—scalar rewards arriving too late—by letting the discriminator leak its own hidden features mid-generation.

★ 576 Python Language Models · explained

aleju/mario-ai

+0.2 ★/day→steady

A 2016-era DQN agent that learns Super Mario World from raw pixels, with a Spatial Transformer to focus on what matters.

★ 695 Lua Agents · explained

hellostealth/stealth

+0.2 ★/day→steady

A Ruby framework that treats bot-building like web development, with MVC architecture and Redis-backed state machines.

★ 597 Ruby Chat Assistants · explained

samtecspg/articulate

+0.2 ★/day→steady

An enterprise team's attempt to make Rasa NLU actually deployable without a PhD.

★ 593 JavaScript Chat Assistants · explained

germain-hug/Deep-RL-Keras

+0.2 ★/day→steady

A tidy reference implementation of five major deep RL algorithms, pinned to a very specific Keras version from 2018.

★ 550 Python Agents · explained

SarvagyaVaish/FlappyBirdRL

+0.2 ★/day→steady

A browser-based reinforcement learning demo that teaches a bird to fly by dying repeatedly.

★ 920 JavaScript Agents · explained

loading more…