Agents

heavyweights · gaining speed
01
ucaiado/QLearning_Trading
+0.1 ★/day · steady

An educational reinforcement learning project where an agent learns to buy and sell a single stock through rewards and penalties, not explicit instructions.

518 Jupyter Notebook Domain Apps · explained
02
vladfi1/phillip
+0.2 ★/day · steady

Phillip is a Super Smash Bros. Melee AI trained with deep reinforcement learning to brawl in the Dolphin emulator, though its creator has since moved on to imitation learning.

584 Python Agents · explained
03
miyosuda/async_deep_reinforce
+0.2 ★/day · steady

A straightforward TensorFlow implementation of A3C that trains Pong agents for 26 hours and actually shows its work.

588 Python Agents · explained
04
BotLibre/BotLibre
+0.2 ★/day · steady

A Java-based bot platform that predates the LLM era and still runs on JUnit and Objective-C.

635 Java Agents · explained
05
microsoft/psi
+0.2 ★/day · steady

A C# toolkit for wiring up sensors, AI models, and actuators when latency actually matters.

570 C# Agents · explained
07
davechurchill/commandcenter
+0.2 ★/day · steady

A teaching-oriented AI bot that abstracts away the differences between Brood War and SC2 so you can focus on strategy, not API archaeology.

540 C++ Agents · explained
08
ardamavi/Game-Bot
+0.2 ★/day · steady

A Keras/TensorFlow project that learns your keyboard and mouse habits by watching you play, then attempts to mimic them.

550 Python Agents · explained
09

A PyTorch reimplementation of an AAAI 2018 paper that frames video summarization as a reinforcement learning problem, rewarding diversity and representativeness instead of ground-truth labels.

504 Python Computer Vision · explained
10
glample/Arnold
+0.2 ★/day · steady

A PyTorch DOOM bot that won the ViZDoom AI Competition by learning to frag with deep reinforcement learning.

536 Python Agents · explained
11
xwhan/DeepPath
+0.2 ★/day · steady

A 2017 RL framework that treats multi-hop knowledge graph (KG) reasoning as pathfinding, with embedding-based states and a reward function that cares about accuracy, diversity, and efficiency, not just getting there.

563 Python Agents · explained
12
MycroftAI/adapt
+0.2 ★/day · steady

A lightweight, rule-based intent parser for voice assistants that trades ML complexity for explicit control.

721 Python Agents · explained
13
theopenconversationkit/tock
+0.2 ★/day · steady

Tock is an open-source conversational AI platform for teams who want to build bots without surrendering their data to a SaaS black box.

606 Kotlin Chat Assistants · explained
14
inoryy/reaver
+0.2 ★/day · steady

Reaver squeezed 1.5x sampling speed from single-machine setups by ditching MPI for lock-free shared memory, then the author walked away.

561 Python Agents · explained
15
CR-Gjx/LeakGAN
+0.2 ★/day · steady

A 2018 AAAI paper that fixes the classic GAN problem for text generation—scalar rewards arriving too late—by letting the discriminator leak its own hidden features mid-generation.

576 Python Language Models · explained
16
aleju/mario-ai
+0.2 ★/day · steady

A 2016-era DQN agent that learns Super Mario World from raw pixels, with a Spatial Transformer to focus on what matters.

695 Lua Agents · explained
17
hellostealth/stealth
+0.2 ★/day · steady

A Ruby framework that treats bot-building like web development, with MVC architecture and Redis-backed state machines.

597 Ruby Chat Assistants · explained
19
germain-hug/Deep-RL-Keras
+0.2 ★/day · steady

A tidy reference implementation of five major deep RL algorithms, pinned to a very specific Keras version from 2018.

550 Python Agents · explained
20
SarvagyaVaish/FlappyBirdRL
+0.2 ★/day · steady

A browser-based reinforcement learning demo that teaches a bird to fly by dying repeatedly.

920 JavaScript Agents · explained