TJU-DRL-LAB/AI-Optimizer
A deep reinforcement learning toolkit providing algorithm libraries for model-free, model-based, and multi-agent RL training.

Velocity · 7d
+2.2
★ / day
Trend
→steady
star history
AI-Optimizer is a comprehensive deep reinforcement learning framework implementing diverse RL algorithms spanning model-free (PPO, DQN), model-based, and multi-agent approaches. It supports transfer learning, offline RL, and self-supervised representation learning. The toolkit includes a distributed training framework for scalable policy training across multiple agents and environments.