DLR-RM/stable-baselines3
PyTorch library providing production-ready implementations of reinforcement learning algorithms including PPO, SAC, A2C, TD3, and DQN.

Stable Baselines3 (SB3) is the PyTorch-based successor to the original Stable Baselines library, offering reliable implementations of reinforcement learning algorithms. It provides well-tested, easy-to-use RL algorithm implementations that serve as research-grade baselines. The library supports environments compatible with OpenAI Gym and is used for training autonomous agents in robotics, game playing, and other sequential decision-making tasks.