← all repositories

nikhilbarhate99/PPO-PyTorch

PyTorch implementation of Proximal Policy Optimization, a reinforcement learning algorithm for training agents in OpenAI gym environments.

2.4k stars Python AgentsML Frameworks
PPO-PyTorch
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

This repository provides a minimal implementation of PPO with clipped objective for reinforcement learning. It supports both discrete and continuous action spaces and includes utilities for training, testing, logging rewards to CSV, plotting graphs, and generating gifs from pretrained networks. The code is designed primarily for beginners learning reinforcement learning but can be adapted for more complex environments with hyperparameter tuning.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.