sfujim/TD3
A PyTorch implementation of TD3, a deep reinforcement learning algorithm for continuous robot control.

This repository provides a PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradients (TD3), an actor-critic reinforcement learning algorithm designed to address function approximation error. The implementation is tested on MuJoCo continuous control tasks within the OpenAI gym environment. It includes both the TD3 algorithm and a DDPG implementation for comparative purposes, with networks trained on PyTorch and support for configurable hyperparameters.