← all repositories

sfujim/TD3

A PyTorch implementation of TD3, a deep reinforcement learning algorithm for continuous robot control.

2.1k stars Python ML FrameworksDomain Apps
TD3
Velocity · 7d
+0.7
★ / day
Trend
steady
star history

This repository provides a PyTorch implementation of Twin Delayed Deep Deterministic Policy Gradients (TD3), an actor-critic reinforcement learning algorithm designed to address function approximation error. The implementation is tested on MuJoCo continuous control tasks within the OpenAI gym environment. It includes both the TD3 algorithm and a DDPG implementation for comparative purposes, with networks trained on PyTorch and support for configurable hyperparameters.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.