← all repositories

awjuliani/Meta-RL

TensorFlow implementation of Meta-RL A3C algorithm enabling reinforcement learning agents to adapt to new tasks without retraining.

407 stars Jupyter Notebook AgentsML Frameworks
Meta-RL
Velocity · 7d
+0.1
★ / day
Trend
steady
star history

This repository provides a TensorFlow implementation of the Meta-Reinforcement Learning A3C algorithm based on the paper Learning to Reinforce Learn. It includes Jupyter notebooks demonstrating meta-learning across multiple task types: independent and dependent bandit tasks, restless bandits, contextual bandits using color-based reward signals, and a Rainbow Gridworld task where goal colors are randomized each episode requiring agents to learn on the fly.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.