awjuliani/Meta-RL
TensorFlow implementation of Meta-RL A3C algorithm enabling reinforcement learning agents to adapt to new tasks without retraining.

This repository provides a TensorFlow implementation of the Meta-Reinforcement Learning A3C algorithm based on the paper Learning to Reinforce Learn. It includes Jupyter notebooks demonstrating meta-learning across multiple task types: independent and dependent bandit tasks, restless bandits, contextual bandits using color-based reward signals, and a Rainbow Gridworld task where goal colors are randomized each episode requiring agents to learn on the fly.