← all repositories

danijar/daydreamer

A world-model-based reinforcement learning system that trains physical robots from real-world interaction using TensorFlow 2.

440 stars Jupyter Notebook Domain AppsML Frameworks
daydreamer
Velocity · 7d
+0.3
★ / day
Trend
steady
star history

DayDreamer learns compact discrete representations of the environment using recurrent neural networks that predict sequences given actions, reconstructing inputs and predicting rewards from recurrent states. The system trains robots using on-policy reinforcement learning purely inside the learned world model representation space, enabling farsighted behavior without simulators.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.