opendilab/PPOxFamily
An eight-lesson public course teaching Proximal Policy Optimization (PPO) and deep reinforcement learning for decision intelligence.

Velocity · 7d
+2.0
★ / day
Trend
→steady
star history
PPOxFamily is a decision intelligence introductory course teaching deep reinforcement learning through the PPO algorithm family. The course covers theory and implementation of PPO variants, ranging from basic concepts to multi-agent systems and temporal modeling. Course materials include video lectures, code annotations, and homework assignments distributed via HuggingFace.