← all repositories

opendilab/PPOxFamily

An eight-lesson public course teaching Proximal Policy Optimization (PPO) and deep reinforcement learning for decision intelligence.

2.6k stars Python LearningML Frameworks
PPOxFamily
Velocity · 7d
+2.0
★ / day
Trend
steady
star history

PPOxFamily is a decision intelligence introductory course teaching deep reinforcement learning through the PPO algorithm family. The course covers theory and implementation of PPO variants, ranging from basic concepts to multi-agent systems and temporal modeling. Course materials include video lectures, code annotations, and homework assignments distributed via HuggingFace.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.