xhyumiracle/Awesome-AgenticLLM-RL-Papers
An academic survey paper collecting and categorizing Agentic RL algorithms and research for Large Language Models.

Velocity · 7d
+6.4
★ / day
Trend
→steady
star history
This repository hosts the official implementation and documentation for a survey paper reviewing the landscape of Agentic Reinforcement Learning applied to LLMs. It catalogs algorithms including PPO variants and policy gradient methods with detailed comparison tables covering objectives, clipping strategies, KL penalties, and key mechanisms. The paper is published in Transactions on Machine Learning Research and includes full citation metadata.