jiangxinke/Agentic-RAG-R1
An agentic RAG framework that uses GRPO reinforcement learning to train language models for autonomous retrieval and reasoning tasks.

The project builds a Retrieval-Augmented Generation (RAG) system and adds agentic capabilities through reinforcement learning. It trains a base language model with GRPO (Group Relative Policy Optimization) to retrieve documents autonomously, reason over multiple steps, and use tools. The framework includes rollout generation for training, inference pipelines for deployment, and evaluation on benchmarks such as MedQA to measure reasoning performance in the medical domain.
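
To illustrate the core idea behind GRPO, the sketch below computes group-relative advantages: several rollouts are sampled for the same prompt, and each rollout's reward is normalized against the mean and standard deviation of its own group, so no separate value model is needed. This is a minimal illustration, not this repository's code. The function name `group_relative_advantages`, the `eps` parameter, and the reward values are hypothetical, and the use of population standard deviation is an assumption (implementations differ on this detail).

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize each rollout's reward against its own group.

    rewards: scalar rewards for rollouts sampled from the SAME prompt.
    eps: small constant (assumed value) that avoids division by zero
         when every rollout in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption of this sketch
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four rollouts answering one question:
# 1.0 = correct final answer, 0.0 = incorrect.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Rollouts that score above their group's average receive a positive advantage and are reinforced; those below receive a negative one. When every rollout in a group earns the same reward, all advantages are zero and that group contributes no learning signal.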