← all repositories

jiangxinke/Agentic-RAG-R1

An agentic RAG framework that uses GRPO reinforcement learning to train language models for autonomous retrieval and reasoning tasks.

Agentic-RAG-R1
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

The project builds a Retrieval-Augmented Generation system enhanced with agentic capabilities through reinforcement learning. It trains a base language model using the GRPO algorithm to perform autonomous document retrieval, multi-step reasoning, and tool use. The framework includes rollout generation for training, inference pipelines for deployment, and evaluation on benchmarks like MedQA to measure medical domain reasoning performance.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.