← all repositories

pat-jj/s3

A research project training search agents using reinforcement learning with minimal data to improve RAG system efficiency.

s3
Velocity · 7d
+2.0
★ / day
Trend
steady
star history

This repository implements an RL-based approach for training efficient search agents in RAG (Retrieval Augmented Generation) systems. The method uses verifier-based reinforcement learning to optimize search agent behavior with minimal training data. Published at EMNLP 2025, it focuses on improving both the effectiveness and efficiency of retrieval-augmented language model applications.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.