Truncated Step-Level Sampling with Process Rewards for Retrieval-Augmented Reasoning (Paper 2602.23440, published Feb 26)
CoSearch: Joint Training of Reasoning and Document Ranking via Reinforcement Learning for Agentic Search (Paper 2604.17555, published Apr 21)
GrepSeek: Training Search Agents for Direct Corpus Interaction (Paper 2605.29307, published 11 days ago)
SynPlanResearch-R1: Encouraging Tool Exploration for Deep Research with Synthetic Plans (Paper 2603.07853, published Mar 9)