Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces Paper • 2605.02801 • Published May 4 • 9
Rethinking Agentic Search with Pi-Serini: Is Lexical Retrieval Sufficient? Paper • 2605.10848 • Published May 11 • 5