arxiv:2605.05242
ZhuofengLi
AI & ML interests
Agents, Reasoning LLMs/VLLMs, RL
Recent Activity
upvoted a paper about 4 hours ago
Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses
upvoted a paper about 5 hours ago
Masking Stale Observations Helps Search Agents -- Until It Doesn't: A Regime Map and Its Mechanism