LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents Paper • 2606.06087 • Published 10 days ago • 62
Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models Paper • 2606.11025 • Published 5 days ago • 40
FlashMemory-DeepSeek-V4: Lightning Index Ultra-Long Context via Lookahead Sparse Attention Paper • 2606.09079 • Published 6 days ago • 59
SCOPE: Self-Play via Co-Evolving Policies for Open-Ended Tasks Paper • 2605.31433 • Published 16 days ago • 28
World Models Meet Language Models: On the Complementarity of Concrete and Abstract Reasoning Paper • 2606.03603 • Published 11 days ago • 29
Agent Explorative Policy Optimization for Multimodal Agentic Reasoning Paper • 2605.28774 • Published 18 days ago • 90
YoCausal: How Far is Video Generation from World Model? A Causality Perspective Paper • 2605.30346 • Published 17 days ago • 54
minWM: A Full-Stack Open-Source Framework for Real-Time Interactive Video World Models Paper • 2605.30263 • Published 17 days ago • 58