Running on Zero Agents Featured 47 RF-DETR Realtime Webcam Demo 🎯 47 Segment objects in live webcam and uploaded media
Evaluating Large Language Models in Dynamic Clinical Decision-Making with Standardized Patient Cases Paper • 2606.05112 • Published 5 days ago • 3
Where to Look: Can Foundation Models Reach a Target Viewpoint Through Active Exploration? Paper • 2606.01247 • Published 8 days ago • 29
Which Pretraining Paradigm Better Serves Spatial Intelligence? An Empirical Comparison of Vision-Language and Video Generation Models Paper • 2605.28132 • Published 12 days ago • 25
On the Scaling of PEFT: Towards Million Personal Models of Trillion Parameters Paper • 2606.02437 • Published 7 days ago • 220
Running on Zero Agents 15 NV-Generate Synthetic Medical Imaging 🧠 15 Synthetic 3D CT and MR generation with NVIDIA NV-Generate.
Running on Zero Agents Featured 220 LTX 2.3 Studio 🎬 220 Generate videos from text, images, audio, or video clips
Running Agents 101 Omni-Video-Factory-API-iframe 🐠 101 Access video creation tools via an embedded interface
Learning A Unified Risk Map for Autonomous Driving in Partially Observable Environments Paper • 2605.22189 • Published 18 days ago • 8
WorldMemArena: Evaluating Multimodal Agent Memory Through Action-World Interaction Paper • 2605.29341 • Published 11 days ago • 17
Why Far Looks Up: Probing Spatial Representation in Vision-Language Models Paper • 2605.30161 • Published 11 days ago • 60
Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments Paper • 2605.30280 • Published 11 days ago • 139
OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources Paper • 2605.29250 • Published 11 days ago • 77