VibeThinker-3B: Exploring the Frontier of Verifiable Reasoning in Small Language Models Paper • 2606.16140 • Published 6 days ago • 101
Running on Zero Agents Featured 60 Gemma Diffusion Website Builder 🌐 60 Watch a diffusion LLM write a website live, then tweak it
Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs Paper • 2605.09063 • Published May 9 • 81