Junbo Niu

Niujunbo2002

AI & ML interests

Computer vision and pattern recognition

Recent Activity

upvoted a paper 3 days ago

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

Paper • 2606.03890 • Published 5 days ago • 31

upvoted a paper 16 days ago

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Paper • 2605.22012 • Published 17 days ago • 46

upvoted a paper 23 days ago

WildClawBench: A Benchmark for Real-World, Long-Horizon Agent Evaluation

Paper • 2605.10912 • Published 27 days ago • 46

upvoted a paper about 1 month ago

Turning the TIDE: Cross-Architecture Distillation for Diffusion Large Language Models

Paper • 2604.26951 • Published Apr 29 • 48

liked a model about 2 months ago

opendatalab/MinerU2.5-Pro-2604-1.2B

Image-Text-to-Text • 1B • Updated Apr 14 • 639k • 151

upvoted 3 papers 2 months ago

liked a Space 2 months ago

MinerU Diffusion V1 0320 2.5B

🦀

demo of MinerU-Diffusion

liked a model 2 months ago

opendatalab/MinerU-Diffusion-V1-0320-2.5B

Image-to-Text • 3B • Updated Mar 25 • 6k • 23

commented a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 137

published a model 2 months ago

opendatalab/MinerU-Diffusion-V1-0320-2.5B

Image-to-Text • 3B • Updated Mar 25 • 6k • 23

updated a model 2 months ago

opendatalab/MinerU-Diffusion-V1-0320-2.5B

Image-to-Text • 3B • Updated Mar 25 • 6k • 23

upvoted a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 137

upvoted 5 papers 3 months ago

Efficient Reasoning with Balanced Thinking

Paper • 2603.12372 • Published Mar 12 • 151

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published Mar 17 • 110

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published Mar 17 • 60

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 187

OpenClaw-RL: Train Any Agent Simply by Talking

Paper • 2603.10165 • Published Mar 10 • 156

published a model 4 months ago

Niujunbo2002/NativeRes-LLaVA-qwen2-0.5b-qwen2vit-0730-2M-ocr

Updated Jul 30, 2025
