4 36

Cheng Qian

chengq9

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 10 days ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

upvoted a paper 11 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

upvoted a paper 13 days ago

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

View all activity

Organizations

upvoted a paper 10 days ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Paper • 2606.05445 • Published 13 days ago • 7

upvoted a paper 11 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 12 days ago • 40

upvoted a paper 13 days ago

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Paper • 2606.02754 • Published 14 days ago • 13

upvoted a paper 19 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published 22 days ago • 19

submitted a paper to Daily Papers 19 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published 22 days ago • 19

updated a dataset 21 days ago

chengq9/CreativityBench-MM

Viewer • Updated 21 days ago • 1.2k • 97

published a dataset 21 days ago

chengq9/CreativityBench-MM

Viewer • Updated 21 days ago • 1.2k • 97

upvoted 2 papers 27 days ago

Code as Agent Harness

Paper • 2605.18747 • Published 29 days ago • 218

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

updated a dataset about 1 month ago

chengq9/CreativityBench

Viewer • Updated May 7 • 3.29k • 121 • 2

submitted a paper to Daily Papers about 1 month ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 22

upvoted a paper about 1 month ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 22

upvoted 2 papers 2 months ago

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Paper • 2601.11957 • Published Jan 28 • 3

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published Apr 7 • 68

published a dataset 2 months ago

chengq9/CreativityBench

Viewer • Updated May 7 • 3.29k • 121 • 2

upvoted 3 papers 3 months ago

upvoted a collection 4 months ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 12 items • Updated May 12 • 112

upvoted a paper 6 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 31

Cheng Qian

AI & ML interests

Recent Activity

Organizations

chengq9's activity