Cheng Qian

chengq9

https://qiancheng0.github.io

qiancheng0

AI & ML interests

Agent, Tool Learning

Recent Activity

upvoted a paper 11 days ago

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks

Paper • 2606.05445 • Published 14 days ago • 7

upvoted a paper 12 days ago

AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints

Paper • 2606.05622 • Published 13 days ago • 40

upvoted a paper 14 days ago

Ψ-Bench: Evaluating Persona-Sensitive Influencing in Persuasive Dialogues

Paper • 2606.02754 • Published 15 days ago • 13

upvoted a paper 20 days ago

Advancing Creative Physical Intelligence in Large Multimodal Models

Paper • 2605.26396 • Published 23 days ago • 19

upvoted 2 papers 28 days ago

Code as Agent Harness

Paper • 2605.18747 • Published about 1 month ago • 219

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 110

upvoted a paper about 1 month ago

CreativityBench: Evaluating Agent Creative Reasoning via Affordance-Based Tool Repurposing

Paper • 2605.02910 • Published May 6 • 22

upvoted 2 papers 2 months ago

PEARL: Self-Evolving Assistant for Time Management with Reinforcement Learning

Paper • 2601.11957 • Published Jan 28 • 3

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published Apr 7 • 68

upvoted 2 papers 3 months ago

NarrativeTrack: Evaluating Video Language Models Beyond the Frame

Paper • 2601.01095 • Published Jan 3 • 8

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published Mar 9 • 60

upvoted a paper 4 months ago

Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data

Paper • 2602.21320 • Published Feb 24 • 12

upvoted a collection 4 months ago

AgentDoG

Collection

A Diagnostic Guardrail Framework for AI Agent Safety and Security • 12 items • Updated May 12 • 112

upvoted a paper 6 months ago

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Paper • 2512.16649 • Published Dec 18, 2025 • 31

upvoted 2 papers 8 months ago

Multimodal Policy Internalization for Conversational Agents

Paper • 2510.09474 • Published Oct 10, 2025 • 5

Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published Oct 9, 2025 • 10

upvoted 3 papers 9 months ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published Sep 29, 2025 • 12

UserRL: Training Interactive User-Centric Agent via Reinforcement Learning

Paper • 2509.19736 • Published Sep 24, 2025 • 12

Context Engineering for Trustworthiness: Rescorla Wagner Steering Under Mixed and Inappropriate Contexts

Paper • 2509.04500 • Published Sep 2, 2025 • 5

upvoted a paper 10 months ago

The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Paper • 2502.16143 • Published Feb 22, 2025 • 6
