KABI's picture

KABI

dongguanting

·

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper about 19 hours ago

From Prompt Injection to Persistent Control: Defending Agentic Harness Against Trojan Backdoors

upvoted a paper 4 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

upvoted a paper 4 days ago

Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation

View all activity

Organizations

dongguanting 's models 16

dongguanting/Tool-Star-Qwen-1.5B

Text Generation • 2B • Updated Mar 5 • 9 • 2

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated Dec 20, 2025 • 12 • • 2

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated Dec 20, 2025 • 4 • 1

dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated Dec 20, 2025 • 5 • 1

dongguanting/aepo_light

8B • Updated Nov 3, 2025 • 6

dongguanting/Qwen2.5-7B-AEPO

Text Generation • 8B • Updated Oct 27, 2025 • 6 • 1

dongguanting/Qwen3-14B-AEPO-DeepSearch

Robotics • 15B • Updated Oct 21, 2025 • 18 • 1

dongguanting/Qwen2.5-7B-ARPO

Text Generation • 8B • Updated Aug 19, 2025 • 22 • • 2

dongguanting/Llama3.1-8B-ARPO

Text Generation • 8B • Updated Aug 12, 2025 • 5 • 1

dongguanting/Qwen2.5-3B-ARPO

Text Generation • 3B • Updated Aug 12, 2025 • 36 • • 3

dongguanting/Qwen3-14B-ARPO-DeepSearch

Text Generation • 15B • Updated Aug 12, 2025 • 29 • 5

dongguanting/Qwen3-8B-ARPO-DeepSearch

8B • Updated Jul 29, 2025 • 29 • 2

dongguanting/Tool-Star-Qwen-7B

Text Generation • 8B • Updated Jun 30, 2025 • 14 • • 2

dongguanting/RAG-Critic-3B

Text Generation • 3B • Updated Jun 28, 2025 • 13 • • 4

dongguanting/Tool-Star-Qwen-0.5B

Text Generation • 0.6B • Updated Jun 6, 2025 • 9 • 1

dongguanting/Tool-Star-Qwen-3B

Text Generation • 3B • Updated May 25, 2025 • 23 • 5