·
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
Organizations
dongguanting/Tool-Star-Qwen-1.5B
Text Generation
• 2B • Updated • 9
• 2
dongguanting/Qwen3-8B-AEPO-DeepSearch
Text Generation
• 8B • Updated • 12
• • 2
dongguanting/QwQ-32B-AEPO-DeepSearch
Text Generation
• 33B • Updated • 4
• 1
dongguanting/QwQ-32B-ARPO-DeepSearch
33B • Updated • 5
• 1
8B • Updated • 6
dongguanting/Qwen2.5-7B-AEPO
Text Generation
• 8B • Updated • 6
• 1
dongguanting/Qwen3-14B-AEPO-DeepSearch
Robotics
• 15B • Updated • 18
• 1
dongguanting/Qwen2.5-7B-ARPO
Text Generation
• 8B • Updated • 22
• • 2
dongguanting/Llama3.1-8B-ARPO
Text Generation
• 8B • Updated • 5
• 1
dongguanting/Qwen2.5-3B-ARPO
Text Generation
• 3B • Updated • 36
• • 3
dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation
• 15B • Updated • 29
• 5
dongguanting/Qwen3-8B-ARPO-DeepSearch
8B • Updated • 29
• 2
dongguanting/Tool-Star-Qwen-7B
Text Generation
• 8B • Updated • 14
• • 2
dongguanting/RAG-Critic-3B
Text Generation
• 3B • Updated • 13
• • 4
dongguanting/Tool-Star-Qwen-0.5B
Text Generation
• 0.6B • Updated • 9
• 1
dongguanting/Tool-Star-Qwen-3B
Text Generation
• 3B • Updated • 23
• 5