liyaxuan

lllyx

32 4

AI & ML interests

None yet

Recent Activity

upvoted a paper 10 days ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

upvoted a paper 11 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

liked a model 11 days ago

zai-org/GLM-5.2

View all activity

Organizations

None yet

upvoted a paper 10 days ago

EnterpriseClawBench: Benchmarking Agents from Real Workplace Sessions

Paper • 2606.23654 • Published 11 days ago • 79

upvoted a paper 11 days ago

Zone of Proximal Policy Optimization: Teacher in Prompts, Not Gradients

Paper • 2606.18216 • Published 17 days ago • 63

upvoted a paper 17 days ago

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

Paper • 2606.14777 • Published 23 days ago • 208

upvoted 2 papers 30 days ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published May 28 • 36

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16, 2025 • 62

upvoted a paper about 1 month ago

Rubric-based On-policy Distillation

Paper • 2605.07396 • Published May 8 • 41

upvoted 4 papers about 2 months ago

upvoted a collection about 2 months ago

Rethinking OPD

Collection

This collection includes the models used in the paper "Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recip • 5 items • Updated 30 days ago • 3

upvoted 5 papers about 2 months ago

LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling

Paper • 2605.08083 • Published May 8 • 70

Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL

Paper • 2604.28123 • Published May 1 • 49

MiniCPM-o 4.5: Towards Real-Time Full-Duplex Omni-Modal Interaction

Paper • 2604.27393 • Published Apr 30 • 81

Skill1: Unified Evolution of Skill-Augmented Agents via Reinforcement Learning

Paper • 2605.06130 • Published May 7 • 116

MiA-Signature: Approximating Global Activation for Long-Context Understanding

Paper • 2605.06416 • Published May 7 • 57

upvoted 3 papers 2 months ago

MAIC-UI: Making Interactive Courseware with Generative UI

Paper • 2604.25806 • Published Apr 28 • 8

Co-Evolving Policy Distillation

Paper • 2604.27083 • Published Apr 29 • 68

Near-Future Policy Optimization

Paper • 2604.20733 • Published Apr 22 • 77

upvoted a paper 3 months ago

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Paper • 2604.13016 • Published Apr 14 • 113

liyaxuan

AI & ML interests

Recent Activity

Organizations

lllyx's activity