LLaVAction: evaluating and training multi-modal large language models for action recognition Paper • 2503.18712 • Published Mar 24, 2025 • 4
BalCapRL: A Balanced Framework for RL-Based MLLM Image Captioning Paper • 2605.07394 • Published May 8 • 5