Skip to yearly menu bar Skip to main content


Poster

Incentivizing Consistent, Effective and Scalable Reasoning Capability in Audio LLMs via Reasoning Process Rewards

Jiajun Fan · Roger Ren · Jingyuan Li · Rahul Pandey · Prashanth Gurunath Shivakumar · Yile Gu · Ankur Gandhe · Ge Liu · Ivan Bulyko

Abstract

Log in and register to view live content