Poster

NFT: Bridging Supervised Learning and Reinforcement Learning in Math Reasoning

Huayu Chen · Kaiwen Zheng · Qinsheng Zhang · Ganqu Cui · Yin Cui · Haotian Ye · Tsung-Yi Lin · Ming-Yu Liu · Jun Zhu · Haoxiang Wang

Abstract
