Skip to yearly menu bar Skip to main content


Poster

THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning

Qikai Chang · Zhenrong Zhang · Pengfei Hu · Jun Du · Jiefeng Ma · Yicheng Pan · Jianshu Zhang · Quan Liu · Gao Jianqing

Abstract

Log in and register to view live content