Skip to yearly menu bar Skip to main content


Poster

ResT: Reshaping Token-Level Policy Gradients for Tool-Use Large Language Models

Zihan Lin · Xiaohan Wang · Jie Cao · Jiajun Chai · Guojun Yin · Wei Lin · Ran He

Abstract

Log in and register to view live content