Skip to yearly menu bar Skip to main content


Poster

Incentivizing LLM Reasoning via Reinforcement Learning with Functional Monte Carlo Tree Search

Kongcheng Zhang · QI YAO · Baisheng Lai · Jiaxing Huang · Wenkai Fang · Dacheng Tao · Mingli Song · Shunyu Liu

Abstract

Log in and register to view live content