Skip to yearly menu bar Skip to main content


Poster

Count Counts: Motivating Exploration in LLM Reasoning with Count-based Intrinsic Rewards

XUAN ZHANG · Ruixiao Li · Zhijian Zhou · Long Li · Yulei Qin · Ke Li · Xing Sun · Xiaoyu Tan · chao qu · Yuan Qi

Abstract

Log in and register to view live content