Skip to yearly menu bar Skip to main content


Poster

Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning

Shenao Zhang · Yaqing Wang · Yinxiao Liu · Tianqi Liu · Peter Grabowski · Eugene Ie · Zhaoran Wang · Yunxuan Li

Abstract

Log in and register to view live content