Skip to yearly menu bar Skip to main content


Poster

$\mathbf{T^3}$: Reducing Belief Deviation in Reinforcement Learning for Active Reasoning

Deyu Zou · Yongqiang Chen · Jianxiang Wang · Garry YANG · Mufei Li · James Cheng · Yu Gong · Pan Li · Qing Da

Abstract

Log in and register to view live content