Poster

Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents

Guoqing Wang · Sunhao Dai · Guangze Ye · Zeyu Gan · Wei Yao · Yong Deng · Xiaofeng Wu · Zhenzhe Ying

Abstract
