Skip to yearly menu bar Skip to main content


Poster

DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains

Tian Liang · Wenxiang Jiao · Zhiwei He · Jiahao Xu · Haitao Mi · Dong Yu

Abstract

Log in and register to view live content