Skip to yearly menu bar Skip to main content


Poster

Boosting Multi-Domain Reasoning of LLMs via Curvature-Guided Policy Optimization

Xize Liang · Lin Yang · Jie Wang · Rui Liu · Yang Lu · Jinliang Zeng · Hanzhu Chen · Dong Li · Jianye HAO

Abstract

Log in and register to view live content