

Poster

Taming Overconfidence in LLMs: Reward Calibration in RLHF

Jixuan Leng ⋅ Chengsong Huang ⋅ Banghua Zhu ⋅ Jiaxin Huang
2025 Poster


