Skip to yearly menu bar Skip to main content


Rethinking Fine-tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning

Feng Chen · Allan Raventos · Nan Cheng · Surya Ganguli · Shaul Druckmann

Abstract

Chat is not available.