Skip to yearly menu bar Skip to main content


Scaling Test-Time Compute Without Verification or RL is Suboptimal

Amrith Setlur · Nived Rajaraman · Sergey Levine · Aviral Kumar

Abstract

Chat is not available.