Skip to yearly menu bar Skip to main content


Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Runze Liu · Junqi Gao · Jian Zhao · Kaiyan Zhang · Xiu Li · Biqing Qi · Wanli Ouyang · Bowen Zhou

Abstract

Chat is not available.