Skip to yearly menu bar Skip to main content


The Delta Learning Hypothesis: Preference Tuning on Weak Data Can Yield Strong Gains

Scott Geng · Hamish Ivison · Chun-Liang Li · Maarten Sap · Jerry Li · Ranjay Krishna · Pang Wei Koh

Abstract

Video

Chat is not available.