Skip to yearly menu bar Skip to main content


Distortion of AI Alignment Revisited: RLHF Is a Decent Utilitarian Aligner

Kazusato Oko ⋅ Annie Ulichney ⋅ Nika Haghtalab ⋅ Han Bao

Abstract

Chat is not available.