Skip to yearly menu bar Skip to main content


A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior

Harry Mayne ⋅ Justin Kang ⋅ Dewi Gould ⋅ Kannan Ramchandran ⋅ Adam Mahdi ⋅ Noah Y Siegel

Abstract

Chat is not available.