Skip to yearly menu bar Skip to main content


Poster

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Federico Bianchi ⋅ Mirac Suzgun ⋅ Giuseppe Attanasio ⋅ Paul Röttger ⋅ Dan Jurafsky ⋅ Tatsunori Hashimoto ⋅ James Y Zou

Abstract

Video

Chat is not available.