

Poster

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions

Federico Bianchi · Mirac Suzgun · Giuseppe Attanasio · Paul Röttger · Dan Jurafsky · Tatsunori Hashimoto · James Y Zou

