Skip to yearly menu bar Skip to main content


Poster

Understanding and Enhancing Safety Mechanisms of LLMs via Safety-Specific Neuron

Yiran Zhao ⋅ Wenxuan Zhang ⋅ Yuxi Xie ⋅ Anirudh Goyal ⋅ Kenji Kawaguchi ⋅ Michael Qizhe Shieh
2025 Poster

Abstract

Video

Chat is not available.