Skip to yearly menu bar Skip to main content


Poster

On Evaluating the Durability of Safeguards for Open-Weight LLMs

Xiangyu Qi ⋅ Boyi Wei ⋅ Nicholas Carlini ⋅ Yangsibo Huang ⋅ Tinghao Xie ⋅ Luxi He ⋅ Matthew Jagielski ⋅ Milad Nasr ⋅ Prateek Mittal ⋅ Peter Henderson
2025 Poster

Abstract

Video

Chat is not available.