Skip to yearly menu bar Skip to main content


Poster

Tamper-Resistant Safeguards for Open-Weight LLMs

Rishub Tamirisa ⋅ Bhrugu Bharathi ⋅ Long Phan ⋅ Andy Zhou ⋅ Alice Gatti ⋅ Tarun Suresh ⋅ Maxwell Lin ⋅ Justin Wang ⋅ Rowan Wang ⋅ Ron Arel ⋅ Andy Zou ⋅ Dawn Song ⋅ Bo Li ⋅ Dan Hendrycks ⋅ Mantas Mazeika
2025 Poster

Abstract

Video

Chat is not available.