Skip to yearly menu bar Skip to main content


Poster

Curiosity-driven Red-teaming for Large Language Models

Zhang-Wei Hong ⋅ Idan Shenfeld ⋅ Johnson (Tsun-Hsuan) Wang ⋅ Yung-Sung Chuang ⋅ Aldo Pareja ⋅ James R Glass ⋅ Akash Srivastava ⋅ Pulkit Agrawal
2024 Poster

Abstract

Video

Chat is not available.