Skip to yearly menu bar Skip to main content


Poster

HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks

Jiuding Sun ⋅ Jing Huang ⋅ Sidharth Baskaran ⋅ Karel D'Oosterlinck ⋅ Christopher Potts ⋅ Michael Sklar ⋅ Atticus Geiger
2025 Poster

Abstract

Video

Chat is not available.