Skip to yearly menu bar Skip to main content


LLM Neurosurgeon: Targeted Knowledge Removal in LLMs using Sparse Autoencoders

Dylan Zhou · Kunal Patil · Yifan Sun · Karthik lakshmanan · Senthooran Rajamanoharan · Arthur Conmy

Abstract

Chat is not available.