SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging

Aladin Djuhera · Swanand Kadhe · Farhan Ahmed · Syed Zawad · Holger Boche

Abstract
