Poster

SafeMoE: Safe Fine-Tuning for MoE LLMs by Aligning Harmful Input Routing

Jaehan Kim · Minkyoo Song · Seungwon Shin · Sooel Son

Abstract
