Skip to yearly menu bar Skip to main content


TASP: Preserving Training Dynamics in Transformers via NTK-Aware Structured Pruning

Mengting Ai · Tianxin Wei · Jingrui He

Abstract

Chat is not available.