Skip to yearly menu bar Skip to main content


Poster

Universal Model Routing for Efficient LLM Inference

Wittawat Jitkrittum · Harikrishna Narasimhan · Ankit Singh Rawat · Jeevesh Juneja · Congchao Wang · Zifeng Wang · Alec Go · Chen-Yu Lee · Pradeep Shenoy · Rina Panigrahy · Aditya Krishna Menon · Sanjiv Kumar

Abstract

Log in and register to view live content