Skip to yearly menu bar Skip to main content


Poster

iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models

Lianyu Hu · Liqing Gao · Fanhua Shang · Liang Wan · Wei Feng

Abstract

Log in and register to view live content