Skip to yearly menu bar Skip to main content


Virtual Poster presentation / top 5% paper

When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It?

Mert Yuksekgonul ⋅ Federico Bianchi ⋅ Ria Kalluri ⋅ Dan Jurafsky ⋅ James Y Zou

Abstract

Video

Chat is not available.