Skip to yearly menu bar Skip to main content


Poster

Beyond correlation: The impact of human uncertainty in measuring the effectiveness of automatic evaluation and LLM-as-a-judge

Aparna Elangovan ⋅ Lei Xu ⋅ Jongwoo Ko ⋅ Mahsa Elyasi ⋅ Ling Liu ⋅ Sravan Babu Bodapati ⋅ Dan Roth
2025 Poster

Abstract

Video

Chat is not available.