Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Thu Apr 24 07:30 PM -- 07:42 PM (PDT) @ Garnet 212-213 None
MAP: Multi-Human-Value Alignment Palette
Xinran Wang · Qi Le · Ammar Ahmed · Enmao Diao · Yi Zhou · Nathalie Baracaldo · Jie Ding · Ali Anwar
[ Slides [ OpenReview
Oral
Thu Apr 24 07:42 PM -- 07:54 PM (PDT) @ Garnet 212-213 None
Limits to scalable evaluation at the frontier: LLM as judge won’t beat twice the data
Florian Eddie Dorner · Vivian Nastl · Moritz Hardt
[ OpenReview
Oral
Thu Apr 24 07:54 PM -- 08:06 PM (PDT) @ Garnet 212-213 None
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
Jaehun Jung · Faeze Brahman · Yejin Choi
[ OpenReview
Oral
Thu Apr 24 08:06 PM -- 08:18 PM (PDT) @ Garnet 212-213 None
AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
Ximing Lu · Melanie Sclar · Skyler Hallinan · Niloofar Mireshghallah · Jiacheng Liu · Seungju Han · Allyson Ettinger · Liwei Jiang · Khyathi Chandu · Nouha Dziri · Yejin Choi
[ Slides [ OpenReview
Oral
Thu Apr 24 08:18 PM -- 08:30 PM (PDT) @ Garnet 212-213 None
Consistency Checks for Language Model Forecasters
Daniel Paleka · Abhimanyu Pallavi Sudhir · Alejandro Alvarez · Vineeth Bhat · Adam Shen · Evan Wang · Florian Tramer
[ OpenReview
Oral
Thu Apr 24 08:30 PM -- 08:42 PM (PDT) @ Garnet 212-213 None
Probabilistic Learning to Defer: Handling Missing Expert Annotations and Controlling Workload Distribution
Cuong Nguyen · Thanh-Toan Do · Gustavo Carneiro
[ Slides [ OpenReview