Skip to yearly menu bar Skip to main content


(6 events)   Timezone:  
Show all
Toggle Poster Visibility
Oral
Fri Apr 25 07:30 PM -- 07:42 PM (PDT) @ Garnet 213-215 None
How much of my dataset did you use? Quantitative Data Usage Inference in Machine Learning
Yao Tong · Jiayuan Ye · Sajjad Zarifzadeh · Reza Shokri
[ OpenReview
Oral
Fri Apr 25 07:42 PM -- 07:54 PM (PDT) @ Garnet 213-215 None
Proxy Denoising for Source-Free Domain Adaptation
Song Tang · Wenxin Su · Yan Gan · Mao Ye · Jianwei Dr. Zhang · Xiatian Zhu
[ OpenReview
Oral
Fri Apr 25 07:54 PM -- 08:06 PM (PDT) @ Garnet 213-215 None
Data Shapley in One Training Run
Jiachen (Tianhao) Wang · Prateek Mittal · Dawn Song · Ruoxi Jia
[ OpenReview
Oral
Fri Apr 25 08:06 PM -- 08:18 PM (PDT) @ Garnet 213-215 None
Data Selection via Optimal Control for Language Models
Yuxian Gu · Li Dong · Hongning Wang · Yaru Hao · Qingxiu Dong · Furu Wei · Minlie Huang
[ OpenReview
Oral
Fri Apr 25 08:18 PM -- 08:30 PM (PDT) @ Garnet 213-215 None
Combatting Dimensional Collapse in LLM Pre-Training Data via Submodular File Selection
Ziqing Fan · Siyuan Du · Shengchao Hu · Pingjie Wang · Li Shen · Ya Zhang · Dacheng Tao · Yanfeng Wang
[ OpenReview
Oral
Fri Apr 25 08:30 PM -- 08:42 PM (PDT) @ Garnet 213-215 None
DEPT: Decoupled Embeddings for Pre-training Language Models
Alex Iacob · Lorenzo Sani · Meghdad Kurmanji · William Shen · Xinchi Qiu · Dongqi Cai · Yan Gao · Nic Lane
[ Slides [ OpenReview