ICLR 2021 Friday 05/7

Skip to yearly menu bar Skip to main content

Timezone: Europe/Vienna

Full Schedule Mon 5/3 Tue 5/4 Wed 5/5 Thu 5/6 Fri 5/7

Invited Talk

Self-Supervision for Learning from the Bottom Up

Alexei Efros

1:00 AM - 2:00 AM

Why do self-supervised learning? A common answer is: "because data labeling is expensive." In this talk, I will argue that there are other, perhaps more fundamental reasons for working on self-supervision. First, it should allow us to get away from the tyranny of top-down semantic categorization and force meaningful associations to emerge naturally from the raw sensor data in a bottom-up fashion. Second, it should allow us to ditch fixed datasets and enable continuous, online learning, which is a much more natural setting for real-world agents. Third, and most intriguingly, there is hope that it might be possible to force a self-supervised task curriculum to emerge from first principles, even in the absence of a pre-defined downstream task or goal, similar to evolution. In this talk, I will touch upon these themes to argue that, far from running its course, research in self-supervised learning is only just beginning.

Speaker Bio

Alexei (Alyosha) Efros is a professor of computer science at UC Berkeley and member of the BAIR lab. Prior to that, he was nine years on the faculty of Carnegie Mellon University, and has also been affiliated with École Normale Supérieure/INRIA and University of Oxford. His research is in the area of computer vision and computer graphics, especially at the intersection of the two. He is particularly interested in using data-driven techniques to tackle problems where large quantities of unlabeled visual data are readily available. Efros received his PhD in 2003 from UC Berkeley. He is a recipient of the Sloan Fellowship (2008), Guggenheim Fellowship (2008), Okawa Grant (2008), SIGGRAPH Significant New Researcher Award (2010), 3 PAMI-TC Helmholtz Test-of-Time Prizes (1999,2003,2005), the ACM Prize in Computing (2016), and Diane McEntyre Award for Excellence in Teaching Computer Science (2019). He likes Paris and gelato.

Poster

Poster Session 12

2:00 AM - 4:00 AM

73 Events in this session

ANOCE: Analysis of Causal Effects with Multiple Mediators via Constrained Structural Learning

Hengrui Cai · Rui Song · Wenbin Lu

CO2: Consistent Contrast for Unsupervised Visual Representation Learning

Chen Wei · Huiyu Wang · Wei Shen · Alan Yuille

Combining Ensembles and Data Augmentation Can Harm Your Calibration

Yeming Wen · Ghassen Jerfel · Rafael Müller · Michael W Dusenberry · Jasper Snoek · Balaji Lakshminarayanan · Dustin Tran

Contrastive Syn-to-Real Generalization

Wuyang Chen · Zhiding Yu · Shalini De Mello · Sifei Liu · Jose M. Alvarez · Zhangyang Wang · Anima Anandkumar

Distributional Sliced-Wasserstein and Applications to Generative Modeling

Khai Nguyen · Nhat Ho · Tung Pham · Hung Bui

Few-Shot Learning via Learning the Representation, Provably

Simon Du · Wei Hu · Sham M Kakade · Jason Lee · Qi Lei

Generative Scene Graph Networks

Fei Deng · Zhuo Zhi · Donghun Lee · Sungjin Ahn

HeteroFL: Computation and Communication Efficient Federated Learning for Heterogeneous Clients

Enmao Diao · Jie Ding · VAHID TAROKH

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve · Kevin Duarte · Yogesh S Rawat · Mubarak Shah

Large Scale Image Completion via Co-Modulated Generative Adversarial Networks

Shengyu Zhao · Jonathan Cui · Yilun Sheng · Yue Dong · Xiao Liang · Eric Chang · Yan Xu

Linear Convergent Decentralized Optimization with Compression

Xiaorui Liu · Yao Li · Rongrong Wang · Jiliang Tang · Ming Yan

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery

Yutong Xie · Chence Shi · Hao Zhou · Yuwei Yang · Weinan Zhang · Yong Yu · Lei Li

Molecule Optimization by Explainable Evolution

Binghong Chen · Tianzhe Wang · Chengtao Li · Hanjun Dai · Le Song

Neural Pruning via Growing Regularization

Huan Wang · Can Qin · Yulun Zhang · Yun Fu

Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds

Yihao Feng · Ziyang Tang · Na Zhang · Qiang Liu

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee · Yian Zhu · Kihyuk Sohn · Chun-Liang Li · Jinwoo Shin · Honglak Lee

Adapting to Reward Progressivity via Spectral Reinforcement Learning

Michael Dann · John Thangarajah

A Design Space Study for LISTA and Beyond

Tianjian Meng · Xiaohan Chen · Yifan Jiang · Zhangyang Wang

A Learning Theoretic Perspective on Local Explainability

Jeffrey Li · Vaishnavh Nagarajan · Gregory Plumb · Ameet Talwalkar

Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval

Wenhan Xiong · Xiang Li · Srini Iyer · Jingfei Du · Patrick Lewis · William Yang Wang · Yashar Mehdad · Scott Yih · Sebastian Riedel · Douwe Kiela · Barlas Oguz

ARMOURED: Adversarially Robust MOdels using Unlabeled data by REgularizing Diversity

Kangkang Lu · Cuong Nguyen · Xun Xu · Kiran Chari · Yu Jing Goh · Chuan-Sheng Foo

Async-RED: A Provably Convergent Asynchronous Block Parallel Stochastic Method using Deep Denoising Priors

Yu Sun · Jiaming Liu · Yiran Sun · Brendt Wohlberg · Ulugbek Kamilov

Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation

Hao Li · Chenxin Tao · Xizhou Zhu · Xiaogang Wang · Gao Huang · Jifeng Dai

BiPointNet: Binary Neural Network for Point Clouds

Haotong Qin · Zhongang Cai · Mingyuan Zhang · Yifu Ding · Haiyu Zhao · Shuai Yi · Xianglong Liu · Hao Su

Calibration of Neural Networks using Splines

Kartik Gupta · Amir Rahimi · Thalaiyasingam Ajanthan · Thomas Mensink · Cristian Sminchisescu · Richard Hartley

Clustering-friendly Representation Learning via Instance Discrimination and Feature Decorrelation

Yaling Tao · Kentaro Takagi · Kouta Nakata

Combining Label Propagation and Simple Models out-performs Graph Neural Networks

Qian Huang · Horace He · Abhay Singh · Ser-Nam Lim · Austin Benson

Convex Regularization behind Neural Reconstruction

Arda Sahiner · Morteza Mardani · Batu Ozturkler · Mert Pilanci · John M Pauly

CPT: Efficient Deep Neural Network Training via Cyclic Precision

Yonggan Fu · Han Guo · Meng Li · Xin Yang · Yining Ding · Vikas Chandra · Yingyan Lin

Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization

Juntae Lee · Mihir Jain · Hyoungwoo Park · Sungrack Yun

CT-Net: Channel Tensorization Network for Video Classification

Kunchang Li · xianhang li · Yali Wang · Jun Wang · Yu Qiao

DynaTune: Dynamic Tensor Program Optimization in Deep Neural Network Compilation

Minjia Zhang · Menghao Li · Chi Wang · Mingqin Li

Estimating and Evaluating Regression Predictive Uncertainty in Deep Object Detectors

Ali Harakeh · Steven L Waslander

Evaluation of Similarity-based Explanations

Kazuaki Hanawa · Sho Yokoi · Satoshi Hara · Kentaro Inui

Extreme Memorization via Scale of Initialization

Harsh Mehta · Ashok Cutkosky · Behnam Neyshabur

Factorizing Declarative and Procedural Knowledge in Structured, Dynamical Environments

Anirudh Goyal · Alex Lamb · Phanideep Gampa · Philippe Beaudoin · Charles Blundell · Sergey Levine · Yoshua Bengio · Michael Mozer

Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers

Kaidi Xu · Huan Zhang · Shiqi Wang · Yihan Wang · Suman Jana · Xue Lin · Cho-Jui Hsieh

Fast Geometric Projections for Local Robustness Certification

Aymeric Fromherz · Klas Leino · Matt Fredrikson · Bryan Parno · Corina Pasareanu

FastSpeech 2: Fast and High-Quality End-to-End Text to Speech

Yi Ren · Chenxu Hu · Xu Tan · Tao Qin · Sheng Zhao · Zhou Zhao · Tie-Yan Liu

FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

Xiaoxiao Li · Meirui Jiang · Xiaofei Zhang · Michael Kamp · Qi Dou

Fully Unsupervised Diversity Denoising with Convolutional Variational Autoencoders

Mangal Prakash · Alexander Krull · Florian Jug

GAN2GAN: Generative Noise Learning for Blind Denoising with Single Noisy Images

Sungmin Cha · Taeeon Park · Byeongjoon Kim · Jongduk Baek · Taesup Moon

Generalization bounds via distillation

Daniel Hsu · Ziwei Ji · Matus Telgarsky · Lan Wang

Global Convergence of Three-layer Neural Networks in the Mean Field Regime

Huy Tuan Pham · Phan-Minh Nguyen

Greedy-GQ with Variance Reduction: Finite-time Analysis and Improved Complexity

Shaocong Ma · Ziyi Chen · Yi Zhou · Shaofeng Zou

Group Equivariant Generative Adversarial Networks

Neel Dey · Antong Chen · Soheil Ghafurian

Heteroskedastic and Imbalanced Deep Learning with Adaptive Regularization

Kaidi Cao · Yining Chen · Junwei Lu · Nikos Arechiga · Adrien Gaidon · Tengyu Ma

How Much Over-parameterization Is Sufficient to Learn Deep ReLU Networks?

Zixiang Chen · Yuan Cao · Difan Zou · Quanquan Gu

HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Chaojian Li · Zhongzhi Yu · Yonggan Fu · Yongan Zhang · Yang Zhao · Haoran You · Qixuan Yu · Yue Wang · Cong Hao · Yingyan Lin

Isotropy in the Contextual Embedding Space: Clusters and Manifolds

Xingyu Cai · Jiaji Huang · Yuchen Bian · Kenneth Church

Learning Energy-Based Generative Models via Coarse-to-Fine Expanding and Sampling

Yang Zhao · Jianwen Xie · Ping Li

Learning perturbation sets for robust machine learning

Eric Wong · Zico Kolter

Learning to Make Decisions via Submodular Regularization

Ayya Alieva · Aiden Aceves · Jialin Song · Stephen Mayo · Yisong Yue · Yuxin Chen

Learning to Sample with Local and Global Contexts in Experience Replay Buffer

Youngmin Oh · Kimin Lee · Jinwoo Shin · Eunho Yang · Sung Ju Hwang

Long-tailed Recognition by Routing Diverse Distribution-Aware Experts

Xudong Wang · Long Lian · Zhongqi Miao · Ziwei Liu · Stella Yu

LowKey: Leveraging Adversarial Attacks to Protect Social Media Users from Facial Recognition

Valeriia Cherepanova · Micah Goldblum · Harrison Foley · Shiyuan Duan · John P Dickerson · Gavin Taylor · Tom Goldstein

Minimum Width for Universal Approximation

Sejun Park · Chulhee Yun · Jaeho Lee · Jinwoo Shin

Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network

James Diffenderfer · Bhavya Kailkhura

Multi-timescale Representation Learning in LSTM Language Models

Shivangi Mahto · Vy Vo · Javier Turek · Alexander Huth

Neural representation and generation for RNA secondary structures

Zichao Yan · William Hamilton · Mathieu Blanchette

Neural Thompson Sampling

Weitong ZHANG · Dongruo Zhou · Lihong Li · Quanquan Gu

No MCMC for me: Amortized sampling for fast and stable training of energy-based models

Will Grathwohl · Jacob Kelly · Milad Hashemi · Mohammad Norouzi · Kevin Swersky · David Duvenaud

Nonseparable Symplectic Neural Networks

Shiying Xiong · Yunjin Tong · Xingzhe He · Shuqi Yang · Cheng Yang · Bo Zhu

On the Curse of Memory in Recurrent Neural Networks: Approximation and Optimization Analysis

Zhong Li · Jiequn Han · Weinan E · Qianxiao Li

Prototypical Representation Learning for Relation Extraction

Ning Ding · Xiaobin Wang · Yao Fu · Guangwei Xu · Rui Wang · Pengjun Xie · Ying Shen · Fei Huang · Hai-Tao Zheng · Rui Zhang

Randomized Automatic Differentiation

Deniz Oktay · Nick McGreivy · Joshua Aduol · Alex Beatson · Ryan P Adams

Representing Partial Programs with Blended Abstract Semantics

Maxwell Nye · Yewen Pu · Matthew Bowers · Jacob Andreas · Joshua B Tenenbaum · Armando Solar-Lezama

Self-supervised Learning from a Multi-view Perspective

Yao-Hung Hubert Tsai · Yue Wu · Ruslan Salakhutdinov · Louis-Philippe Morency

Self-supervised Representation Learning with Relative Predictive Coding

Yao-Hung Hubert Tsai · Martin Ma · Muqiao Yang · Han Zhao · Louis-Philippe Morency · Ruslan Salakhutdinov

Stochastic Security: Adversarial Defense Using Long-Run Dynamics of Energy-Based Models

Mitchell Hill · Jonathan Mitchell · Song-Chun Zhu

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Colin Wei · Kendrick Shen · Yining Chen · Tengyu Ma

The Recurrent Neural Tangent Kernel

Sina Alemohammad · Jack Wang · Randall Balestriero · Richard Baraniuk

When Do Curricula Work?

Xiaoxia (Shirley) Wu · Ethan Dyer · Behnam Neyshabur

Go to Event Page

Oral

Oral Session 12

4:00 AM - 6:48 AM

16 Events in this session

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Colin Wei · Kendrick Shen · Yining Chen · Tengyu Ma

Long-tailed Recognition by Routing Diverse Distribution-Aware Experts

Xudong Wang · Long Lian · Zhongqi Miao · Ziwei Liu · Stella Yu

Self-Supervised Policy Adaptation during Deployment

Nicklas Hansen · Rishabh Jangir · Yu Sun · Guillem Alenyà · Pieter Abbeel · Alexei Efros · Lerrel Pinto · Xiaolong Wang

What are the Statistical Limits of Offline RL with Linear Function Approximation?

Ruosong Wang · Dean Foster · Sham M Kakade

Q&A

RMSprop converges with proper hyper-parameter

Naichen Shi · Dawei Li · Mingyi Hong · Ruoyu Sun

A Good Image Generator Is What You Need for High-Resolution Video Synthesis

Yu Tian · Jian Ren · Menglei Chai · Kyle Olszewski · Xi Peng · Dimitris Metaxas · Sergey Tulyakov

Random Feature Attention

Hao Peng · Nikolaos Pappas · Dani Yogatama · Roy Schwartz · Noah Smith · Lingpeng Kong

Learning with Feature-Dependent Label Noise: A Progressive Approach

Yikai Zhang · Songzhu Zheng · Pengxiang Wu · Mayank Goswami · Chao Chen

Sparse Quantized Spectral Clustering

Zhenyu Liao · Romain Couillet · Michael W Mahoney

Q&A

Learning a Latent Simplex in Input Sparsity Time

Ainesh Bakshi · Chiranjib Bhattacharyya · Ravi Kannan · David Woodruff · Samson Zhou

Topology-Aware Segmentation Using Discrete Morse Theory

Xiaoling Hu · Yusu Wang · Li Fuxin · Dimitris Samaras · Chao Chen

MARS: Markov Molecular Sampling for Multi-objective Drug Discovery

Yutong Xie · Chence Shi · Hao Zhou · Yuwei Yang · Weinan Zhang · Yong Yu · Lei Li

Distributional Sliced-Wasserstein and Applications to Generative Modeling

Khai Nguyen · Nhat Ho · Tung Pham · Hung Bui

Q&A

Go to Event Page

Remarks

Closing Remarks

7:00 AM - 8:00 AM

Workshop

Science and Engineering of Deep Learning

Levent Sagun · Caglar Gulcehre · Adriana Romero-Soriano · Negar Rostamzadeh · Stefano Sarao Mannelli · Lenka Zdeborova · Samy Bengio

11:30 AM - 8:30 PM

We aim to create a venue where we discuss seemingly contrasting challenges in machine learning research and their consequences. We invite researchers to discuss the boundaries between science and engineering, the implications of having blurred boundaries, and their potential consequences in areas of life beyond research.

We organized the first ``Science meets Engineering in Deep Learning'' workshop at NeurIPS 2019, which aimed to identify the potential boundaries between science and engineering and the role of theoretically driven and application-driven research in deep learning. The workshop's discussions highlighted how intertwined science and engineering are and emphasized the benefits of their symbiotic relationship to push the boundaries of both theoretically driven and application-driven research. To highlight the communication channel we aimed to build, we chose "Science meets Engineering'' in the title for the first iteration of the workshop.

Since then, such boundaries appear harder and harder to draw, and it becomes increasingly clear that we need to agree on a set of values that define us as a community, and that will shape our future research. In particular, we envision that such values will help (1) emphasize important engineering and scientific practices that we should foster to increase the robustness of our research, (2) acknowledge the broader impact of our research, and (3) abide by ethical standards.

Reflecting this shift in perspective, this year's proposed title is "Science and Engineering of Deep Learning''. With this in mind, we are proposing the second iteration of the workshop for ICLR 2021, focusing on the core themes mentioned above. In particular, we would like to ask (1) "What are the scientific and engineering practices that we should promote as a community?" and "How do those interact?", and (2) "What is the broader impact of such adopted scientific and engineering practices?"

https://sites.google.com/view/sedl-workshop

Workshop

Neural Compression: From Information Theory to Applications

Stephan Mandt · Robert Bamler · Yingzhen Li · Christopher Schroers · Yang Yang · Max Welling · Taco Cohen

12:30 PM - 10:30 PM

Data compression is a problem of great practical importance, and a new frontier for machine learning research that combines empirical findings (from the deep probabilistic modeling literature) with fundamental theoretical insights (from information theory, source coding, and minimum description length theory). Recent work building on deep generative models such as variational autoencoders, GANs, and normalizing flows showed that novel machine-learning-based compression methods can significantly outperform state-of-the-art classical compression codecs for image and video data. At the same time, these neural compression methods provide new evaluation metrics for model and inference performance on a rate/distortion trade-off. This workshop aims to draw more attention to the young and highly impactful field of neural compression. In contrast to other workshops that focus on practical compression performance, our goal is to bring together researchers from deep learning, information theory, and probabilistic modeling, to learn from each other and to encourage exchange on fundamentally novel issues such as the role of stochasticity in compression algorithms or ethical risks of semantic compression artifacts.

Workshop

Hardware-Aware Efficient Training of Deep Learning Models

Ghouthi BOUKLI HACENE · Vincent Gripon · François Leduc-Primeau · Vahid Partovi Nia · Fan Yang · Andreas Moshovos · Yoshua Bengio

1:45 PM - 9:00 PM

To reach top-tier performance, deep learning architectures usually rely on a large number of parameters and operations, and thus require to be processed using considerable power and memory. Numerous works have proposed to tackle this problem using quantization of parameters, pruning, clustering of parameters, decompositions of convolutions, or using distillation. However, most of these works aim at accelerating only the inference process and disregard the training phase. In practice, however, it is the learning phase that is by far the most complex. There has been recent efforts in introducing some compression on the training process, however, it remains challenging. In this workshop, we propose to focus on reducing the complexity of the training process. Our aim is to gather researchers interested in reducing energy, time, or memory usage for faster/cheaper/greener prototyping or deployment of deep learning models. Due to the dependence of deep learning on large computational capacities, the outcomes of the workshop could benefit all who deploy these solutions, including those who are not hardware specialists. Moreover, it would contribute to making deep learning more accessible to small businesses and small laboratories. Indeed, training complexity is of interest to many distinct communities. A first example is training on edge devices, where training can be used to specialize to data obtained online when the data cannot be transmitted back to the cloud because of constraints on privacy or communication bandwidth. Another example is accelerating training on dedicated hardware such as GPUs or TPUs.

Workshop

Geometric and Topological Representation Learning

Guy Wolf · Xiuyuan Cheng · Smita Krishnaswamy · Jure Leskovec · Bastian Rieck · Soledad Villar

2:00 PM - 1:00 AM

Over the past two decades, high-throughput data collection technologies have become commonplace in most fields of science and technology, and with them an ever-increasing amount of big high dimensional data is being generated by virtually every real-world system. While such data systems are highly diverse in nature, the underlying data analysis and exploration task give rise to common challenges at the core of modern representation learning. For example, even though modern real-world data typically have high dimensional ambient measurement spaces, they often exhibit low dimensional intrinsic structures that can be uncovered by geometry-oriented methods, such as the ones encountered in manifold learning, graph signal processing, geometric deep learning, and topological data analysis. As a result, recent years have seen significant interest and progress in geometric and topological approaches to representation learning,whichenabletractableexploratoryanalysisbydomainexpertswhoareoftennotcomputationoriented. Our overarching goal in the proposed workshop is to deepen our understanding of the challenges and opportunities in this field, while breaking the barriers between the typically disjoint computational approaches (or communities) that work in this field, with emphasis on the domains of topological data analysis, graph representation learning, and manifold learning, on which we shall subsequently briefly comment.

Website: https://gt-rl.github.io/

Workshop

S2D-OLAD: From shallow to deep, overcoming limited and adverse data

Colin Bellinger · Roberto Corizzo · Vincent Dumoulin · Nathalie Japkowicz

2:00 PM - 12:00 AM

Data coupled with the right algorithms offers the potential to save lives, protect the environment and increase profitability in different applications and domains. This potential, however, can be severely inhibited by adverse data properties specifically resulting in poor model performance, failed projects, and potentially serious social implications. This workshop will examine representation learning in the context of limited and sparse training samples, class imbalance, long-tailed distributions, rare cases and classes, and outliers. Speakers and participants will discuss the challenges and risks associated with designing, developing and learning deep representations from data with adverse properties. In addition, the workshop aims to connect researchers devoted to these topics in the traditional shallow representation learning research community and the more recent deep learning community, in order to advance novel and holistic solutions. Critically, given the growth in the application of AI to real-world decision making, the workshop will also facilitate a discussion of the potential social issues associated with application of deep representation learning in the context of data adversity. The workshop will bring together theoretical and applied deep learning researchers from academia and industry, and lay the groundwork for fruitful research collaborations that span communities that are often siloed.

Workshop

Beyond Static Papers: Rethinking How We Share Scientific Understanding in ML

Krishna Murthy Jatavallabhula · Bhairav Mehta · Tegan Maharaj · Amy Tabb · Khimya Khetarpal · Aditya Kusupati · Anna Rogers · Sara Hooker · Breandan Considine · Devi Parikh · Derek Nowrouzezahrai · Yoshua Bengio

2:15 PM - 7:00 PM

Over the last decade, the volume of conference submissions in machine learning has broken records. Despite rapid advancements and increasing hype around AI, there is growing concern in the ML community about where the field is headed. The current pandemic gives researchers a long-awaited opportunity to pause and reflect: what kind of legacy do we want to leave behind? How are scientific results presented? How do we interpret and explain them? Does this process include and/or allow access to all stakeholders? Are the results reproducible? These are some of the many facets of effective scientific communication which will shape the next decade of ML research.

How much research is overlooked due to inaccessible communication? How many papers will be as readable in ten or twenty years? How can we make the proceedings more accessible for future generations of ML researchers? These are a few of the questions we plan to discuss in our workshop. We hope to instigate an exciting discussion on redesigning the scientific paper for the next few years of machine learning research!

Workshop

Self-Supervision for Reinforcement Learning

Ankesh Anand · Bogdan Mazoure · Amy Zhang · Thang Doan · Khurram Javed · R Devon Hjelm · Martha White

2:45 PM - 1:30 AM

Reinforcement learning entails letting an agent learn through interaction with an environment. The formalism is powerful in it’s generality, and presents us with a hard open-ended problem: how can we design agents that learn efficiently, and generalize well, given only sensory information and a scalar reward signal? The goal of this workshop is to explore the role of self-supervised learning within reinforcement learning agents, to make progress towards this goal.

Workshop

Energy-Based Models: Current Perspectives, Challenges, and Opportunities

Marc Dymetman · Adji Bousso Dieng · Hady Elsahar · Igor Mordatch · Marc'Aurelio Ranzato

2:50 PM - 1:10 AM

Energy-Based Models (EBMs) are a learning framework that assigns a quality score to any given input, its energy; contrary to
probabilistic models, there is no a priori requirement that these scores be normalized (i.e. sum to one). Energies are typically
computed through a neural network, and training an EBM corresponds to shaping the energy function such that data points nearby the underlying data manifold are associated with lower energies than data points that are far from it. Not imposing normalization affords a great power and flexibility to the modelling process, e.g. in terms of combining energies, on conditioning on certain variables, of computing global scores on complex structured objects, or on expressing prior
knowledge. However, this freedom comes with significant technical challenges, in terms of learning and inference.

A strong comeback of EBMs is currently underway. This ICLR-2021 Workshop is the opportunity to increase awareness about the diversity of works in this area, to discuss current challenges, and to encourage cross-pollination between different communities around this topic.

Workshop

AI for Public Health

Bryan Wilder · Ioana Bica · Marie-Laure Charpignon · Emma Pierson

2:55 PM - 12:00 AM

The COVID-19 pandemic has cast a spotlight on the importance of public health. Even beyond this current emergency, public health is an essential component of population-level wellbeing. Topics such as infectious disease surveillance and control, preventative health, behavioral and mental health, maternal and child wellbeing, and more all play a crucial role in society. Moreover, a range of applications in public health benefit from careful use of data to uncover outbreak dynamics, learn patterns of behavior, optimize the design of interventions, and more. The science of machine learning in a public health context is still rapidly developing, and our aim is to build a community encompassing researchers based in both machine learning and public health to address these shared questions.

Workshop

The Role of Mathematical Reasoning in General Artificial Intelligence

Yuhuai Wu · Kshitij Bansal · Wenda Li · Melanie Mitchell · David McAllester · John Harrison

2:55 PM - 11:35 PM

In this workshop, we focus on a particular kind of reasoning ability, namely, mathematical reasoning. Advanced mathematical reasoning is unique in human intelligence, and it is also a fundamental building block for many intellectual pursuits and scientific developments. We believe that addressing this problem has the potential to shed light on a path towards general reasoning mechanisms, and hence general artificial intelligence. Therefore, we would like to bring together a group of experts from various backgrounds to discuss the role of mathematical reasoning ability towards the path of demonstrating general artificial intelligence. In addition, we hope to identify missing elements and major bottlenecks towards demonstrating mathematical reasoning ability in AI systems.

Workshop

Workshop on Neural Architecture Search

Arber Zela · Aaron Klein · Frank Hutter · Liam Li · Jan Hendrik Metzen · Jovita Lukasik

3:00 PM - 12:00 AM

Neural Architecture Search (NAS) is an exciting new field of study that is taking representation learning to the next level by allowing us to learn the architectures in a data-driven way that then enables efficient learning of representations. While representation learning removed the need of manual feature engineering, it shifted the manual task to the manual selection of architectures; as a natural next step, NAS replaces this manual architecture selection step, allowing us true end-to-end learning of the architecture, the features, and the final classifier using the features expressed as instantiations of the architecture.

Since the first workshop on NAS at ICLR 2020, there have been many new developments in NAS. Firstly, there has been a large increase in standardized tabular benchmarks and more researchers releasing source code, leading to more rigorous empirical NAS research and also allowing research groups without access to industry-scale compute resources to run thorough experimental evaluations. Secondly, there are now several works aiming for standardized and modularized open-source libraries that allow for both clean evaluations of different approaches without confounding factors and for mixing and matching components of different NAS methods. Finally, by now there are also several applications of NAS beyond its original narrow focus on object recognition, to fields like semantic segmentation, speech recognition, and natural language processing.

In this workshop, we want to push NAS to the next level and aim to address questions (see proposal) which are of particular relevance to the NAS community. In terms of prospective participants, our main targets are machine learning researchers interested in understanding and improving current NAS methods, but ML researchers planning to apply existing NAS methods to novel domains are also amongst the target community.

Workshop

A Roadmap to Never-Ending RL

Feryal Behbahani · Khimya Khetarpal · Louis Kirsch · Rose Wang · Annie Xie · Adam White · Doina Precup

3:00 PM - 10:55 PM

Humans have a remarkable ability to continually learn and adapt to new scenarios over the duration of their lifetime (Smith & Gasser, 2005). This ability is referred to as never ending learning, also known as continual learning or lifelong learning. Never-ending learning is the constant development of increasingly complex behaviors and the process of building complicated skills on top of those already developed (Ring, 1997), while being able to reapply, adapt and generalize its abilities to new situations. A never-ending learner has the following desiderata

1) it learns behaviors and skills while solving its tasks
2) it invents new subtasks that may later serve as stepping stones
3) it learns hierarchically, i.e. skills learned now can be built upon later
4) it learns without ergodic or resetting assumptions on the underlying (PO)MDP
5) it learns without episode boundaries
6) it learns in a single life without leveraging multiple episodes of experience

There are several facets to building AI agents with never-ending learning abilities. Moreover, different fields have a variety of perspectives to achieving this goal. To this end, we identify key themes for our workshop including cognitive sciences, developmental robotics, agency and abstractions, open-ended learning, world modelling and active inference.

Workshop

AIMOCC -- AI: Modeling Oceans and Climate Change

Luis Martí · Nayat Sánchez-Pi

3:00 PM - 7:00 PM

Oceans play a key role in the biosphere, regulating the carbon cycle; absorbing emitted CO2 through the biological pump, and a large part of the heat that the remaining CO2 and other greenhouse gases retained in the atmosphere. Understanding the drivers of micro and macroorganisms in the ocean is of paramount importance to understand the functioning of ecosystems and the efficiency of the biological pump in sequestering carbon and thus abating climate change.

AI, ML, and mathematical modeling tools are key to understanding oceans and climate change. Consequently, the topics of interest of this workshop can be grouped into two sets.

In regard to AI and modeling, the main focus is set on:
- handling of graph-structured information,
- ML methods to learn in small data contexts,
- causal relations, interpretability, and explainability in AI,
- integrating model-driven and data-driven approaches, and
- to develop, calibrate, and validate existing mechanistic models.

In the domain application area, the main questions to be addressed are:
- Which are the major patterns in plankton taxa and functional diversity?
- How these patterns and drivers will likely change under climate change?
- How will changes affect the capacity of ocean ecosystems to sequester carbon from the atmosphere?
- What relations bind communities and local conditions?
- What are the links between biodiversity functioning and structure?
- How modern AI and computer vision can be applied as research and discovery support tool to understand planktonic communities?
- How new knowledge can be derived from anomaly detection, causal learning, and explainable AI.

The goal of this workshop is to bring together researchers that are interested and/or applying AI and ML techniques to problems related to marine biology, modeling, and climate change mitigation. We also expect to attract natural science researchers interested in learning about and applying modern AI and ML methods.

Workshop

How Can Findings About The Brain Improve AI Systems?

Shinji Nishimoto · Leila Wehbe · Alexander Huth · Javier Turek · Nicole Beckage · Vy Vo · Mariya Toneva · Hsiang-Yun Chien · Shailee Jain · Richard Antonello

3:30 PM - 2:30 AM

The brain comprises billions of neurons organized into an intricate network of highly specialized functional areas. This biological cognitive system can efficiently process vast amounts of multi-modal data to perceive and react to its ever-changing environment. Unlike current AI systems, it does not struggle with domain adaptation, few-shot learning, or common-sense reasoning. Inspiration from neuroscience has benefited AI in the past: dopamine reward signals inspired TD learning, modern convolutional networks mimic the deep, nested information flow in visual cortex, and hippocampal replay of previous experiences has brought about experience replay in reinforcement learning. Recent work at the intersection of neuroscience and AI has made progress in directly integrating neuroscientific data with AI systems and has led to learned representations that are more robust to label corruptions, allow for better generalization in some language tasks, and provide new ways to interpret and evaluate what domain-relevant information is learned by deep neural networks. In this workshop, we aim to examine the extent to which insights about the brain can lead to better AI.

Workshop

Responsible AI (RAI)

Ahmad Beirami · Emily Black · Krishna Gummadi · Hoda Heidari · Baharan Mirzasoleiman · Meisam Razaviyayn · Joshua Williams

3:45 PM - 4:00 AM

Artificial Intelligence and Machine Learning are increasingly employed by industry and government alike to make or inform high-stakes decisions for people in areas such as employment, credit lending, policing, criminal justice, healthcare, and beyond. Over the past several years, we have witnessed growing concern regarding the risks and unintended consequences of inscrutable ML techniques (in particular, deep learning) in such socially consequential domains. This realization has motivated the community to look closer at the societal impacts of automated decision making and develop tools to ensure the responsible use of AI in society. Chief among the ideals that the ML community has set out to formalize and ensure are safety, interpretability, robustness, and fairness. In this workshop, we examine the community’s progress toward these values and aim to identify areas that call for additional research efforts. In particular, by bringing researchers with diverse backgrounds, we will focus on the limitations of existing formulations of fairness, explainability, robustness and safety, and discuss the tradeoffs among them.

Our workshop will consist of a diverse set of speakers (ranging from researchers with social work background to researchers in the ML community) to discuss transparency, bias and inequity in various real-world problems, including but not limited to criminal justice, health care and medicine, poverty and homelessness, and education. In addition, our invited talks will cover interpretability, and safety of modern machine learning models, their conflicting constraints, ethical and legal issues, and unintended consequences in areas such as self-driving cars and robotics. The workshop aims to further develop these research directions for the machine learning community.

Workshop

Neural Conversational AI: Bridging the Gap Between Research and Real World (NeuCAIR)

Ahmad Beirami · Asli Celikyilmaz · Yun-Nung Chen · Paul Crook · Orianna DeMasi · Stephen Roller · Chinnadhurai Sankar · Joao Sedoc · Zhou Yu

4:00 PM - 5:30 AM

Every day, millions of people use natural language interfaces in virtual digital assistants such as Amazon Alexa, Apple’s Siri, Google, Microsoft Cortana, Samsung’s Bixby and Facebook Potal via in-home devices or phones. At the same time, interest among the NLP research community in conversational systems has blossomed to the extent that Dialogue and Interactive Systems is consistently among the top three tracks in NLP conferences receiving a record number of submissions. Today’s industrial conversational AI systems are built using the traditional NLP pipeline, i.e., natural language understanding, dialog state tracking, dialog policy, and natural language generation. Despite its success, this pipeline fundamentally limits performance, humanness, and scaling of conversational AI systems. To overcome these challenges, dialog researchers have started embracing end-to-end neural approaches for the next generation of conversational AI systems, as such approaches have been setting state-of-the-art performance records on several NLP tasks. However, Neural Conversational AI systems are still far from shippable in the real world. We identify the following main outstanding questions to bridge this gap:
- Grounding in external systems
- Safety/integrity/robustness
- Continual learning

The goal of this workshop is to bring together machine learning researchers and dialog researchers from academia and industry to encourage knowledge transfer and collaboration in this space with the goal of bridging the gap between research and real world use cases in neural approaches to Conversational AI. The ideal outcome of the workshop is to identify a set of concrete research directions for the research community (both NLP and representation learning communities) to enable the next generation of digital assistants via Neural Conversational AI systems. We will make the findings from this workshop broadly available to the research community.

Workshop

Generalization beyond the training distribution in brains and machines

Christina Funke · Judith Borowski · Drew Linsley · Xavier Boix

4:00 PM - 12:55 AM

Deep Neural Networks (DNNs) are the leading approach for nearly all domains of machine learning and computer vision, with performance at times rivaling human perception. However, there is consensus that these models are outmatched by the robustness and versatility of biological brains. DNNs are sensitive to so-called shifts of the training distribution, where systematic differences between the train and test sets can significantly degrade performance. Distributional shifts can be induced by random or structured (adversarial) perturbations, changes in object or scene viewpoint, illumination, or color, and novel compositions of familiar features. These issues are magnified in domains where training data is scarce. In contrast, flexible and efficient generalization is a hallmark of biological perception and intelligence. We believe that the algorithms implemented in biological brains offer clues for how to construct artificial intelligence that can generalize beyond the training distribution.
The limited generalization of neural networks is a critical problem for artificial intelligence, in applications ranging from automated driving and biomedical image analysis, and domains like reinforcement learning, control, and representational theory. Our goal is to address these issues by creating synergies among neuroscientists, cognitive scientists, and artificial intelligence researchers that might lead to novel solutions to this problem or emphasize relevant existing classical work.

Workshop

Synthetic Data Generation: Quality, Privacy, Bias

Sergul Aydore · Krishnaram Kenthapadi · Haipeng Chen · Edward Choi · Jamie Hayes · Mario Fritz · Rachel Cummings · Krishnaram Kenthapadi

4:00 PM - 11:30 PM

Data are the most valuable ingredient of machine learning models to help researchers and companies make informed decisions. However, access to rich, diverse, and clean datasets may not always be possible. One of the reasons for the lack of rich datasets is the substantial amount of time needed for data collection, especially when manual annotation is required. Another reason is the need for protecting privacy, whenever raw data contains sensitive information about individuals and hence cannot be shared directly. A powerful solution that can address both of these challenging scenarios is generating synthetic data. Thanks to the recent advances in generative models, it is possible to create realistic synthetic samples that closely match the distribution of complex, real data. In the case of limited labeled data, synthetic data can be used to augment training data to mitigate overfitting. In the case of protecting privacy, data curators can share the synthetic data instead of the original data, where the utility of the original data is preserved but privacy is protected. Despite the substantial benefits from using synthetic data, the process of synthetic data generation is still an ongoing technical challenge. Although the two scenarios of limited data and privacy concerns share similar technical challenges such as quality and fairness, they are often studied separately. We will bring together researchers from both fields in order to discuss challenges and advances in synthetic data generation.

Workshop

Workshop on Weakly Supervised Learning

Benjamin Roth · Barbara Plank · Alex Ratner · Katharina Kann · Dietrich Klakow · Michael Hedderich

4:00 PM - 3:00 AM

Deep learning relies on massive training sets of labeled examples to learn from - often tens of thousands to millions to reach peak predictive performance. However, large amounts of training data are only available for very few standardized learning problems. Even small variations of the problem specification or changes in the data distribution would necessitate re-annotation of large amounts of data.

However, domain knowledge can often be expressed by sets of prototypical descriptions. These knowledge-based descriptions can be either used as rule-based predictors or as labeling functions for providing partial data annotations. The growing field of weak supervision provides methods for refining and generalizing such heuristic-based annotations in interaction with deep neural networks and large amounts of unannotated data.

In this workshop, we want to advance theory, methods and tools for allowing experts to express prior coded knowledge for automatic data annotations that can be used to train arbitrary deep neural networks for prediction. Learning with weak supervision is both studied from a theoretical perspective as well as applied to a variety of tasks from areas like natural language processing and computer vision. This workshop aims at bringing together researchers from this wide range of fields to facilitate discussions across research areas that share the common ground of using weak supervision. A target of this workshop is also to inspire applications of weak supervision to new scenarios and to enable researchers to work on tasks that so far have been considered too low-resource.

As weak supervision addresses one of the major issues of current machine learning techniques, the lack of labeled data, it has also started to obtain commercial interest. This workshop is an opportunity to bridge innovations from academia and the requirements of industry settings.

Workshop

2nd Workshop on Practical ML for Developing Countries: Learning Under Limited/low Resource Scenarios

Esube Bekele · Waheeda Saib · Timnit Gebru · Meareg Hailemariam · Vukosi Marivate · Judy Gichoya

4:00 PM - 9:15 PM

The constant progress being made in artificial intelligence needs to extend across borders if we are to democratize AI in developing countries. Adapting the state-of-the-art (SOTA) methods to resource constrained environments such as developing countries is challenging in practice. Recent breakthroughs in natural language processing (NLP), for instance, rely on increasingly complex and large models (e.g. most models based on transformers such as BERT, VilBERT, ALBERT, and GPT-2) that are pre-trained in on large corpus of unlabeled data. In most developing countries, low/limited resources means hard path towards adoption of these breakthroughs. Methods such as transfer learning will not fully solve the problem either due to bias in pre-training datasets that do not reflect real test cases in developing countries as well as the prohibitive cost of fine-tuning these large models. Recent progress with focus given to ML for social good has the potential to alleviate the problem in part. However, the themes in such workshops are usually application driven such as ML for healthcare and for education, and less attention is given to practical aspects as it relates to developing countries in implementing these solutions in low or limited resource scenarios. This, in turn, hinders the democratization of AI in developing countries. As a result, we aim to fill the gap by bringing together researchers, experts, policy makers and related stakeholders under the umbrella of practical ML for developing countries. The workshop is geared towards fostering collaborations and soliciting submissions under the broader theme of practical aspects of implementing machine learning (ML) solutions for problems in developing countries. We specifically encourage contributions that highlight challenges of learning under limited or low resource environments that are typical in developing countries.

Workshop

Workshop on Learning to Learn

Sarah Bechtle · Todor Davchev · Yevgen Chebotar · Timothy Hospedales · Franziska Meier

4:00 PM - 9:45 PM

Recent years have seen a lot of interest in the use and development of learning-to-learn algorithms. Research on learning-to-learn, or meta-learning, algorithms is often motivated by the hope to learn representations that can be easily transferred to the learning of new skills, and lead to faster learning. Yet, current meta-learned representations often struggle to generalize to novel task settings. In this workshop, we’d like to discuss how humans meta-learn, and what we can and should expect from learning-to-learn in the field of machine learning. Our aim is to bring together researchers from a variety of backgrounds with the hope to discuss and reason about what learning to learn means from a cognitive perspective, and how this knowledge might translate into algorithmic advances. In particular we are interested in creating a platform to enable the exchange between the fields of neuroscience and machine learning.
We believe that it is an important moment for the machine learning community to reflect upon these questions in order to advance the field and increase its variety in approaching learning to learn. We hope that by fostering discussions between cognitive science and machine learning researchers, we enable both sides to draw inspiration to further the understanding and development of learning-to-learn algorithms.

Workshop

Workshop on Enormous Language Models: Perspectives and Benchmarks

Colin Raffel · Adam Roberts · Amanda Askell · Daphne Ippolito · Ethan Dyer · Guy Gur-Ari · Jared Kaplan · Jascha Sohl-Dickstein · Katherine Lee · Melanie Subbiah · Sam McCandlish · Tom Brown · William Fedus · Vedant Misra · Ambrose Slone · Daniel Freeman

4:45 PM - 2:00 AM

Language models that have been trained on unlabeled text data are a cornerstone of modern natural language processing (NLP) research, and many recent state-of-the-art results in NLP were achieved by leveraging these self-supervised models. The success of this recipe is largely thanks to scalability: Better results can often be obtained by training larger models on larger amounts of unlabeled text data. This places our field at a crossroads. Will scaling lead to models that outperform humans on all text-based tasks, or are there limits to the scalability of these models? Should we focus on simply scaling these models, or should we design more sophisticated architectures and training schemes? Do our current benchmark effectively test capabilities that humans can master but large language models lack? How can we address the legal and ethical issues that arise from using unstructured web crawls for training language models? What can we learn from the fields of cognition, linguistics, and philosophy as we attempt to measure the “intelligence” of machines? The goal of this workshop is to find answers to these questions by inviting a diverse group of researchers to critically examine the state of giant language models.

This workshop will have a non-standard submission format: Rather than submitting research papers, participants will be invited to contribute diverse tasks that they believe measure uniquely human or particularly challenging capabilities for large language models. Teams at Google and OpenAI have committed to evaluate this task set on their best-performing model architectures, across models spanning from tens of thousands through hundreds of billions or more of parameters. Researchers will also be invited to contribute and evaluate their own models on these tasks. We will analyze these experiments, and report the results at the workshop, with a particular focus on how model performance on different task types scales with model size. By inviting contributions of tasks or models, we provide a means for researchers to participate whether or not they have the (cost-prohibitive) computational resources to train giant language models. The end result will be the Beyond the Imitation Game Benchmark (BIG Bench): A novel participant-driven test of the limits of giant language models. Find out more about BIG Bench and participate here.

Workshop

ICLR 2021 Workshop on Embodied Multimodal Learning (EML)

Ruohan Gao · Andrew Owens · Dinesh Jayaraman · Yuke Zhu · Jiajun Wu · Kristen Grauman

4:55 PM - 12:35 AM

Despite encouraging progress in embodied learning over the past two decades, there is still a large gap between embodied agents' perception and human perception. Humans have remarkable capabilities combining all our multisensory inputs. To close the gap, embodied agents should also be enabled to see, hear, touch, and interact with their surroundings in order to select the appropriate actions. However, today's learning algorithms primarily operate on a single modality. In order for Artificial Intelligence to make progress in understanding the world around us, it needs to be able to interpret such multimodal signals jointly. The goal of this workshop is to share recent progress and discuss current challenges on embodied learning with multiple modalities.

The EML workshop will bring together researchers in different subareas of embodied multimodal learning including computer vision, robotics, machine learning, natural language processing, and cognitive science to examine the challenges and opportunities emerging from the design of embodied agents that unify their multisensory inputs. We will review the current state and identify the research infrastructure needed to enable a stronger collaboration between researchers working on different modalities.

Workshop

Robust and reliable machine learning in the real world

Di Jin · Eric Wong · Yonatan Belinkov · Kai-Wei Chang · Zhijing Jin · Yanjun Qi · Aditi Raghunathan · Tristan Naumann · Mohit Bansal

5:00 PM - 2:05 AM

As machine learning (ML) is deployed pervasively, there is an increasing demand for ML systems to behave reliably when the input to the system has changed. Much work has emerged regarding artificial and natural changes to data, with a growing interest towards studying robustness and reliability of ML systems in the presence of real-world changes. This shift towards more realistic considerations raises both old and new fundamental questions for machine learning:
1. Can we bring principled research in robustness closer to real-world effects?
2. How can we demonstrate the reliability of ML systems in real-world deployments?
3. What are the unique societal and legal challenges facing robustness for deployed ML systems?
Consequently, the goal of this workshop is to bring together research in robust machine learning with the demands and reliability constraints of real-world processes and systems, with a focus on the practical, theoretical, and societal challenges in bringing these approaches to real world-scenarios. We highlight emerging directions, paradigms, and applications which include 1. Characterizing real-world changes for robustness; 2. Reliability of real-world systems; 3. Societal and legal considerations.

Workshop

Workshop on Distributed and Private Machine Learning

Niloofar Mireshghallah · Praneeth Vepakomma · Ayush Chopra · Vivek Sharma · Abhishek Singh · Adam Smith · Ramesh Raskar · Gautam Kamath · Reza Shokri

5:30 PM - 10:00 PM

Over the last decade, progress in machine learning has resulted in a surge of data-driven services affecting our daily lives. Conversational agents, healthcare providers, online retailers, and social networks continually access and jointly process vast amounts of data about their geographically distributed customers. Progress in distributed machine learning technology which has enabled widespread adoption and personalization has also raised issues regarding privacy, accountability, and fairness. This tension is particularly apparent in the context of the Covid-19 pandemic. This motivates the need to jointly address distributed and private machine learning technologies.

Workshop

Security and Safety in Machine Learning Systems

Xinyun Chen · Cihang Xie · Ali Shafahi · Bo Li · Ding Zhao · Tom Goldstein · Dawn Song

5:45 PM - 3:00 AM

While machine learning (ML) models have achieved great success in many applications, concerns have been raised about their potential vulnerabilities and risks when applied to safety-critical applications. On the one hand, from the security perspective, studies have been conducted to explore worst-case attacks against ML models and therefore inspire both empirical and certifiable defense approaches. On the other hand, from the safety perspective, researchers have looked into safe constraints, which should be satisfied by safe AI systems (e.g. autonomous driving vehicles should not hit pedestrians). This workshop makes the first attempts towards bridging the gap of these two communities and aims to discuss principles of developing secure and safe ML systems. The workshop also focuses on how future practitioners should prepare themselves for reducing the risks of unintended behaviors of sophisticated ML models.

The workshop will bring together experts from machine learning, computer security, and AI safety communities. We attempt to highlight recent related work from different communities, clarify the foundations of secure and safe ML, and chart out important directions for future work and cross-community collaborations.

Workshop

Machine Learning for Preventing and Combating Pandemics

Pengtao Xie · Xiaodan Liang · Jure Leskovec · Judy Wawira · Jeremy Weiss · Manuel Gomez Rodriguez · Madalina Fiterau · Yueyu Jiang · Leo Celi · Eric P Xing

5:45 PM - 2:50 AM

Pandemics are major disasters in human history. The recent COVID-19 pandemic has caused about 0.52 million deaths and infected about 11 million people all over the world as of July 3. In the past two decades, several pandemics/ epidemics including Zika, SARS, Ebola, H1N1 Flu, etc. have killed a large number of people. Medical experts predict that future pandemics will periodically occur and may be even worse than past ones. Since the outbreak of COVID-19, AI researchers have been developing methods to combat this pandemic, including building forecasting models to predict the spread of coronavirus, developing computer vision methods to analyze CT scans and chest X-rays for screening and risk assessment of infected cases, leveraging computational biology methods for vaccine development, etc. These efforts have shown high utility in controlling the spread of COVID-19 and pave a promising way for preventing future pandemics. To further promote research on AI-based control of pandemics, we aim to organize a workshop which brings together researchers in machine learning, healthcare, medicine, public health, etc. and facilitates discussions and collaborations in developing machine learning and AI methods to diagnose and treat infectious diseases and prevent and contain pandemics. Different from previous healthcare-related workshops, our workshop focuses on infectious diseases and health problems related to pandemic.

Workshop

Deep Learning for Simulation

Rex Ying · Tailin Wu · Peter Battaglia · Rose Yu · Ryan P Adams · Jure Leskovec

5:45 PM - 2:00 AM

Recently there has been a surge in interest in using deep learning to facilitate simulation, in application areas including physics, chemistry, robotics and graphics.
We define simulation as the process of iteratively generating output of the next time step using the output of the previous time step as input starting from an initial condition. The primary motivation of the workshop is thus to encourage knowledge sharing and communication. Recent works have started to actively explore the potential of using deep learning to improve these highly important simulations in terms of accuracy and efficiency. We believe that this workshop will bring these communities together, create communication and collaboration, in order to speed-up research on this important topic.