Skip to yearly menu bar Skip to main content


Spotlight presentation
in
Workshop: Physics for Machine Learning

Learning protein family manifolds with smoothed energy-based models

Nathan Frey


Abstract:

We resolve difficulties in training and sampling from discrete energy-based models (EBMs) by learning a smoothed energy landscape, sampling the smoothed data manifold with Langevin Markov chain Monte Carlo, and projecting back to the true data manifold with one-step denoising. Our formalism combines the attractive properties of EBMs and improved sample quality of score-based models, while simplifying training and sampling by requiring only a single noise scale. We demonstrate the robustness of our approach on generative modeling of antibody proteins.

Chat is not available.