Skip to yearly menu bar Skip to main content


In-Person Poster presentation / poster accept

Protein Sequence and Structure Co-Design with Equivariant Translation

Chence Shi · Chuanrui Wang · Jiarui Lu · Bozitao Zhong · Jian Tang

MH1-2-3-4 #73

Keywords: [ Machine Learning for Sciences ] [ sequence structure co-design ] [ geometric deep learning ] [ protein design ] [ equivariant translation ]


Abstract:

Proteins are macromolecules that perform essential functions in all living organisms. Designing novel proteins with specific structures and desired functions has been a long-standing challenge in the field of bioengineering. Existing approaches generate both protein sequence and structure using either autoregressive models or diffusion models, both of which suffer from high inference costs. In this paper, we propose a new approach capable of protein sequence and structure co-design, which iteratively translates both protein sequence and structure into the desired state from random initialization, based on context features given a priori. Our model consists of a trigonometry-aware encoder that reasons geometrical constraints and interactions from context features, and a roto-translation equivariant decoder that translates protein sequence and structure interdependently. Notably, all protein amino acids are updated in one shot in each translation step, which significantly accelerates the inference process. Experimental results across multiple tasks show that our model outperforms previous state-of-the-art baselines by a large margin, and is able to design proteins of high fidelity as regards both sequence and structure, with running time orders of magnitude less than sampling-based methods.

Chat is not available.