Contrastive Representation Learning for 3D Protein Structures

05/31/2022
by Pedro Hermosilla, et al.

Learning from 3D protein structures has gained wide interest in protein modeling and structural bioinformatics. Unfortunately, the number of available structures is orders of magnitude lower than the training data sizes commonly used in computer vision and machine learning. This number shrinks even further when only annotated protein structures can be considered, which makes training existing models difficult and prone to over-fitting. To address this challenge, we introduce a new representation learning framework for 3D protein structures. Our framework uses unsupervised contrastive learning to learn meaningful representations of protein structures, making use of proteins from the Protein Data Bank. We show how these representations can be used to solve a large variety of tasks, such as protein function prediction, protein fold classification, structural similarity prediction, and protein-ligand binding affinity prediction. Moreover, we show how fine-tuned networks, pre-trained with our algorithm, lead to significantly improved task performance, achieving new state-of-the-art results in many tasks.
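The abstract does not state which contrastive objective the framework uses. As an illustration only, the following minimal sketch shows an InfoNCE-style loss, a common choice in contrastive learning. It assumes that each protein yields two embedding vectors (two "views"), that views of the same protein form a positive pair, and that views of different proteins serve as negatives. The function names, the `temperature` value, and the toy embeddings are hypothetical and are not taken from the paper.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two non-zero embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(views_a, views_b, temperature=0.1):
    """Mean InfoNCE loss over a batch of embedding pairs.

    Assumption for this sketch: views_a[i] and views_b[i] embed two views
    of the same protein (positive pair); views_b[j] with j != i are negatives.
    """
    n = len(views_a)
    total = 0.0
    for i in range(n):
        # Temperature-scaled similarities of anchor i to every candidate.
        logits = [
            cosine_similarity(views_a[i], views_b[j]) / temperature
            for j in range(n)
        ]
        # Numerically stable log-sum-exp over all candidates.
        max_logit = max(logits)
        log_sum_exp = max_logit + math.log(
            sum(math.exp(value - max_logit) for value in logits)
        )
        # Negative log-probability of picking the positive candidate.
        total += log_sum_exp - logits[i]
    return total / n


# Toy embeddings (hypothetical): two proteins, two dimensions.
anchors = [[1.0, 0.0], [0.0, 1.0]]
matched = [[1.0, 0.0], [0.0, 1.0]]      # positives line up with anchors
mismatched = [[0.0, 1.0], [1.0, 0.0]]   # positives are swapped

loss_matched = info_nce_loss(anchors, matched)
loss_mismatched = info_nce_loss(anchors, mismatched)
```

In this sketch the loss is small when each anchor is most similar to its own positive and large when it is more similar to another protein's view, which is the behavior a contrastive pre-training objective of this kind rewards.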


