Uncovering the Folding Landscape of RNA Secondary Structure with Deep Graph Embeddings

06/12/2020
by   Egbert Castro, et al.
0

Biomolecular graph analysis has recently gained much attention in the emerging field of geometric deep learning. While numerous approaches aim to train classifiers that accurately predict molecular properties from graphs that encode their structure, an equally important task is to organize biomolecular graphs in ways that expose meaningful relations and variations between them. We propose a geometric scattering autoencoder (GSAE) network for learning such graph embeddings. Our embedding network first extracts rich graph features using the recently proposed geometric scattering transform. Then, it leverages a semi-supervised variational autoencoder to extract a low-dimensional embedding that retains the information in these features that enable prediction of molecular properties as well as characterize graphs. Our approach is based on the intuition that geometric scattering generates multi-resolution features with in-built invariance to deformations, but as they are unsupervised, these features may not be tuned for optimally capturing relevant domain-specific properties. We demonstrate the effectiveness of our approach to data exploration of RNA foldings. Like proteins, RNA molecules can fold to create low energy functional structures such as hairpins, but the landscape of possible folds and fold sequences are not well visualized by existing methods. We show that GSAE organizes RNA graphs both by structure and energy, accurately reflecting bistable RNA structures. Furthermore, it enables interpolation of embedded molecule sequences mimicking folding trajectories. Finally, using an auxiliary inverse-scattering model, we demonstrate our ability to generate synthetic RNA graphs along the trajectory thus providing hypothetical folding sequences for further analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2021

Molecular Graph Generation via Geometric Scattering

Graph neural networks (GNNs) have been used extensively for addressing p...
research
09/14/2023

Directed Scattering for Knowledge Graph-based Cellular Signaling Analysis

Directed graphs are a natural model for many phenomena, in particular sc...
research
08/22/2019

Tiered Graph Autoencoders with PyTorch Geometric for Molecular Graphs

Tiered latent representations and latent spaces for molecular graphs pro...
research
08/17/2022

Geometric Scattering on Measure Spaces

The scattering transform is a multilayered, wavelet-based transform init...
research
09/29/2020

Physics-Constrained Predictive Molecular Latent Space Discovery with Graph Scattering Variational Autoencoder

Recent advances in artificial intelligence have propelled the developmen...
research
06/26/2022

Edge Direction-invariant Graph Neural Networks for Molecular Dipole Moments Prediction

The dipole moment is a physical quantity indicating the polarity of a mo...

Please sign up or login with your details

Forgot password? Click here to reset