When does deep learning fail and how to tackle it? A critical analysis on polymer sequence-property surrogate models

10/12/2022
by   himanshu, et al.
0

Deep learning models are gaining popularity and potency in predicting polymer properties. These models can be built using pre-existing data and are useful for the rapid prediction of polymer properties. However, the performance of a deep learning model is intricately connected to its topology and the volume of training data. There is no facile protocol available to select a deep learning architecture, and there is a lack of a large volume of homogeneous sequence-property data of polymers. These two factors are the primary bottleneck for the efficient development of deep learning models. Here we assess the severity of these factors and propose new algorithms to address them. We show that a linear layer-by-layer expansion of a neural network can help in identifying the best neural network topology for a given problem. Moreover, we map the discrete sequence space of a polymer to a continuous one-dimensional latent space using a machine learning pipeline to identify minimal data points for building a universal deep learning model. We implement these approaches for three representative cases of building sequence-property surrogate models, viz., the single-molecule radius of gyration of a copolymer, adhesive free energy of a copolymer, and copolymer compatibilizer, demonstrating the generality of the proposed strategies. This work establishes efficient methods for building universal deep learning models with minimal data and hyperparameters for predicting sequence-defined properties of polymers.

READ FULL TEXT

page 4

page 7

page 8

page 10

page 11

page 12

research
08/14/2021

Investigating the Relationship Between Dropout Regularization and Model Complexity in Neural Networks

Dropout Regularization, serving to reduce variance, is nearly ubiquitous...
research
03/29/2023

A Comprehensive and Versatile Multimodal Deep Learning Approach for Predicting Diverse Properties of Advanced Materials

We present a multimodal deep learning (MDL) framework for predicting phy...
research
01/15/2019

Comparing two deep learning sequence-based models for protein-protein interaction prediction

Biological data are extremely diverse, complex but also quite sparse. Th...
research
10/05/2017

How Much Chemistry Does a Deep Neural Network Need to Know to Make Accurate Predictions?

In the last few years, we have seen the rise of deep learning applicatio...
research
06/01/2023

Adversarial-Aware Deep Learning System based on a Secondary Classical Machine Learning Verification Approach

Deep learning models have been used in creating various effective image ...
research
12/29/2019

Deep learning surrogate models for spatial and visual connectivity

Spatial and visual connectivity are important metrics when developing wo...
research
10/07/2020

Combination of digital signal processing and assembled predictive models facilitates the rational design of proteins

Predicting the effect of mutations in proteins is one of the most critic...

Please sign up or login with your details

Forgot password? Click here to reset