Bayesian semi-supervised learning for uncertainty-calibrated prediction of molecular properties and active learning

02/03/2019
by Yao Zhang, et al.

Predicting the bioactivity and physical properties of small molecules is a central challenge in drug discovery. Deep learning is becoming the method of choice, but studies to date focus on mean accuracy as the main metric. However, to replace costly and mission-critical experiments with models, high mean accuracy is not enough. Outliers can derail a discovery campaign, so models need to reliably predict when they will fail, even when the training data is biased; experiments are expensive, so models need to be data-efficient and suggest informative training sets using active learning. We show that uncertainty quantification and active learning can be achieved by Bayesian semi-supervised graph convolutional neural networks. The Bayesian approach estimates uncertainty in a statistically principled way through sampling from the posterior distribution. Semi-supervised learning disentangles representation learning and regression, keeping uncertainty estimates accurate in the low-data limit and allowing the model to start active learning from a small initial pool of training data. Our study highlights the promise of Bayesian deep learning for chemistry.
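
As a rough illustration of the two ideas in the abstract (uncertainty from posterior samples, and uncertainty-driven active learning), the sketch below uses Bayesian linear regression on fixed feature vectors as a stand-in. It is not the paper's graph convolutional model. The function names, the Gaussian prior, the noise variance, and the "pick the highest predictive standard deviation" acquisition rule are all assumptions made for this example. The fixed features loosely play the role of a representation learned separately from the regression step.

```python
import numpy as np

def posterior_samples(X, y, n_samples=200, alpha=1.0, noise_var=0.1, rng=None):
    """Sample weights from the exact posterior of Bayesian linear regression.

    Assumed model (illustrative, not from the paper):
    prior w ~ N(0, I / alpha), likelihood y ~ N(X w, noise_var).
    """
    rng = np.random.default_rng(rng)
    d = X.shape[1]
    precision = alpha * np.eye(d) + X.T @ X / noise_var
    cov = np.linalg.inv(precision)
    mean = cov @ X.T @ y / noise_var
    return rng.multivariate_normal(mean, cov, size=n_samples)

def predictive_mean_std(W, X_pool):
    """Mean and spread of predictions across posterior weight samples."""
    preds = X_pool @ W.T  # shape: (n_pool, n_samples)
    return preds.mean(axis=1), preds.std(axis=1)

def select_most_uncertain(W, X_pool, k=1):
    """Hypothetical acquisition rule: indices of the k pool points
    with the largest predictive standard deviation."""
    _, std = predictive_mean_std(W, X_pool)
    return np.argsort(std)[::-1][:k]
```

In an active learning loop under these assumptions, the selected pool points would be labelled by experiment, appended to `X` and `y`, and the posterior re-sampled before the next selection round.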

research
04/04/2023

Incorporating Unlabelled Data into Bayesian Neural Networks

We develop a contrastive framework for learning better prior distributio...
research
07/07/2020

ASGN: An Active Semi-supervised Graph Neural Network for Molecular Property Prediction

Molecular property prediction (e.g., energy) is an essential problem in ...
research
01/07/2020

A semi-supervised learning framework for quantitative structure-activity regression modelling

Supervised learning models, also known as quantitative structure-activit...
research
02/08/2023

Revisiting Deep Active Learning for Semantic Segmentation

Active learning automatically selects samples for annotation from a data...
research
11/06/2020

Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?

While uncertainty estimation is a well-studied topic in deep learning, m...
research
11/29/2018

The Relevance of Bayesian Layer Positioning to Model Uncertainty in Deep Bayesian Active Learning

One of the main challenges of deep learning tools is their inability to ...
research
10/18/2021

A-Optimal Active Learning

In this work we discuss the problem of active learning. We present an ap...
