Unsupervised learning of dynamical and molecular similarity using variance minimization

12/20/2017
by   Brooke E. Husic, et al.
0

In this report, we present an unsupervised machine learning method for determining groups of molecular systems according to similarity in their dynamics or structures using Ward's minimum variance objective function. We first apply the minimum variance clustering to a set of simulated tripeptides using the information theoretic Jensen-Shannon divergence between Markovian transition matrices in order to gain insight into how point mutations affect protein dynamics. Then, we extend the method to partition two chemoinformatic datasets according to structural similarity to motivate a train/validation/test split for supervised learning that avoids overfitting.

READ FULL TEXT
research
09/28/2018

A kernel-based approach to molecular conformation analysis

We present a novel machine learning approach to understanding conformati...
research
11/07/2016

An Information-Theoretic Framework for Fast and Robust Unsupervised Learning via Neural Population Infomax

A framework is presented for unsupervised learning of representations ba...
research
02/18/2019

Graph Dynamical Networks: Unsupervised Learning of Atomic Scale Dynamics in Materials

Understanding the dynamical processes that govern the performance of fun...
research
11/19/2015

Towards Principled Unsupervised Learning

General unsupervised learning is a long-standing conceptual problem in m...
research
10/05/2016

Learning Protein Dynamics with Metastable Switching Systems

We introduce a machine learning approach for extracting fine-grained rep...
research
08/05/2020

Protein Conformational States: A First Principles Bayesian Method

Automated identification of protein conformational states from simulatio...
research
01/20/2015

Regroupement sémantique de définitions en espagnol

This article focuses on the description and evaluation of a new unsuperv...

Please sign up or login with your details

Forgot password? Click here to reset