Algebraic Machine Learning with an Application to Chemistry

05/11/2022
by   Ezzeddine El Sai, et al.
1

As datasets used in scientific applications become more complex, studying the geometry and topology of data has become an increasingly prevalent part of the data analysis process. This can be seen for example with the growing interest in topological tools such as persistent homology. However, on the one hand, topological tools are inherently limited to providing only coarse information about the underlying space of the data. On the other hand, more geometric approaches rely predominately on the manifold hypothesis, which asserts that the underlying space is a smooth manifold. This assumption fails for many physical models where the underlying space contains singularities. In this paper we develop a machine learning pipeline that captures fine-grain geometric information without having to rely on any smoothness assumptions. Our approach involves working within the scope of algebraic geometry and algebraic varieties instead of differential geometry and smooth manifolds. In the setting of the variety hypothesis, the learning problem becomes to find the underlying variety using sample data. We cast this learning problem into a Maximum A Posteriori optimization problem which we solve in terms of an eigenvalue computation. Having found the underlying variety, we explore the use of Gröbner bases and numerical methods to reveal information about its geometry. In particular, we propose a heuristic for numerically detecting points lying near the singular locus of the underlying variety.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2017

An introduction to Topological Data Analysis: fundamental and practical aspects for data scientists

Topological Data Analysis (tda) is a recent and fast growing eld providi...
research
02/01/2020

General witness sets for numerical algebraic geometry

Numerical algebraic geometry has a close relationship to intersection th...
research
12/07/2021

Towards Modeling and Resolving Singular Parameter Spaces using Stratifolds

When analyzing parametric statistical models, a useful approach consists...
research
07/31/2019

Persistent Intersection Homology for the Analysis of Discrete Data

Topological data analysis is becoming increasingly relevant to support t...
research
06/17/2022

ICLR 2022 Challenge for Computational Geometry and Topology: Design and Results

This paper presents the computational challenge on differential geometry...
research
10/26/2021

Topologically penalized regression on manifolds

We study a regression problem on a compact manifold M. In order to take ...
research
04/03/2021

Joint Geometric and Topological Analysis of Hierarchical Datasets

In a world abundant with diverse data arising from complex acquisition t...

Please sign up or login with your details

Forgot password? Click here to reset