Scaling up the Automatic Statistician: Scalable Structure Discovery using Gaussian Processes

06/08/2017
by   Hyunjik Kim, et al.
0

Automating statistical modelling is a challenging problem that has far-reaching implications for artificial intelligence. The Automatic Statistician employs a kernel search algorithm to provide a first step in this direction for regression problems. However this does not scale due to its O(N^3) running time for the model selection. This is undesirable not only because the average size of data sets is growing fast, but also because there is potentially more information in bigger data, implying a greater need for more expressive models that can discover finer structure. We propose Scalable Kernel Composition (SKC), a scalable kernel search algorithm, to encompass big data within the boundaries of automated statistical modelling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2022

Structural Kernel Search via Bayesian Optimization and Symbolical Optimal Transport

Despite recent advances in automated machine learning, model selection i...
research
12/21/2020

Learning Compositional Sparse Gaussian Processes with a Shrinkage Prior

Choosing a proper set of kernel functions is an important problem in lea...
research
08/22/2019

Clustered Hierarchical Entropy-Scaling Search of Astronomical and Biological Data

Both astronomy and biology are experiencing explosive growth of data, re...
research
04/11/2017

Parametric Gaussian Process Regression for Big Data

This work introduces the concept of parametric Gaussian processes (PGPs)...
research
07/10/2018

Fast Model-Selection through Adapting Design of Experiments Maximizing Information Gain

To perform model-selection efficiently, we must run informative experime...
research
02/23/2017

A Unified Parallel Algorithm for Regularized Group PLS Scalable to Big Data

Partial Least Squares (PLS) methods have been heavily exploited to analy...
research
02/18/2014

Automatic Construction and Natural-Language Description of Nonparametric Regression Models

This paper presents the beginnings of an automatic statistician, focusin...

Please sign up or login with your details

Forgot password? Click here to reset