Fast and Sample Near-Optimal Algorithms for Learning Multidimensional Histograms

02/23/2018
by   Ilias Diakonikolas, et al.
0

We study the problem of robustly learning multi-dimensional histograms. A d-dimensional function h: D →R is called a k-histogram if there exists a partition of the domain D ⊆R^d into k axis-aligned rectangles such that h is constant within each such rectangle. Let f: D →R be a d-dimensional probability density function and suppose that f is OPT-close, in L_1-distance, to an unknown k-histogram (with unknown partition). Our goal is to output a hypothesis that is O(OPT) + ϵ close to f, in L_1-distance. We give an algorithm for this learning problem that uses n = Õ_d(k/ϵ^2) samples and runs in time Õ_d(n). For any fixed dimension, our algorithm has optimal sample complexity, up to logarithmic factors, and runs in near-linear time. Prior to our work, the time complexity of the d=1 case was well-understood, but significant gaps in our understanding remained even for d=2.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2018

Testing Identity of Multidimensional Histograms

We investigate the problem of identity testing for multidimensional hist...
research
07/14/2022

Near-Optimal Bounds for Testing Histogram Distributions

We investigate the problem of testing whether a discrete probability dis...
research
02/19/2014

Near-optimal-sample estimators for spherical Gaussian mixtures

Statistical and machine-learning algorithms are frequently applied to hi...
research
09/21/2019

Efficient Learning of Distributed Linear-Quadratic Controllers

In this work, we propose a robust approach to design distributed control...
research
08/17/2021

Statistically Near-Optimal Hypothesis Selection

Hypothesis Selection is a fundamental distribution learning problem wher...
research
07/25/2022

Improved Bounds for Sampling Solutions of Random CNF Formulas

Let Φ be a random k-CNF formula on n variables and m clauses, where each...
research
07/22/2019

The k-Dimensional Weisfeiler-Leman Algorithm

In this note, we provide details of the k-dimensional Weisfeiler-Leman A...

Please sign up or login with your details

Forgot password? Click here to reset