A General Characterization of the Statistical Query Complexity

08/07/2016
by   Vitaly Feldman, et al.
0

Statistical query (SQ) algorithms are algorithms that have access to an SQ oracle for the input distribution D instead of i.i.d. samples from D. Given a query function ϕ:X → [-1,1], the oracle returns an estimate of E_ x∼ D[ϕ(x)] within some tolerance τ_ϕ that roughly corresponds to the number of samples. In this work we demonstrate that the complexity of solving general problems over distributions using SQ algorithms can be captured by a relatively simple notion of statistical dimension that we introduce. SQ algorithms capture a broad spectrum of algorithmic approaches used in theory and practice, most notably, convex optimization techniques. Hence our statistical dimension allows to investigate the power of a variety of algorithmic approaches by analyzing a single linear-algebraic parameter. Such characterizations were investigated over the past 20 years in learning theory but prior characterizations are restricted to the much simpler setting of classification problems relative to a fixed distribution on the domain (Blum et al., 1994; Bshouty and Feldman, 2002; Yang, 2001; Simon, 2007; Feldman, 2012; Szorenyi, 2009). Our characterization is also the first to precisely characterize the necessary tolerance of queries. We give applications of our techniques to two open problems in learning theory and to algorithms that are subject to memory and communication constraints.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/20/2016

Dealing with Range Anxiety in Mean Estimation via Statistical Queries

We give algorithms for estimating the expectation of a given real-valued...
research
02/09/2023

Quadratic Memory is Necessary for Optimal Query Complexity in Convex Optimization: Center-of-Mass is Pareto-Optimal

We give query complexity lower bounds for convex optimization and the re...
research
07/01/2019

Open Problem: The Oracle Complexity of Convex Optimization with Limited Memory

We note that known methods achieving the optimal oracle complexity for f...
research
06/29/2020

Statistical-Query Lower Bounds via Functional Gradients

We give the first statistical-query lower bounds for agnostically learni...
research
06/02/2022

A Scalable Shannon Entropy Estimator

We revisit the well-studied problem of estimating the Shannon entropy of...
research
10/18/2020

Robust Learning under Strong Noise via SQs

This work provides several new insights on the robustness of Kearns' sta...
research
07/17/2018

Parameterized Query Complexity of Hitting Set using Stability of Sunflowers

In this paper, we study the query complexity of parameterized decision a...

Please sign up or login with your details

Forgot password? Click here to reset