Representational Power of ReLU Networks and Polynomial Kernels: Beyond Worst-Case Analysis

05/29/2018
by   Frederic Koehler, et al.
0

There has been a large amount of interest, both in the past and particularly recently, into the power of different families of universal approximators, e.g. ReLU networks, polynomials, rational functions. However, current research has focused almost exclusively on understanding this problem in a worst-case setting, e.g. bounding the error of the best infinity-norm approximation in a box. In this setting a high-degree polynomial is required to even approximate a single ReLU. However, in real applications with high dimensional data we expect it is only important to approximate the desired function well on certain relevant parts of its domain. With this motivation, we analyze the ability of neural networks and polynomial kernels of bounded degree to achieve good statistical performance on a simple, natural inference problem with sparse latent structure. We give almost-tight bounds on the performance of both neural networks and low degree polynomials for this problem. Our bounds for polynomials involve new techniques which may be of independent interest and show major qualitative differences with what is known in the worst-case setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/11/2017

Neural networks and rational functions

Neural networks and rational functions efficiently approximate each othe...
research
02/13/2022

Beyond Worst-Case Analysis for Root Isolation Algorithms

Isolating the real roots of univariate polynomials is a fundamental prob...
research
09/09/2019

PowerNet: Efficient Representations of Polynomials and Smooth Functions by Deep Neural Networks with Rectified Power Units

Deep neural network with rectified linear units (ReLU) is getting more a...
research
05/23/2021

Precise Approximation of Convolutional Neural Networks for Homomorphically Encrypted Data

Homomorphic encryption is one of the representative solutions to privacy...
research
05/22/2018

Gossip of Statistical Observations using Orthogonal Polynomials

Consider a network of agents connected by communication links, where eac...
research
05/18/2023

Worst-Case VCG Redistribution Mechanism Design Based on the Lottery Ticket Hypothesis

We study worst-case VCG redistribution mechanism design for the public p...
research
09/26/2019

Towards neural networks that provably know when they don't know

It has recently been shown that ReLU networks produce arbitrarily over-c...

Please sign up or login with your details

Forgot password? Click here to reset