Interpolating Discriminant Functions in High-Dimensional Gaussian Latent Mixtures

10/25/2022
by   Xin Bing, et al.
0

This paper considers binary classification of high-dimensional features under a postulated model with a low-dimensional latent Gaussian mixture structure and non-vanishing noise. A generalized least squares estimator is used to estimate the direction of the optimal separating hyperplane. The estimated hyperplane is shown to interpolate on the training data. While the direction vector can be consistently estimated as could be expected from recent results in linear regression, a naive plug-in estimate fails to consistently estimate the intercept. A simple correction, that requires an independent hold-out sample, renders the procedure minimax optimal in many scenarios. The interpolation property of the latter procedure can be retained, but surprisingly depends on the way the labels are encoded.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2023

High-dimensional latent Gaussian count time series: Concentration results for autocovariances and applications

This work considers stationary vector count time series models defined v...
research
11/13/2021

Minimax Supervised Clustering in the Anisotropic Gaussian Mixture Model: A new take on Robust Interpolation

We study the supervised clustering problem under the two-component aniso...
research
10/23/2022

Optimal Discriminant Analysis in High-Dimensional Latent Factor Models

In high-dimensional classification problems, a commonly used approach is...
research
02/17/2023

Are Gaussian data all you need? Extents and limits of universality in high-dimensional generalized linear estimation

In this manuscript we consider the problem of generalized linear estimat...
research
07/13/2023

A zero-estimator approach for estimating the signal level in a high-dimensional regression setting

Analysis of high-dimensional data, where the number of covariates is lar...
research
02/20/2022

Memorize to Generalize: on the Necessity of Interpolation in High Dimensional Linear Regression

We examine the necessity of interpolation in overparameterized models, t...
research
09/30/2014

Hyper-Spectral Image Analysis with Partially-Latent Regression and Spatial Markov Dependencies

Hyper-spectral data can be analyzed to recover physical properties at la...

Please sign up or login with your details

Forgot password? Click here to reset