Multivariate, Heteroscedastic Empirical Bayes via Nonparametric Maximum Likelihood

09/08/2021
by   Jake A. Soloff, et al.
0

Multivariate, heteroscedastic errors complicate statistical inference in many large-scale denoising problems. Empirical Bayes is attractive in such settings, but standard parametric approaches rest on assumptions about the form of the prior distribution which can be hard to justify and which introduce unnecessary tuning parameters. We extend the nonparametric maximum likelihood estimator (NPMLE) for Gaussian location mixture densities to allow for multivariate, heteroscedastic errors. NPMLEs estimate an arbitrary prior by solving an infinite-dimensional, convex optimization problem; we show that this convex optimization problem can be tractably approximated by a finite-dimensional version. We introduce a dual mixture density whose modes contain the atoms of every NPMLE, and we leverage the dual both to show non-uniqueness in multivariate settings as well as to construct explicit bounds on the support of the NPMLE. The empirical Bayes posterior means based on an NPMLE have low regret, meaning they closely target the oracle posterior means one would compute with the true prior in hand. We prove an oracle inequality implying that the empirical Bayes estimator performs at nearly the optimal level (up to logarithmic factors) for denoising without prior knowledge. We provide finite-sample bounds on the average Hellinger accuracy of an NPMLE for estimating the marginal densities of the observations. We also demonstrate the adaptive and nearly-optimal properties of NPMLEs for deconvolution. We apply the method to two astronomy datasets, constructing a fully data-driven color-magnitude diagram of 1.4 million stars in the Milky Way and investigating the distribution of chemical abundance ratios for 27 thousand stars in the red clump.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2017

On the nonparametric maximum likelihood estimator for Gaussian location mixture densities with application to Gaussian denoising

We study the Nonparametric Maximum Likelihood Estimator (NPMLE) for esti...
research
03/06/2023

Empirical partially Bayes multiple testing and compound χ^2 decisions

We study multiple testing in the normal means problem with estimated var...
research
08/16/2022

On Efficient and Scalable Computation of the Nonparametric Maximum Likelihood Estimator in Mixture Models

In this paper we study the computation of the nonparametric maximum like...
research
02/07/2020

Empirical Bayes for Large-scale Randomized Experiments: a Spectral Approach

Large-scale randomized experiments, sometimes called A/B tests, are incr...
research
09/20/2023

No need for an oracle: the nonparametric maximum likelihood decision in the compound decision problem is minimax

We discuss the asymptotics of the nonparametric maximum likelihood estim...
research
08/22/2021

A Nonparametric Maximum Likelihood Approach to Mixture of Regression

Mixture of regression models are useful for regression analysis in heter...
research
07/05/2023

Empirical Bayes via ERM and Rademacher complexities: the Poisson model

We consider the problem of empirical Bayes estimation for (multivariate)...

Please sign up or login with your details

Forgot password? Click here to reset