Concentration of the missing mass in metric spaces

06/04/2022
by   Andreas Maurer, et al.
0

We study the estimation of the probability to observe data further than a specified distance from a given iid sample in a metric space. The problem extends the classical problem of estimation of the missing mass in discrete spaces. We show that estimation is difficult in general and identify conditions on the distribution, under which the Good-Turing estimator and the conditional missing mass concentrate on their expectations. Applications to supervised learning are sketched.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset