Determinantal Point Processes Implicitly Regularize Semi-parametric Regression Problems

by   Michaël Fanuel, et al.

Semi-parametric regression models are used in several applications which require comprehensibility without sacrificing accuracy. Examples are spline interpolation in geophysics, or non-linear time series problems, where the system includes for instance a linear and non-linear component. We discuss here the use of a finite Determinantal Point Process (DPP) sampling for approximating semi-parametric models in two cases. On the one hand, in the case of large training data sets, DPP sampling is used to reduce the number of model parameters. On the other hand, DPPs can determine experimental designs in the case of the optimal interpolation models. Recently, Barthelmé, Tremblay, Usevich, and Amblard introduced a novel representation of finite DPP's. They formulated extended L-ensembles that can conveniently represent for instance partial-projection DPPs and suggest their use for optimal interpolation. With the help of this formalism, we derive a key identity illustrating the implicit regularization effect of determinantal sampling for semi-parametric regression and interpolation. Also, a novel projected Nyström approximation is defined and used to derive a bound on the expected risk for the corresponding approximation of semi-parametric regression. This work naturally extends similar results obtained for kernel ridge regression.


page 1

page 2

page 3

page 4


Equally spaced points are optimal for Brownian Bridge kernel interpolation

In this paper we show how ideas from spline theory can be used to constr...

Parametric model reduction via rational interpolation along parameters

We present a novel projection-based model reduction framework for parame...

Locally D-optimal Designs for Non-linear Models on the k-dimensional Ball

In this paper we construct (locally) D-optimal designs for a wide class ...

Semi-Infinite Linear Regression and Its Applications

Finite linear least squares is one of the core problems of numerical lin...

Semi-parametric generalized estimating equations for repeated measurements in cross-over designs

A model for cross-over designs with repeated measures within each period...

Emulation as an Accurate Alternative to Interpolation in Sampling Radiative Transfer Codes

Computationally expensive Radiative Transfer Models (RTMs) are widely us...

Please sign up or login with your details

Forgot password? Click here to reset