Distribution-on-Distribution Regression with Wasserstein Metric: Multivariate Gaussian Case

07/12/2023
by Ryo Okano, et al.

Distribution data refers to a data set in which each sample is represented as a probability distribution, and it is receiving growing interest in statistics. Although several studies have developed distribution-on-distribution regression models for univariate distributions, the multivariate case remains under-explored because of technical complexities. In this study, we introduce models for regression from one Gaussian distribution to another, using the Wasserstein metric. The models are built on the geometry of the Wasserstein space, which allows Gaussian distributions to be transformed into elements of a linear matrix space. Because they take the form of linear regression, the models are easy to interpret, and they are simple to implement because the optimal transport problem between Gaussian distributions has an analytical solution. We also explore a generalization of the models to non-Gaussian cases. We establish convergence rates for the in-sample prediction errors of the empirical risk minimizers in our models. In comparative simulation experiments, our models outperform a simpler alternative method that transforms Gaussian distributions into matrices. We illustrate the methodology with an application to weather data.
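The analytical solution mentioned in the abstract can be illustrated with a short sketch. It uses the standard closed-form expressions for the 2-Wasserstein distance and the optimal transport map between two Gaussian distributions with positive definite covariances. It is not the regression model of the paper, and the function names (`sqrtm_psd`, `gaussian_w2`, `gaussian_ot_map`) are hypothetical, chosen only for this example.

```python
import numpy as np


def sqrtm_psd(a):
    """Symmetric square root of a symmetric positive semi-definite matrix."""
    w, v = np.linalg.eigh(a)
    w = np.clip(w, 0.0, None)  # guard against tiny negative eigenvalues
    return (v * np.sqrt(w)) @ v.T


def gaussian_w2(m1, s1, m2, s2):
    """2-Wasserstein distance between N(m1, s1) and N(m2, s2).

    W2^2 = |m1 - m2|^2 + tr(s1 + s2 - 2 (s1^{1/2} s2 s1^{1/2})^{1/2})
    """
    r1 = sqrtm_psd(s1)
    cross = sqrtm_psd(r1 @ s2 @ r1)
    d2 = np.sum((m1 - m2) ** 2) + np.trace(s1 + s2 - 2.0 * cross)
    return np.sqrt(max(d2, 0.0))  # clip round-off below zero


def gaussian_ot_map(s1, s2):
    """Linear part T of the optimal map x -> m2 + T (x - m1).

    T = s1^{-1/2} (s1^{1/2} s2 s1^{1/2})^{1/2} s1^{-1/2}; assumes s1 is
    positive definite so that s1^{1/2} is invertible.
    """
    r1 = sqrtm_psd(s1)
    r1_inv = np.linalg.inv(r1)
    return r1_inv @ sqrtm_psd(r1 @ s2 @ r1) @ r1_inv


if __name__ == "__main__":
    m1, s1 = np.zeros(2), np.eye(2)
    m2, s2 = np.array([1.0, 0.0]), np.array([[2.0, 0.5], [0.5, 1.0]])
    t = gaussian_ot_map(s1, s2)
    print("W2 distance:", gaussian_w2(m1, s1, m2, s2))
    print("T pushes s1 to s2:", np.allclose(t @ s1 @ t.T, s2))
```

The map `T` is a symmetric matrix, which gives one way to see how a Gaussian distribution can be represented as an element of a linear matrix space relative to a reference Gaussian.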


research
06/17/2020

Wasserstein Regression

The analysis of samples of random objects that do not lie in a vector sp...
research
06/18/2023

Sliced Wasserstein Regression

While statistical modeling of distributional data has gained increased a...
research
07/20/2021

Conditional Wasserstein Barycenters and Interpolation/Extrapolation of Distributions

Increasingly complex data analysis tasks motivate the study of the depen...
research
06/10/2020

Robustified Multivariate Regression and Classification Using Distributionally Robust Optimization under the Wasserstein Metric

We develop Distributionally Robust Optimization (DRO) formulations for M...
research
11/12/2017

Aggregated Wasserstein Metric and State Registration for Hidden Markov Models

We propose a framework, named Aggregated Wasserstein, for computing a di...
research
10/19/2022

Geostatistics in the presence of multivariate complexities: comparison of multi-Gaussian transforms

Geostatistical simulation of two or more continuous variables is a commo...
research
08/24/2023

Wasserstein Regression with Empirical Measures and Density Estimation for Sparse Data

The problem of modeling the relationship between univariate distribution...
