Compositional Data Regression in Insurance with Exponential Family PCA

12/29/2021
by   Guojun Gan, et al.
0

Compositional data are multivariate observations that carry only relative information between components. Applying standard multivariate statistical methodology directly to analyze compositional data can lead to paradoxes and misinterpretations. Compositional data also frequently appear in insurance, especially with telematics information. However, such type of data does not receive deserved special treatment in most existing actuarial literature. In this paper, we explore and investigate the use of exponential family principal component analysis (EPCA) to analyze compositional data in insurance. The method is applied to analyze a dataset obtained from the U.S. Mine Safety and Health Administration. The numerical results show that EPCA is able to produce principal components that are significant predictors and improve the prediction accuracy of the regression model. The EPCA method can be a promising useful tool for actuaries to analyze compositional data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2020

Independent Component Analysis for Compositional Data

Compositional data represent a specific family of multivariate data, whe...
research
12/21/2018

Primal path algorithm for compositional data analysis

Compositional data have two unique characteristics compared to typical m...
research
04/11/2019

Robust Principal Component Analysis for Compositional Tables

A data table which is arranged according to two factors can often be con...
research
01/25/2022

Compositional Cubes: A New Concept for Multi-factorial Compositions

Compositional data are commonly known as multivariate observations carry...
research
11/03/2022

Principal Balances of Compositional Data for Regression and Classification using Partial Least Squares

High-dimensional compositional data are commonplace in the modern omics ...
research
06/21/2021

A causal view on compositional data

Many scientific datasets are compositional in nature. Important examples...
research
02/19/2023

Identifying Heterogeneity in Regression Compositional Data Integration with Many Categories

In compositional data, detecting which part of the whole delineates hete...

Please sign up or login with your details

Forgot password? Click here to reset