Robust high dimensional factor models with applications to statistical machine learning

08/12/2018
by   Jianqing Fan, et al.
0

Factor models are a class of powerful statistical models that have been widely used to deal with dependent measurements that arise frequently from various applications from genomics and neuroscience to economics and finance. As data are collected at an ever-growing scale, statistical machine learning faces some new challenges: high dimensionality, strong dependence among observed variables, heavy-tailed variables and heterogeneity. High-dimensional robust factor analysis serves as a powerful toolkit to conquer these challenges. This paper gives a selective overview on recent advance on high-dimensional factor models and their applications to statistics including Factor-Adjusted Robust Model selection (FarmSelect) and Factor-Adjusted Robust Multiple testing (FarmTest). We show that classical methods, especially principal component analysis (PCA), can be tailored to many new problems and provide powerful tools for statistical estimation and inference. We highlight PCA and its connections to matrix perturbation theory, robust statistics, random projection, false discovery rate, etc., and illustrate through several applications how insights from these fields yield solutions to modern challenges. We also present far-reaching connections between factor models and popular statistical learning problems, including network analysis and low-rank matrix recovery.

READ FULL TEXT
research
10/11/2021

Learned Robust PCA: A Scalable Deep Unfolding Approach for High-Dimensional Outlier Detection

Robust principal component analysis (RPCA) is a critical tool in modern ...
research
09/21/2020

Recent Developments on Factor Models and its Applications in Econometric Learning

This paper makes a selective survey on the recent development of the fac...
research
03/14/2023

Robust Multiple Testing under High-dimensional Dynamic Factor Model

Large-scale multiple testing under static factor models is commonly used...
research
03/06/2023

Huber Principal Component Analysis for Large-dimensional Factor Models

Factor models have been widely used in economics and finance. However, t...
research
11/15/2017

FARM-Test: Factor-Adjusted Robust Multiple Testing with False Discovery Control

Large-scale multiple testing with correlated and heavy-tailed data arise...
research
05/09/2023

Robust Model Selection with Application in Single-Cell Multiomics Data

Model selection is critical in the modern statistics and machine learnin...
research
09/22/2022

PC Adjusted Testing for Low Dimensional Parameters

In this paper we consider the effect of high dimensional Principal Compo...

Please sign up or login with your details

Forgot password? Click here to reset