FANOK: Knockoffs in Linear Time

06/15/2020
by   Armin Askari, et al.
0

We describe a series of algorithms that efficiently implement Gaussian model-X knockoffs to control the false discovery rate on large scale feature selection problems. Identifying the knockoff distribution requires solving a large scale semidefinite program for which we derive several efficient methods. One handles generic covariance matrices, has a complexity scaling as O(p^3) where p is the ambient dimension, while another assumes a rank k factor model on the covariance matrix to reduce this complexity bound to O(pk^2). We also derive efficient procedures to both estimate factor models and sample knockoff covariates with complexity linear in the dimension. We test our methods on problems with p as large as 500,000.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2014

A Computationally Efficient Limited Memory CMA-ES for Large Scale Optimization

We propose a computationally efficient limited memory Covariance Matrix ...
research
07/17/2021

Sparse Bayesian Learning with Diagonal Quasi-Newton Method For Large Scale Classification

Sparse Bayesian Learning (SBL) constructs an extremely sparse probabilis...
research
10/03/2016

Sequential Low-Rank Change Detection

Detecting emergence of a low-rank signal from high-dimensional data is a...
research
12/10/2019

Testing and Estimating Change-Points in the Covariance Matrix of a High-Dimensional Time Series

This paper studies methods for testing and estimating change-points in t...
research
09/03/2018

Shrinkage for Covariance Estimation: Asymptotics, Confidence Intervals, Bounds and Applications in Sensor Monitoring and Finance

When shrinking a covariance matrix towards (a multiple) of the identity ...
research
04/03/2013

A Fast Semidefinite Approach to Solving Binary Quadratic Problems

Many computer vision problems can be formulated as binary quadratic prog...

Please sign up or login with your details

Forgot password? Click here to reset