Asymptotically Sharp Upper Bound for the Column Subset Selection Problem

03/14/2023 ∙ by Jian-Feng Cai, et al.

This paper investigates the spectral norm version of the column subset selection problem. Given a matrix 𝐀 ∈ ℝ^{n×d} and a positive integer k ≤ rank(𝐀), the objective is to select exactly k columns of 𝐀 that minimize the spectral norm of the residual matrix after projecting 𝐀 onto the space spanned by the selected columns. We use the method of interlacing polynomials introduced by Marcus-Spielman-Srivastava to derive an asymptotically sharp upper bound on the minimal approximation error, and propose a deterministic polynomial-time algorithm that achieves this error bound (up to a computational error). Furthermore, we extend our result to a column partition problem in which the columns of 𝐀 can be partitioned into r ≥ 2 subsets such that 𝐀 can be well approximated by subsets from various groups. We show that the machinery of interlacing polynomials also works in this context, and establish a connection between the relevant expected characteristic polynomials and the r-characteristic polynomials introduced by Ravichandran and Leake. As a consequence, we prove that the columns of a rank-d matrix 𝐀 ∈ ℝ^{n×d} can be partitioned into r subsets S_1, …, S_r, such that the column space of 𝐀 can be well approximated by the span of the columns in the complement of S_i for each 1 ≤ i ≤ r.
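To make the objective in the abstract concrete, the following is a minimal sketch that evaluates the spectral norm of the residual for a given set of columns and finds the best k-subset by exhaustive search. It is not the interlacing-polynomial algorithm proposed in the paper: the brute-force search takes exponential time and is only meant to illustrate what is being minimized on tiny inputs. The function names `residual_spectral_norm` and `brute_force_css` are hypothetical, and NumPy is assumed to be available.

```python
import itertools

import numpy as np


def residual_spectral_norm(A, cols):
    """Spectral norm of A minus its projection onto span of A[:, cols]."""
    A_S = A[:, list(cols)]
    # A_S @ pinv(A_S) is the orthogonal projector onto the column space of A_S.
    projector = A_S @ np.linalg.pinv(A_S)
    return np.linalg.norm(A - projector @ A, ord=2)


def brute_force_css(A, k):
    """Illustrative exhaustive search over all k-subsets of columns.

    Assumption: this is NOT the paper's method, only a reference
    implementation of the objective for very small matrices.
    """
    d = A.shape[1]
    best = min(
        itertools.combinations(range(d), k),
        key=lambda cols: residual_spectral_norm(A, cols),
    )
    return best, residual_spectral_norm(A, best)


if __name__ == "__main__":
    # Toy example: orthogonal columns with norms 1, 2, 3.
    A = np.diag([1.0, 2.0, 3.0])
    cols, err = brute_force_css(A, k=2)
    print(cols, err)
```

In this toy example the columns are orthogonal, so the residual is exactly the unselected column, and the search keeps the two columns with the largest norms.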


research
∙ 08/16/2019

Low-rank approximation in the Frobenius norm by column and row subset selection

A CUR approximation of a matrix A is a particular type of low-rank appro...
research
∙ 03/15/2019

Subset Selection for Matrices with Fixed Blocks

Subset selection for matrices is the task of extracting a column sub-mat...
research
∙ 05/17/2015

Provably Correct Algorithms for Matrix Column Subset Selection with Selectively Sampled Data

We consider the problem of matrix column subset selection, which selects...
research
∙ 06/16/2022

Generalized Leverage Scores: Geometric Interpretation and Applications

In problems involving matrix computations, the concept of leverage has f...
research
∙ 12/23/2018

A determinantal point process for column subset selection

Dimensionality reduction is a first step of many machine learning pipeli...
research
∙ 03/08/2017

Polynomial Time Algorithms for Dual Volume Sampling

We study dual volume sampling, a method for selecting k columns from an ...
