Representer Point Selection for Explaining Regularized High-dimensional Models

05/31/2023
by   Che-Ping Tsai, et al.
0

We introduce a novel class of sample-based explanations we term high-dimensional representers, that can be used to explain the predictions of a regularized high-dimensional model in terms of importance weights for each of the training samples. Our workhorse is a novel representer theorem for general regularized high-dimensional models, which decomposes the model prediction in terms of contributions from each of the training samples: with positive (negative) values corresponding to positive (negative) impact training samples to the model's prediction. We derive consequences for the canonical instances of ℓ_1 regularized sparse models, and nuclear norm regularized low-rank models. As a case study, we further investigate the application of low-rank models in the context of collaborative filtering, where we instantiate high-dimensional representers for specific popular classes of models. Finally, we study the empirical performance of our proposed methods on three real-world binary classification datasets and two recommender system datasets. We also showcase the utility of high-dimensional representers in explaining model recommendations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/09/2018

Low Rank and Structured Modeling of High-dimensional Vector Autoregressions

Network modeling of high-dimensional time series data is a key learning ...
research
04/17/2012

Regularized Partial Least Squares with an Application to NMR Spectroscopy

High-dimensional data common in genomics, proteomics, and chemometrics o...
research
08/14/2022

High-dimensional cointegration and Kuramoto systems

This paper presents a novel estimator for a non-standard restriction to ...
research
12/17/2018

ℓ_0-Motivated Low-Rank Sparse Subspace Clustering

In many applications, high-dimensional data points can be well represent...
research
04/15/2015

Theory of Dual-sparse Regularized Randomized Reduction

In this paper, we study randomized reduction methods, which reduce high-...
research
08/14/2020

Binarised Regression with Instance-Varying Costs: Evaluation using Impact Curves

Many evaluation methods exist, each for a particular prediction task, an...
research
02/08/2022

Class Density and Dataset Quality in High-Dimensional, Unstructured Data

We provide a definition for class density that can be used to measure th...

Please sign up or login with your details

Forgot password? Click here to reset