Methods and Models for Interpretable Linear Classification

05/16/2014
by   Berk Ustun, et al.

We present an integer programming framework to build accurate and interpretable discrete linear classification models. Unlike existing approaches, our framework is designed to provide practitioners with the control and flexibility they need to tailor accurate and interpretable models for a domain of choice. To this end, our framework can produce models that are fully optimized for accuracy, by minimizing the 0-1 classification loss, and that address multiple aspects of interpretability, by incorporating a range of discrete constraints and penalty functions. We use our framework to produce models that are difficult to create with existing methods, such as scoring systems and M-of-N rule tables. In addition, we propose specially designed optimization methods to improve the scalability of our framework through decomposition and data reduction. We show that discrete linear classifiers can attain the training accuracy of any other linear classifier, and provide an Occam's Razor type argument as to why the use of small discrete coefficients can provide better generalization. We demonstrate the performance and flexibility of our framework through numerical experiments and a case study in which we construct a highly tailored clinical tool for sleep apnea diagnosis.
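To make the objective concrete, here is a minimal sketch of what "minimizing the 0-1 loss over small discrete coefficients with a sparsity penalty" can look like on a toy problem. It is not the authors' integer programming formulation: it swaps in brute-force enumeration, which is only feasible for a handful of features. The function names, the coefficient range, the penalty value, the decision rule (predict +1 when the score is strictly positive), and the toy data are all assumptions made for illustration.

```python
from itertools import product


def zero_one_loss(weights, X, y):
    """Count misclassified points. Labels are +1/-1; predict +1 iff score > 0."""
    errors = 0
    for features, label in zip(X, y):
        score = sum(w * x for w, x in zip(weights, features))
        prediction = 1 if score > 0 else -1
        if prediction != label:
            errors += 1
    return errors


def fit_discrete_classifier(X, y, coef_values=range(-2, 3), sparsity_penalty=0.1):
    """Exhaustively search integer coefficient vectors (illustrative stand-in
    for an integer program; cost grows exponentially with the feature count).

    Objective: (0-1 errors) + sparsity_penalty * (number of nonzero coefficients).
    """
    n_features = len(X[0])
    best_weights, best_objective = None, float("inf")
    for weights in product(coef_values, repeat=n_features):
        n_nonzero = sum(w != 0 for w in weights)
        objective = zero_one_loss(weights, X, y) + sparsity_penalty * n_nonzero
        if objective < best_objective:
            best_weights, best_objective = weights, objective
    return best_weights, best_objective


# Hypothetical toy data: a constant 1 (intercept) followed by two binary features.
X = [(1, 0, 0), (1, 0, 1), (1, 1, 0), (1, 1, 1)]
y = [-1, -1, -1, 1]  # positive only when both features are present

weights, objective = fit_discrete_classifier(X, y)
print("coefficients:", weights, "objective:", objective)
```

The resulting model can be read as a scoring system: add the integer points for each feature that is present, add the intercept, and predict the positive class when the total is above zero. A penalty below 1 means the search never trades a misclassification for a sparser model, which is one way to prioritize accuracy over sparsity.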


