Constrained Optimization for Training Deep Neural Networks Under Class Imbalance

02/21/2021
by   Sara Sangalli, et al.
52

Deep neural networks (DNNs) are notorious for making more mistakes for the classes that have substantially fewer samples than the others during training. Such class imbalance is ubiquitous in clinical applications and very crucial to handle because the classes with fewer samples most often correspond to critical cases (e.g., cancer) where misclassifications can have severe consequences. Not to miss such cases, binary classifiers need to be operated at high True Positive Rates (TPR) by setting a higher threshold but this comes at the cost of very high False Positive Rates (FPR) for problems with class imbalance. Existing methods for learning under class imbalance most often do not take this into account. We argue that prediction accuracy should be improved by emphasizing reducing FPRs at high TPRs for problems where misclassification of the positive samples are associated with higher cost. To this end, we pose the training of a DNN for binary classification as a constrained optimization problem and introduce a novel constraint that can be used with existing loss functions to enforce maximal area under the ROC curve (AUC). We solve the resulting constrained optimization problem using an Augmented Lagrangian method (ALM), where the constraint emphasizes reduction of FPR at high TPR. We present experimental results for image-based classification applications using the CIFAR10 and an in-house medical imaging dataset. Our results demonstrate that the proposed method almost always improves the loss functions it is used with by attaining lower FPR at high TPR and higher or equal AUC.

READ FULL TEXT
research
07/02/2021

Optimizing ROC Curves with a Sort-Based Surrogate Loss Function for Binary Classification and Changepoint Detection

Receiver Operating Characteristic (ROC) curves are plots of true positiv...
research
02/28/2018

Constrained Classification and Ranking via Quantiles

In most machine learning applications, classification accuracy is not th...
research
03/25/2018

DeepVesselNet: Vessel Segmentation, Centerline Prediction, and Bifurcation Detection in 3-D Angiographic Volumes

We present DeepVesselNet, an architecture tailored to the challenges to ...
research
09/23/2019

Class-dependent Compression of Deep Neural Networks

Today's deep neural networks require substantial computation resources f...
research
02/20/2021

Analyzing Overfitting under Class Imbalance in Neural Networks for Image Segmentation

Class imbalance poses a challenge for developing unbiased, accurate pred...
research
09/02/2020

A Heaviside Function Approximation for Neural Network Binary Classification

Neural network binary classifiers are often evaluated on metrics like ac...

Please sign up or login with your details

Forgot password? Click here to reset