Correcting Data Imbalance for Semi-Supervised Covid-19 Detection Using X-ray Chest Images

08/19/2020
by   Saul Calderon-Ramirez, et al.
0

The Corona Virus (COVID-19) is an internationalpandemic that has quickly propagated throughout the world. The application of deep learning for image classification of chest X-ray images of Covid-19 patients, could become a novel pre-diagnostic detection methodology. However, deep learning architectures require large labelled datasets. This is often a limitation when the subject of research is relatively new as in the case of the virus outbreak, where dealing with small labelled datasets is a challenge. Moreover, in the context of a new highly infectious disease, the datasets are also highly imbalanced,with few observations from positive cases of the new disease. In this work we evaluate the performance of the semi-supervised deep learning architecture known as MixMatch using a very limited number of labelled observations and highly imbalanced labelled dataset. We propose a simple approach for correcting data imbalance, re-weight each observationin the loss function, giving a higher weight to the observationscorresponding to the under-represented class. For unlabelled observations, we propose the usage of the pseudo and augmentedlabels calculated by MixMatch to choose the appropriate weight. The MixMatch method combined with the proposed pseudo-label based balance correction improved classification accuracy by up to 10 algorithm, with statistical significance. We tested our proposed approach with several available datasets using 10, 15 and 20 labelledobservations. Additionally, a new dataset is included among thetested datasets, composed of chest X-ray images of Costa Rican adult patients

READ FULL TEXT

page 1

page 9

research
09/03/2021

Automated detection of COVID-19 cases from chest X-ray images using deep neural network and XGBoost

In late 2019 and after COVID-19 pandemic in the world, many researchers ...
research
06/21/2019

Boosting the rule-out accuracy of deep disease detection using class weight modifiers

In many screening applications, the primary goal of a radiologist or ass...
research
04/04/2018

Generative Visual Rationales

Interpretability and small labelled datasets are key issues in the pract...
research
10/20/2020

Synthesis of COVID-19 Chest X-rays using Unpaired Image-to-Image Translation

Motivated by the lack of publicly available datasets of chest radiograph...
research
09/30/2020

GraphXCOVID: Explainable Deep Graph Diffusion Pseudo-Labelling for Identifying COVID-19 on Chest X-rays

Can one learn to diagnose COVID-19 under extreme minimal supervision? Si...
research
01/10/2022

Demonstrating The Risk of Imbalanced Datasets in Chest X-ray Image-based Diagnostics by Prototypical Relevance Propagation

The recent trend of integrating multi-source Chest X-Ray datasets to imp...
research
08/13/2022

Incoporating Weighted Board Learning System for Accurate Occupational Pneumoconiosis Staging

Occupational pneumoconiosis (OP) staging is a vital task concerning the ...

Please sign up or login with your details

Forgot password? Click here to reset