Publicly available datasets of breast histopathology H&E whole-slide images: A systematic review

06/02/2023
by Masoud Tafavvoghi, et al.

Advancements in digital pathology and computing resources have had a significant impact on computational pathology for breast cancer diagnosis and treatment. However, access to high-quality labeled histopathological images of breast cancer remains a major challenge that limits the development of accurate and robust deep learning models. In this systematic review, we identified the publicly available datasets of breast H&E-stained whole-slide images (WSIs) that can be used to develop deep learning algorithms. We systematically searched nine scientific literature databases and nine research data repositories. We found twelve publicly available datasets, containing 5153 H&E WSIs of breast cancer. Moreover, we reported image metadata and characteristics for each dataset to assist researchers in selecting suitable datasets for specific tasks in breast cancer computational pathology. In addition, we compiled a list of patch and private datasets that were used in the included articles as a supplementary resource for researchers. Notably, 22 of the included articles utilized multiple datasets, and only 12 articles used an external validation set, suggesting that the performance of other developed models may be susceptible to overestimation. The TCGA-BRCA was used in 47.4% of the included articles, a source of selection bias that can impact the robustness and generalizability of the trained algorithms. There is also a lack of consistent metadata reporting for breast WSI datasets, which can hinder the development of accurate deep learning models and indicates the need for explicit guidelines for documenting breast WSI dataset characteristics and metadata.


