A Holistic Assessment of the Reliability of Machine Learning Systems

07/20/2023
by Anthony Corso, et al.

As machine learning (ML) systems increasingly permeate high-stakes settings such as healthcare, transportation, military, and national security, concerns regarding their reliability have emerged. Despite notable progress, the performance of these systems can degrade significantly under adversarial attacks or environmental changes, leading to overconfident predictions, failures to detect input faults, and an inability to generalize to unexpected scenarios. This paper proposes a holistic assessment methodology for the reliability of ML systems. Our framework evaluates five key properties: in-distribution accuracy, distribution-shift robustness, adversarial robustness, calibration, and out-of-distribution detection. We also introduce a reliability score and use it to assess overall system reliability. To provide insight into the performance of different algorithmic approaches, we identify and categorize state-of-the-art techniques, then evaluate a selection of them on real-world tasks using the proposed reliability metrics and reliability score. Our analysis of over 500 models reveals that designing for one metric does not necessarily constrain the others, but that certain algorithmic techniques can improve reliability across multiple metrics simultaneously. This study contributes to a more comprehensive understanding of ML reliability and provides a roadmap for future research and development.
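To make one of the five properties concrete, the sketch below computes a binned expected calibration error (ECE), a commonly used calibration measure. The abstract does not say which calibration metric or binning scheme the paper uses, so the function name `expected_calibration_error`, the equal-width binning, and the default of 10 bins are all assumptions made for illustration only.

```python
from typing import Sequence


def expected_calibration_error(
    confidences: Sequence[float],
    correct: Sequence[bool],
    n_bins: int = 10,
) -> float:
    """Binned ECE: weighted mean of |accuracy - mean confidence| over bins.

    Illustrative only; the binning scheme is an assumption, not the paper's.
    Confidences are expected to lie in [0, 1].
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    if len(confidences) == 0:
        raise ValueError("need at least one prediction")

    # Per-bin running totals: [count, sum of confidences, number correct].
    bins = [[0, 0.0, 0] for _ in range(n_bins)]
    for conf, ok in zip(confidences, correct):
        # Equal-width bins over [0, 1]; a confidence of 1.0 goes in the last bin.
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx][0] += 1
        bins[idx][1] += conf
        bins[idx][2] += int(ok)

    total = len(confidences)
    ece = 0.0
    for count, conf_sum, n_correct in bins:
        if count == 0:
            continue  # Empty bins contribute nothing.
        gap = abs(n_correct / count - conf_sum / count)
        ece += (count / total) * gap
    return ece
```

A model that reports 0.9 confidence on four predictions but gets only three right has a gap of about 0.15 in that bin, which is the kind of overconfidence the abstract describes. A lower value indicates better calibration.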


