Principles and Practice of Explainable Machine Learning

09/18/2020
by   Vaishak Belle, et al.
48

Artificial intelligence (AI) provides many opportunities to improve private and public life. Discovering patterns and structures in large troves of data in an automated manner is a core component of data science, and currently drives applications in diverse areas such as computational biology, law and finance. However, such a highly positive impact is coupled with significant challenges: how do we understand the decisions suggested by these systems in order that we can trust them? In this report, we focus specifically on data-driven methods – machine learning (ML) and pattern recognition models in particular – so as to survey and distill the results and observations from the literature. The purpose of this report can be especially appreciated by noting that ML models are increasingly deployed in a wide range of businesses. However, with the increasing prevalence and complexity of methods, business stakeholders in the very least have a growing number of concerns about the drawbacks of models, data-specific biases, and so on. Analogously, data science practitioners are often not aware about approaches emerging from the academic literature, or may struggle to appreciate the differences between different methods, so end up using industry standards such as SHAP. Here, we have undertaken a survey to help industry practitioners (but also data scientists more broadly) understand the field of explainable machine learning better and apply the right tools. Our latter sections build a narrative around a putative data scientist, and discuss how she might go about explaining her models by asking the right questions.

READ FULL TEXT

page 19

page 22

page 23

page 24

page 25

page 26

page 27

page 28

research
10/10/2019

The Quest for Interpretable and Responsible Artificial Intelligence

Artificial Intelligence (AI) provides many opportunities to improve priv...
research
03/12/2021

Challenges and Governance Solutions for Data Science Services based on Open Data and APIs

Increasingly common open data and open application programming interface...
research
03/29/2023

Machine Learning for Uncovering Biological Insights in Spatial Transcriptomics Data

Development and homeostasis in multicellular systems both require exquis...
research
10/08/2021

Opportunities for Machine Learning to Accelerate Halide Perovskite Commercialization and Scale-Up

While halide perovskites attract significant academic attention, example...
research
05/13/2021

Providing Assurance and Scrutability on Shared Data and Machine Learning Models with Verifiable Credentials

Adopting shared data resources requires scientists to place trust in the...
research
01/07/2020

Vamsa: Tracking Provenance in Data Science Scripts

Machine learning (ML) which was initially adopted for search ranking and...

Please sign up or login with your details

Forgot password? Click here to reset