Related papers: Techniques for Interpretable Machine Learning

Towards A Rigorous Science of Interpretable Machine Learning

As machine learning systems become ubiquitous, there has been a surge of interest in interpretable machine learning: systems that provide explanation for their outputs. These explanations are often used to qualitatively assess other…

Machine Learning · Statistics 2017-03-06 Finale Doshi-Velez, Been Kim

Interpretability of machine learning based prediction models in healthcare

There is a need of ensuring machine learning models that are interpretable. Higher interpretability of the model means easier comprehension and explanation of future predictions for end-users. Further, interpretable machine learning models…

Machine Learning · Computer Science 2020-08-17 Gregor Stiglic, Primoz Kocbek, Nino Fijacko, Marinka Zitnik, Katrien Verbert, Leona Cilar

Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges

Interpretability in machine learning (ML) is crucial for high stakes decisions and troubleshooting. In this work, we provide fundamental principles for interpretable ML, and dispel common misunderstandings that dilute the importance of this…

Machine Learning · Computer Science 2021-09-02 Cynthia Rudin, Chaofan Chen, Zhi Chen, Haiyang Huang, Lesia Semenova, Chudi Zhong

Interpretable Machine Learning for Discovery: Statistical Challenges & Opportunities

New technologies have led to vast troves of large and complex datasets across many scientific domains and industries. People routinely use machine learning techniques to not only process, visualize, and make predictions from this big data,…

Machine Learning · Statistics 2023-08-04 Genevera I. Allen, Luqin Gan, Lili Zheng

Interpretability and Explainability: A Machine Learning Zoo Mini-tour

In this review, we examine the problem of designing interpretable and explainable machine learning models. Interpretability and explainability lie at the core of many machine learning and statistical applications in medicine, economics,…

Machine Learning · Computer Science 2023-03-02 Ričards Marcinkevičs, Julia E. Vogt

Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond

Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to their over-parameterized black-box nature, it is often difficult to understand the prediction…

Machine Learning · Computer Science 2022-07-18 Xuhong Li, Haoyi Xiong, Xingjian Li, Xuanyu Wu, Xiao Zhang, Ji Liu, Jiang Bian, Dejing Dou

Interpretable Machine Learning for Weather and Climate Prediction: A Survey

Advanced machine learning models have recently achieved high predictive accuracy for weather and climate prediction. However, these complex models often lack inherent transparency and interpretability, acting as "black boxes" that impede…

Atmospheric and Oceanic Physics · Physics 2024-03-29 Ruyi Yang, Jingyu Hu, Zihao Li, Jianli Mu, Tingzhao Yu, Jiangjiang Xia, Xuhong Li, Aritra Dasgupta, Haoyi Xiong

Interpretable machine learning: definitions, methods, and applications

Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned…

Machine Learning · Statistics 2019-11-15 W. James Murdoch, Chandan Singh, Karl Kumbier, Reza Abbasi-Asl, Bin Yu

The Definitions of Interpretability and Learning of Interpretable Models

As machine learning algorithms getting adopted in an ever-increasing number of applications, interpretation has emerged as a crucial desideratum. In this paper, we propose a mathematical definition for the human-interpretable model. In…

Machine Learning · Computer Science 2021-06-01 Weishen Pan, Changshui Zhang

Causal Interpretability for Machine Learning -- Problems, Methods and Evaluation

Machine learning models have had discernible achievements in a myriad of applications. However, most of these models are black-boxes, and it is obscure how the decisions are made by them. This makes the models unreliable and untrustworthy.…

Machine Learning · Computer Science 2020-03-23 Raha Moraffah, Mansooreh Karami, Ruocheng Guo, Adrienne Raglin, Huan Liu

Interpretable and Explainable Machine Learning for Materials Science and Chemistry

While the uptake of data-driven approaches for materials science and chemistry is at an exciting, early stage, to realise the true potential of machine learning models for successful scientific discovery, they must have qualities beyond…

Materials Science · Physics 2022-06-28 Felipe Oviedo, Juan Lavista Ferres, Tonio Buonassisi, Keith Butler

Meaningful Models: Utilizing Conceptual Structure to Improve Machine Learning Interpretability

The last decade has seen huge progress in the development of advanced machine learning models; however, those models are powerless unless human users can interpret them. Here we show how the mind's construction of concepts and meaning can…

Machine Learning · Statistics 2016-07-04 Nick Condry

Rethinking Interpretability in the Era of Large Language Models

Interpretable machine learning has exploded as an area of interest over the last decade, sparked by the rise of increasingly large datasets and deep neural networks. Simultaneously, large language models (LLMs) have demonstrated remarkable…

Computation and Language · Computer Science 2024-02-06 Chandan Singh, Jeevana Priya Inala, Michel Galley, Rich Caruana, Jianfeng Gao

Assessing the Local Interpretability of Machine Learning Models

The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on…

Machine Learning · Computer Science 2019-08-06 Dylan Slack, Sorelle A. Friedler, Carlos Scheidegger, Chitradeep Dutta Roy

Rigorous Interpretation Is a Form of Evaluation

Current machine learning models are evaluated through behavioral snapshots, with benchmark accuracies, win rates and outcome-based metrics. Model explanations and evaluations, however, are fundamentally intertwined: understanding why a…

Computers and Society · Computer Science 2026-05-08 Isabelle Lee, Emmy Liu, Cathy Jiao, Brihi Joshi, Dani Yogatama, Fazl Barez, Michael Saxon

On the Relationship Between Interpretability and Explainability in Machine Learning

Interpretability and explainability have gained more and more attention in the field of machine learning as they are crucial when it comes to high-stakes decisions and troubleshooting. Since both provide information about predictors and…

Machine Learning · Computer Science 2024-04-26 Benjamin Leblanc, Pascal Germain

On the Semantic Interpretability of Artificial Intelligence Models

Artificial Intelligence models are becoming increasingly more powerful and accurate, supporting or even replacing humans' decision making. But with increased power and accuracy also comes higher complexity, making it hard for users to…

Artificial Intelligence · Computer Science 2019-07-10 Vivian S. Silva, André Freitas, Siegfried Handschuh

Interpretable Machine Learning in Physics: A Review

Machine learning is increasingly transforming various scientific fields, enabled by advancements in computational power and access to large data sets from experiments and simulations. As artificial intelligence (AI) continues to grow in…

Computational Physics · Physics 2025-04-01 Sebastian Johann Wetzel, Seungwoong Ha, Raban Iten, Miriam Klopotek, Ziming Liu

Open Problems in Mechanistic Interpretability

Mechanistic interpretability aims to understand the computational mechanisms underlying neural networks' capabilities in order to accomplish concrete scientific and engineering goals. Progress in this field thus promises to provide greater…

Machine Learning · Computer Science 2025-01-29 Lee Sharkey, Bilal Chughtai, Joshua Batson, Jack Lindsey, Jeff Wu, Lucius Bushnaq, Nicholas Goldowsky-Dill, Stefan Heimersheim, Alejandro Ortega, Joseph Bloom, Stella Biderman, Adria Garriga-Alonso, Arthur Conmy, Neel Nanda, Jessica Rumbelow, Martin Wattenberg, Nandi Schoots, Joseph Miller, Eric J. Michaud, Stephen Casper, Max Tegmark, William Saunders, David Bau, Eric Todd, Atticus Geiger, Mor Geva, Jesse Hoogland, Daniel Murfet, Tom McGrath

Interpretable Machine Learning -- A Brief History, State-of-the-Art and Challenges

We present a brief history of the field of interpretable machine learning (IML), give an overview of state-of-the-art interpretation methods, and discuss challenges. Research in IML has boomed in recent years. As young as the field is, it…

Machine Learning · Statistics 2022-01-24 Christoph Molnar, Giuseppe Casalicchio, Bernd Bischl