Related papers: Manipulating and Measuring Model Interpretability

Assessing the Local Interpretability of Machine Learning Models

The increasing adoption of machine learning tools has led to calls for accountability via model interpretability. But what does it mean for a machine learning model to be interpretable by humans, and how can this be assessed? We focus on…

Machine Learning · Computer Science 2019-08-06 Dylan Slack , Sorelle A. Friedler , Carlos Scheidegger , Chitradeep Dutta Roy

Causal Interpretability for Machine Learning -- Problems, Methods and Evaluation

Machine learning models have had discernible achievements in a myriad of applications. However, most of these models are black-boxes, and it is obscure how the decisions are made by them. This makes the models unreliable and untrustworthy.…

Machine Learning · Computer Science 2020-03-23 Raha Moraffah , Mansooreh Karami , Ruocheng Guo , Adrienne Raglin , Huan Liu

The Mythos of Model Interpretability

Supervised machine learning models boast remarkable predictive capabilities. But can you trust your model? Will it work in deployment? What else can it tell you about the world? We want models to be not only good, but interpretable. And yet…

Machine Learning · Computer Science 2017-03-07 Zachary C. Lipton

Investigating the Duality of Interpretability and Explainability in Machine Learning

The rapid evolution of machine learning (ML) has led to the widespread adoption of complex "black box" models, such as deep neural networks and ensemble methods. These models exhibit exceptional predictive performance, making them…

Machine Learning · Computer Science 2025-03-28 Moncef Garouani , Josiane Mothe , Ayah Barhrhouj , Julien Aligon

Model-Agnostic Interpretability of Machine Learning

Understanding why machine learning models behave the way they do empowers both system designers and end-users in many ways: in model selection, feature engineering, in order to trust and act upon the predictions, and in more intuitive user…

Machine Learning · Statistics 2016-06-20 Marco Tulio Ribeiro , Sameer Singh , Carlos Guestrin

Interpretable Companions for Black-Box Models

We present an interpretable companion model for any pre-trained black-box classifiers. The idea is that for any input, a user can decide to either receive a prediction from the black-box model, with high accuracy but no explanations, or…

Machine Learning · Statistics 2020-02-12 Danqing Pan , Tong Wang , Satoshi Hara

Techniques for Interpretable Machine Learning

Interpretable machine learning tackles the important problem that humans cannot understand the behaviors of complex machine learning models and how these models arrive at a particular decision. Although many approaches have been proposed, a…

Machine Learning · Computer Science 2019-05-21 Mengnan Du , Ninghao Liu , Xia Hu

Human Factors in Model Interpretability: Industry Practices, Challenges, and Needs

As the use of machine learning (ML) models in product development and data-driven decision-making processes became pervasive in many domains, people's focus on building a well-performing model has increasingly shifted to understanding how…

Human-Computer Interaction · Computer Science 2020-06-02 Sungsoo Ray Hong , Jessica Hullman , Enrico Bertini

Individual Explanations in Machine Learning Models: A Survey for Practitioners

In recent years, the use of sophisticated statistical models that influence decisions in domains of high societal relevance is on the rise. Although these models can often bring substantial improvements in the accuracy and efficiency of…

Machine Learning · Computer Science 2021-04-13 Alfredo Carrillo , Luis F. Cantú , Alejandro Noriega

On The Stability of Interpretable Models

Interpretable classification models are built with the purpose of providing a comprehensible description of the decision logic to an external oversight agent. When considered in isolation, a decision tree, a set of classification rules, or…

Machine Learning · Computer Science 2019-03-18 Riccardo Guidotti , Salvatore Ruggieri

Model Transparency and Interpretability : Survey and Application to the Insurance Industry

The use of models, even if efficient, must be accompanied by an understanding at all levels of the process that transforms data (upstream and downstream). Thus, needs increase to define the relationships between individual data and the…

Machine Learning · Statistics 2022-09-02 Dimitri Delcaillau , Antoine Ly , Alize Papp , Franck Vermet

Interpretability of machine learning based prediction models in healthcare

There is a need of ensuring machine learning models that are interpretable. Higher interpretability of the model means easier comprehension and explanation of future predictions for end-users. Further, interpretable machine learning models…

Machine Learning · Computer Science 2020-08-17 Gregor Stiglic , Primoz Kocbek , Nino Fijacko , Marinka Zitnik , Katrien Verbert , Leona Cilar

The Promise and Peril of Human Evaluation for Model Interpretability

Transparency, user trust, and human comprehension are popular ethical motivations for interpretable machine learning. In support of these goals, researchers evaluate model explanation performance using humans and real world applications.…

Artificial Intelligence · Computer Science 2019-10-31 Bernease Herman

Explainable Artificial Intelligence: Understanding, Visualizing and Interpreting Deep Learning Models

With the availability of large databases and recent improvements in deep learning methodology, the performance of AI systems is reaching or even exceeding the human level on an increasing number of complex tasks. Impressive examples of this…

Artificial Intelligence · Computer Science 2017-08-29 Wojciech Samek , Thomas Wiegand , Klaus-Robert Müller

On the Relationship Between Interpretability and Explainability in Machine Learning

Interpretability and explainability have gained more and more attention in the field of machine learning as they are crucial when it comes to high-stakes decisions and troubleshooting. Since both provide information about predictors and…

Machine Learning · Computer Science 2024-04-26 Benjamin Leblanc , Pascal Germain

The Need for Interpretable Features: Motivation and Taxonomy

Through extensive experience developing and explaining machine learning (ML) applications for real-world domains, we have learned that ML models are only as interpretable as their features. Even simple, highly interpretable model types such…

Machine Learning · Computer Science 2022-02-25 Alexandra Zytek , Ignacio Arnaldo , Dongyu Liu , Laure Berti-Equille , Kalyan Veeramachaneni

Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond

Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to their over-parameterized black-box nature, it is often difficult to understand the prediction…

Machine Learning · Computer Science 2022-07-18 Xuhong Li , Haoyi Xiong , Xingjian Li , Xuanyu Wu , Xiao Zhang , Ji Liu , Jiang Bian , Dejing Dou

Interpretability of machine-learning models in physical sciences

In machine learning (ML), it is in general challenging to provide a detailed explanation on how a trained model arrives at its prediction. Thus, usually we are left with a black-box, which from a scientific standpoint is not satisfactory.…

Materials Science · Physics 2021-04-22 Luca M. Ghiringhelli

Rigorous Interpretation Is a Form of Evaluation

Current machine learning models are evaluated through behavioral snapshots, with benchmark accuracies, win rates and outcome-based metrics. Model explanations and evaluations, however, are fundamentally intertwined: understanding why a…

Computers and Society · Computer Science 2026-05-08 Isabelle Lee , Emmy Liu , Cathy Jiao , Brihi Joshi , Dani Yogatama , Fazl Barez , Michael Saxon

Interpretable models for extrapolation in scientific machine learning

Data-driven models are central to scientific discovery. In efforts to achieve state-of-the-art model accuracy, researchers are employing increasingly complex machine learning algorithms that often outperform simple regressions in…

Materials Science · Physics 2022-12-21 Eric S. Muckley , James E. Saal , Bryce Meredig , Christopher S. Roper , John H. Martin