Related papers: An interpretable neural network model through piec…

An Interpretable Loan Credit Evaluation Method Based on Rule Representation Learner

The interpretability of models has become one of the obstacles to their wide application in high-stakes fields. The usual way to obtain interpretability is to build a black-box model first and then explain it using post-hoc methods. However,…

Machine Learning · Computer Science 2023-04-04 Zihao Chen, Xiaomeng Wang, Yuanjiang Huang, Tao Jia

Hybrid Predictive Model: When an Interpretable Model Collaborates with a Black-box Model

Interpretable machine learning has become a strong competitor for traditional black-box models. However, the possible loss of the predictive performance for gaining interpretability is often inevitable, putting practitioners in a dilemma of…

Machine Learning · Computer Science 2019-05-13 Tong Wang, Qihang Lin

Interpretable Companions for Black-Box Models

We present an interpretable companion model for any pre-trained black-box classifiers. The idea is that for any input, a user can decide to either receive a prediction from the black-box model, with high accuracy but no explanations, or…

Machine Learning · Statistics 2020-02-12 Danqing Pan, Tong Wang, Satoshi Hara

A Semiparametric Approach to Interpretable Machine Learning

Black box models in machine learning have demonstrated excellent predictive performance in complex problems and high-dimensional settings. However, their lack of transparency and interpretability restrict the applicability of such models in…

Machine Learning · Computer Science 2020-06-09 Numair Sani, Jaron Lee, Razieh Nabi, Ilya Shpitser

Interpretable Machine Learning for Weather and Climate Prediction: A Survey

Advanced machine learning models have recently achieved high predictive accuracy for weather and climate prediction. However, these complex models often lack inherent transparency and interpretability, acting as "black boxes" that impede…

Atmospheric and Oceanic Physics · Physics 2024-03-29 Ruyi Yang, Jingyu Hu, Zihao Li, Jianli Mu, Tingzhao Yu, Jiangjiang Xia, Xuhong Li, Aritra Dasgupta, Haoyi Xiong

Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction

Explaining recommendations enables users to understand whether recommended items are relevant to their needs and has been shown to increase their trust in the system. More generally, if designing explainable machine learning models is key…

Machine Learning · Computer Science 2020-08-27 Darius Afchar, Romain Hennequin

An Interpretable Probabilistic Approach for Demystifying Black-box Predictive Models

The use of sophisticated machine learning models for critical decision making is faced with a challenge that these models are often applied as a "black-box". This has led to an increased interest in interpretable machine learning, where…

Artificial Intelligence · Computer Science 2020-07-22 Catarina Moreira, Yu-Liang Chou, Mythreyi Velmurugan, Chun Ouyang, Renuka Sindhgatta, Peter Bruza

Partially Interpretable Estimators (PIE): Black-Box-Refined Interpretable Machine Learning

We propose Partially Interpretable Estimators (PIE) which attribute a prediction to individual features via an interpretable model, while a (possibly) small part of the PIE prediction is attributed to the interaction of features via a…

Machine Learning · Computer Science 2021-05-07 Tong Wang, Jingyi Yang, Yunyi Li, Boxiang Wang

An Additive Instance-Wise Approach to Multi-class Model Interpretation

Interpretable machine learning offers insights into what factors drive a certain prediction of a black-box system. A large number of interpreting methods focus on identifying explanatory input features, which generally fall into two main…

Machine Learning · Computer Science 2023-06-02 Vy Vo, Van Nguyen, Trung Le, Quan Hung Tran, Gholamreza Haffari, Seyit Camtepe, Dinh Phung

Model Interpretability through the Lens of Computational Complexity

In spite of several claims stating that some models are more interpretable than others -- e.g., "linear models are more interpretable than deep neural networks" -- we still lack a principled notion of interpretability to formally compare…

Artificial Intelligence · Computer Science 2020-11-16 Pablo Barceló, Mikaël Monet, Jorge Pérez, Bernardo Subercaseaux

Investigating the Duality of Interpretability and Explainability in Machine Learning

The rapid evolution of machine learning (ML) has led to the widespread adoption of complex "black box" models, such as deep neural networks and ensemble methods. These models exhibit exceptional predictive performance, making them…

Machine Learning · Computer Science 2025-03-28 Moncef Garouani, Josiane Mothe, Ayah Barhrhouj, Julien Aligon

Optimal Piecewise Local-Linear Approximations

Existing works on "black-box" model interpretation use local-linear approximations to explain the predictions made for each data instance in terms of the importance assigned to the different features for arriving at the prediction. These…

Machine Learning · Computer Science 2019-08-28 Kartik Ahuja, William Zame, Mihaela van der Schaar

Explaining Language Models' Predictions with High-Impact Concepts

The emergence of large-scale pretrained language models has posed unprecedented challenges in deriving explanations of why the model has made some predictions. Stemmed from the compositional nature of languages, spurious correlations have…

Computation and Language · Computer Science 2023-05-04 Ruochen Zhao, Shafiq Joty, Yongjie Wang, Tan Wang

Regularizing Black-box Models for Improved Interpretability (HILL 2019 Version)

Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, which lack guarantees about their…

Machine Learning · Computer Science 2019-06-05 Gregory Plumb, Maruan Al-Shedivat, Eric Xing, Ameet Talwalkar

Lifting Interpretability-Performance Trade-off via Automated Feature Engineering

Complex black-box predictive models may have high performance, but their lack of interpretability causes problems such as lack of trust, lack of stability, and sensitivity to concept drift. On the other hand, achieving satisfactory accuracy of…

Machine Learning · Computer Science 2020-02-12 Alicja Gosiewska, Przemyslaw Biecek

Learning Hybrid Interpretable Models: Theory, Taxonomy, and Methods

A hybrid model involves the cooperation of an interpretable model and a complex black box. At inference, any input of the hybrid model is assigned to either its interpretable or complex component based on a gating mechanism. The advantages…

Machine Learning · Computer Science 2023-03-09 Julien Ferry, Gabriel Laberge, Ulrich Aïvodji

A Framework to Learn with Interpretation

To tackle interpretability in deep learning, we present a novel framework to jointly learn a predictive model and its associated interpretation model. The interpreter provides both local and global interpretability about the predictive…

Machine Learning · Computer Science 2022-02-24 Jayneel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc

Interpretation of Prediction Models Using the Input Gradient

State of the art machine learning algorithms are highly optimized to provide the optimal prediction possible, naturally resulting in complex models. While these models often outperform simpler more interpretable models by order of…

Machine Learning · Statistics 2016-11-24 Yotam Hechtlinger

Interpretability Needs a New Paradigm

Interpretability is the study of explaining models in understandable terms to humans. At present, interpretability is divided into two paradigms: the intrinsic paradigm, which believes that only models designed to be explained can be…

Machine Learning · Computer Science 2024-11-14 Andreas Madsen, Himabindu Lakkaraju, Siva Reddy, Sarath Chandar

Achieving interpretable machine learning by functional decomposition of black-box models into explainable predictor effects

Machine learning (ML) has seen significant growth in both popularity and importance. The high prediction accuracy of ML models is often achieved through complex black-box architectures that are difficult to interpret. This interpretability…

Machine Learning · Statistics 2024-07-29 David Köhler, David Rügamer, Matthias Schmid