Related papers: Understanding Interpretability by generalized dist…

Quantifying Interpretability and Trust in Machine Learning Systems

Decisions by Machine Learning (ML) models have become ubiquitous. Trusting these decisions requires understanding how algorithms take them. Hence interpretability methods for ML are an active focus of research. A central problem in this…

Machine Learning · Computer Science 2019-01-25 Philipp Schmidt , Felix Biessmann

TIP: Typifying the Interpretability of Procedures

We provide a novel notion of what it means to be interpretable, looking past the usual association with human understanding. Our key insight is that interpretability is not an absolute concept and so we define it relative to a target model,…

Artificial Intelligence · Computer Science 2018-10-30 Amit Dhurandhar , Vijay Iyengar , Ronny Luss , Karthikeyan Shanmugam

What is Interpretable? Using Machine Learning to Design Interpretable Decision-Support Systems

Recent efforts in Machine Learning (ML) interpretability have focused on creating methods for explaining black-box ML models. However, these methods rely on the assumption that simple approximations, such as linear models or decision-trees,…

Machine Learning · Computer Science 2019-06-13 Owen Lahav , Nicholas Mastronarde , Mihaela van der Schaar

A Theory of Diagnostic Interpretation in Supervised Classification

Interpretable deep learning is a fundamental building block towards safer AI, especially when the deployment possibilities of deep learning-based computer-aided medical diagnostic systems are so eminent. However, without a computational…

Machine Learning · Computer Science 2018-06-27 Anirban Mukhopadhyay

Improving the Interpretability of Deep Neural Networks with Knowledge Distillation

Deep Neural Networks have achieved huge success at a wide spectrum of applications from language modeling, computer vision to speech recognition. However, nowadays, good performance alone is not sufficient to satisfy the needs of practical…

Machine Learning · Computer Science 2018-12-31 Xuan Liu , Xiaoguang Wang , Stan Matwin

Interpretable Deep Learning: Interpretation, Interpretability, Trustworthiness, and Beyond

Deep neural networks have been well-known for their superb handling of various machine learning and artificial intelligence tasks. However, due to their over-parameterized black-box nature, it is often difficult to understand the prediction…

Machine Learning · Computer Science 2022-07-18 Xuhong Li , Haoyi Xiong , Xingjian Li , Xuanyu Wu , Xiao Zhang , Ji Liu , Jiang Bian , Dejing Dou

Distilling Interpretable Models into Human-Readable Code

The goal of model distillation is to faithfully transfer teacher model knowledge to a model which is faster, more generalizable, more interpretable, or possesses other desirable characteristics. Human-readability is an important and…

Machine Learning · Computer Science 2021-02-10 Walker Ravina , Ethan Sterling , Olexiy Oryeshko , Nathan Bell , Honglei Zhuang , Xuanhui Wang , Yonghui Wu , Alexander Grushetsky

Interpretable machine learning: definitions, methods, and applications

Machine-learning models have demonstrated great success in learning complex patterns that enable them to make predictions about unobserved data. In addition to using models for prediction, the ability to interpret what a model has learned…

Machine Learning · Statistics 2019-11-15 W. James Murdoch , Chandan Singh , Karl Kumbier , Reza Abbasi-Asl , Bin Yu

Interpretable and Explainable Machine Learning Methods for Predictive Process Monitoring: A Systematic Literature Review

This paper presents a systematic literature review (SLR) on the explainability and interpretability of machine learning (ML) models within the context of predictive process mining, using the PRISMA framework. Given the rapid advancement of…

Machine Learning · Computer Science 2024-01-01 Nijat Mehdiyev , Maxim Majlatow , Peter Fettke

Interpretable Deep Convolutional Neural Networks via Meta-learning

Model interpretability is a requirement in many applications in which crucial decisions are made by users relying on a model's outputs. The recent movement for "algorithmic fairness" also stipulates explainability, and therefore…

Machine Learning · Computer Science 2018-08-21 Xuan Liu , Xiaoguang Wang , Stan Matwin

A Framework to Learn with Interpretation

To tackle interpretability in deep learning, we present a novel framework to jointly learn a predictive model and its associated interpretation model. The interpreter provides both local and global interpretability about the predictive…

Machine Learning · Computer Science 2022-02-24 Jayneel Parekh , Pavlo Mozharovskyi , Florence d'Alché-Buc

Supervision Complexity and its Role in Knowledge Distillation

Despite the popularity and efficacy of knowledge distillation, there is limited understanding of why it helps. In order to study the generalization behavior of a distilled student, we propose a new theoretical framework that leverages…

Machine Learning · Computer Science 2023-01-31 Hrayr Harutyunyan , Ankit Singh Rawat , Aditya Krishna Menon , Seungyeon Kim , Sanjiv Kumar

Distilling Model Knowledge

Top-performing machine learning systems, such as deep neural networks, large ensembles and complex probabilistic graphical models, can be expensive to store, slow to evaluate and hard to integrate into larger systems. Ideally, we would like…

Machine Learning · Statistics 2015-10-09 George Papamakarios

Local Interpretability of Calibrated Prediction Models: A Case of Type 2 Diabetes Mellitus Screening Test

Machine Learning (ML) models are often complex and difficult to interpret due to their 'black-box' characteristics. Interpretability of a ML model is usually defined as the degree to which a human can understand the cause of decisions…

Methodology · Statistics 2020-06-25 Simon Kocbek , Primoz Kocbek , Leona Cilar , Gregor Stiglic

Local Interpretations for Explainable Natural Language Processing: A Survey

As the use of deep learning techniques has grown across various fields over the past decade, complaints about the opaqueness of the black-box models have increased, resulting in an increased focus on transparency in deep learning models.…

Computation and Language · Computer Science 2024-03-19 Siwen Luo , Hamish Ivison , Caren Han , Josiah Poon

A deep Natural Language Inference predictor without language-specific training data

In this paper we present a technique of NLP to tackle the problem of inference relation (NLI) between pairs of sentences in a target language of choice without a language-specific training dataset. We exploit a generic translation dataset,…

Computation and Language · Computer Science 2023-09-07 Lorenzo Corradi , Alessandro Manenti , Francesca Del Bonifro , Francesco Setti , Dario Del Sorbo

Interpretable Few-Shot Learning via Linear Distillation

It is important to develop mathematically tractable models than can interpret knowledge extracted from the data and provide reasonable predictions. In this paper, we present a Linear Distillation Learning, a simple remedy to improve the…

Machine Learning · Computer Science 2019-10-14 Arip Asadulaev , Igor Kuznetsov , Andrey Filchenkov

Model Distillation for Revenue Optimization: Interpretable Personalized Pricing

Data-driven pricing strategies are becoming increasingly common, where customers are offered a personalized price based on features that are predictive of their valuation of a product. It is desirable for this pricing policy to be simple…

Machine Learning · Statistics 2021-06-11 Max Biggs , Wei Sun , Markus Ettl

Explaining Knowledge Distillation by Quantifying the Knowledge

This paper presents a method to interpret the success of knowledge distillation by quantifying and analyzing task-relevant and task-irrelevant visual concepts that are encoded in intermediate layers of a deep neural network (DNN). More…

Machine Learning · Computer Science 2020-03-26 Xu Cheng , Zhefan Rao , Yilan Chen , Quanshi Zhang

Explainability as statistical inference

A wide variety of model explanation approaches have been proposed in recent years, all guided by very different rationales and heuristics. In this paper, we take a new route and cast interpretability as a statistical inference problem. We…

Machine Learning · Computer Science 2024-01-01 Hugo Henri Joseph Senetaire , Damien Garreau , Jes Frellsen , Pierre-Alexandre Mattei