Related papers: Machines Explaining Linear Programs

Attributions Beyond Neural Networks: The Linear Program Case

Linear Programs (LPs) have been one of the building blocks in machine learning and have championed recent strides in differentiable optimizers for learning systems. While there exist solvers for even high-dimensional LPs, understanding said…

Machine Learning · Computer Science 2022-06-16 Florian Peter Busch , Matej Zečević , Kristian Kersting , Devendra Singh Dhami

Logic Traps in Evaluating Attribution Scores

Modern deep learning models are notoriously opaque, which has motivated the development of methods for interpreting how deep models predict. This goal is usually approached with attribution method, which assesses the influence of features…

Machine Learning · Computer Science 2023-03-07 Yiming Ju , Yuanzhe Zhang , Zhao Yang , Zhongtao Jiang , Kang Liu , Jun Zhao

Toward Understanding the Disagreement Problem in Neural Network Feature Attribution

In recent years, neural networks have demonstrated their remarkable ability to discern intricate patterns and relationships from raw data. However, understanding the inner workings of these black box models remains challenging, yet crucial…

Machine Learning · Statistics 2024-04-18 Niklas Koenen , Marvin N. Wright

Benchmarking the Attribution Quality of Vision Models

Attribution maps are one of the most established tools to explain the functioning of computer vision models. They assign importance scores to input features, indicating how relevant each feature is for the prediction of a deep neural…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Robin Hesse , Simone Schaub-Meyer , Stefan Roth

Making Neural Networks Interpretable with Attribution: Application to Implicit Signals Prediction

Explaining recommendations enables users to understand whether recommended items are relevant to their needs and has been shown to increase their trust in the system. More generally, if designing explainable machine learning models is key…

Machine Learning · Computer Science 2020-08-27 Darius Afchar , Romain Hennequin

Attribution Explanations for Deep Neural Networks: A Theoretical Perspective

Attribution explanation is a typical approach for explaining deep neural networks (DNNs), inferring an importance or contribution score for each input variable to the final output. In recent years, numerous attribution methods have been…

Machine Learning · Computer Science 2025-08-12 Huiqi Deng , Hongbin Pei , Quanshi Zhang , Mengnan Du

Utilizing Mutations to Evaluate Interpretability of Neural Networks on Genomic Data

Even though deep neural networks (DNNs) achieve state-of-the-art results for a number of problems involving genomic data, getting DNNs to explain their decision-making process has been a major challenge due to their black-box nature. One…

Genomics · Quantitative Biology 2022-12-14 Utku Ozbulak , Solha Kang , Jasper Zuallaert , Stephen Depuydt , Joris Vankerschaver

Learning how to explain neural networks: PatternNet and PatternAttribution

DeConvNet, Guided BackProp, LRP, were invented to better understand deep neural networks. We show that these methods do not produce the theoretically correct explanation for a linear model. Yet they are used on multi-layer networks with…

Machine Learning · Statistics 2017-10-26 Pieter-Jan Kindermans , Kristof T. Schütt , Maximilian Alber , Klaus-Robert Müller , Dumitru Erhan , Been Kim , Sven Dähne

Understanding Deep Networks via Extremal Perturbations and Smooth Masks

The problem of attribution is concerned with identifying the parts of an input that are responsible for a model's output. An important family of attribution methods is based on measuring the effect of perturbations applied to the input. In…

Computer Vision and Pattern Recognition · Computer Science 2019-10-21 Ruth Fong , Mandela Patrick , Andrea Vedaldi

Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation

Recent advancement in deep-neural network performance led to the development of new state-of-the-art approaches in numerous areas. However, the black-box nature of neural networks often prohibits their use in areas where model…

Computer Vision and Pattern Recognition · Computer Science 2024-12-13 Davor Vukadin , Petar Afrić , Marin Šilić , Goran Delač

Model Explainability in Deep Learning Based Natural Language Processing

Machine learning (ML) model explainability has received growing attention, especially in the area related to model risk and regulations. In this paper, we reviewed and compared some popular ML model explainability methodologies, especially…

Artificial Intelligence · Computer Science 2021-06-15 Shafie Gholizadeh , Nengfeng Zhou

Interpreting Interpretations: Organizing Attribution Methods by Criteria

Motivated by distinct, though related, criteria, a growing number of attribution methods have been developed tointerprete deep learning. While each relies on the interpretability of the concept of "importance" and our ability to visualize…

Artificial Intelligence · Computer Science 2020-04-07 Zifan Wang , Piotr Mardziel , Anupam Datta , Matt Fredrikson

CHALLENGER: Training with Attribution Maps

We show that utilizing attribution maps for training neural networks can improve regularization of models and thus increase performance. Regularization is key in deep learning, especially when training complex models on relatively small…

Machine Learning · Computer Science 2022-05-31 Christian Tomani , Daniel Cremers

Time to Focus: A Comprehensive Benchmark Using Time Series Attribution Methods

In the last decade neural network have made huge impact both in industry and research due to their ability to extract meaningful features from imprecise or complex data, and by achieving super human performance in several domains. However,…

Artificial Intelligence · Computer Science 2022-02-09 Dominique Mercier , Jwalin Bhatt , Andreas Dengel , Sheraz Ahmed

Explaining, Evaluating and Enhancing Neural Networks' Learned Representations

Most efforts in interpretability in deep learning have focused on (1) extracting explanations of a specific downstream task in relation to the input features and (2) imposing constraints on the model, often at the expense of predictive…

Machine Learning · Computer Science 2022-02-22 Marco Bertolini , Djork-Arné Clevert , Floriane Montanari

Evaluating Human Alignment and Model Faithfulness of LLM Rationale

We study how well large language models (LLMs) explain their generations through rationales -- a set of tokens extracted from the input text that reflect the decision-making process of LLMs. Specifically, we systematically study rationales…

Computation and Language · Computer Science 2024-10-23 Mohsen Fayyaz , Fan Yin , Jiao Sun , Nanyun Peng

Enhancing Causal Reasoning in Large Language Models: A Causal Attribution Model for Precision Fine-Tuning

This paper introduces a causal attribution model to enhance the interpretability of large language models (LLMs) and improve their causal reasoning abilities via precise fine-tuning. Despite LLMs' proficiency in diverse tasks, their…

Artificial Intelligence · Computer Science 2026-05-22 Hengrui Cai , Shengjie Liu , Rui Song

Training Feature Attribution for Vision Models

Deep neural networks are often considered opaque systems, prompting the need for explainability methods to improve trust and accountability. Existing approaches typically attribute test-time predictions either to input features (e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-13 Aziz Bacha , Thomas George

LLM attribution analysis across different fine-tuning strategies and model scales for automated code compliance

Existing research on large language models (LLMs) for automated code compliance has primarily focused on performance, treating the models as black boxes and overlooking how training decisions affect their interpretive behavior. This paper…

Computation and Language · Computer Science 2026-04-20 Jack Wei Lun Shi , Minghao Dang , Wawan Solihin , Justin K. W. Yeoh

Adaptive Methods for Linear Programming Decoding

Detectability of failures of linear programming (LP) decoding and the potential for improvement by adding new constraints motivate the use of an adaptive approach in selecting the constraints for the underlying LP problem. In this paper, we…

Information Theory · Computer Science 2007-07-13 Mohammad H. Taghavi , Paul H. Siegel