Related papers: Explaining Deep Learning Hidden Neuron Activations…

Understanding CNN Hidden Neuron Activations Using Structured Background Knowledge and Deductive Reasoning

A major challenge in Explainable AI is in correctly interpreting activations of hidden neurons: accurate interpretations would provide insights into the question of what a deep learning system has internally detected as relevant on the…

Machine Learning · Computer Science 2023-08-10 Abhilekha Dalal, Md Kamruzzaman Sarker, Adrita Barua, Eugene Vasserman, Pascal Hitzler

On the Value of Labeled Data and Symbolic Methods for Hidden Neuron Activation Analysis

A major challenge in Explainable AI is in correctly interpreting activations of hidden neurons: accurate interpretations would help answer the question of what a deep learning system internally detects as relevant in the input, demystifying…

Artificial Intelligence · Computer Science 2026-02-23 Abhilekha Dalal, Rushrukh Rayan, Adrita Barua, Eugene Y. Vasserman, Md Kamruzzaman Sarker, Pascal Hitzler

A Case Study on Concept Induction for Neuron-Level Interpretability in CNN

Deep Neural Networks (DNNs) have advanced applications in domains such as healthcare, autonomous systems, and scene understanding, yet the internal semantics of their hidden neurons remain poorly understood. Prior work introduced a Concept…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Moumita Sen Sarma, Samatha Ereshi Akkamahadevi, Pascal Hitzler

LLM-assisted Concept Discovery: Automatically Identifying and Explaining Neuron Functions

Providing textual concept-based explanations for neurons in deep neural networks (DNNs) is of importance in understanding how a DNN model works. Prior works have associated concepts with neurons based on examples of concepts or a…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Nhat Hoang-Xuan, Minh Vu, My T. Thai

Concept backpropagation: An Explainable AI approach for visualising learned concepts in neural network models

Neural network models are widely used in a variety of domains, often as black-box solutions, since they are not directly interpretable for humans. The field of explainable artificial intelligence aims at developing explanation methods to…

Machine Learning · Computer Science 2023-07-25 Patrik Hammersborg, Inga Strümke

ConceptLens: from Pixels to Understanding

ConceptLens is an innovative tool designed to illuminate the intricate workings of deep neural networks (DNNs) by visualizing hidden neuron activations. By integrating deep learning with symbolic methods, ConceptLens offers users a unique…

Machine Learning · Computer Science 2024-10-10 Abhilekha Dalal, Pascal Hitzler

Explaining Deep Neural Networks by Leveraging Intrinsic Methods

Despite their impact on society, deep neural networks are often regarded as black-box models due to their intricate structures and the absence of explanations for their decisions. This opacity poses a significant challenge to AI systems…

Machine Learning · Computer Science 2024-07-18 Biagio La Rosa

From Neural Activations to Concepts: A Survey on Explaining Concepts in Neural Networks

In this paper, we review recent approaches for explaining concepts in neural networks. Concepts can act as a natural link between learning and reasoning: once the concepts are identified that a neural learning system uses, one can integrate…

Artificial Intelligence · Computer Science 2024-05-06 Jae Hee Lee, Sergio Lanza, Stefan Wermter

ConceptExplainer: Interactive Explanation for Deep Neural Networks from a Concept Perspective

Traditional deep learning interpretability methods which are suitable for model users cannot explain network behaviors at the global level and are inflexible at providing fine-grained explanations. As a solution, concept-based explanations…

Human-Computer Interaction · Computer Science 2022-10-26 Jinbin Huang, Aditi Mishra, Bum Chul Kwon, Chris Bryan

Image classification network enhancement methods based on knowledge injection

Current deep neural network algorithms still rely on end-to-end supervised training with image-label pairs, which makes it difficult for traditional algorithms to explain the reasons for their results, and the prediction logic is…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Yishuang Tian, Ning Wang, Liang Zhang

How Important Is a Neuron?

The problem of attributing a deep network's prediction to its input/base features is well-studied. We introduce the notion of conductance to extend the notion of attribution to understanding the importance of hidden…

Machine Learning · Computer Science 2018-06-01 Kedar Dhamdhere, Mukund Sundararajan, Qiqi Yan

Expressive Explanations of DNNs by Combining Concept Analysis with ILP

Explainable AI has emerged to be a key component for black-box machine learning approaches in domains with a high demand for reliability or transparency. Examples are medical assistant systems, and applications concerned with the General…

Machine Learning · Computer Science 2021-05-18 Johannes Rabold, Gesina Schwalbe, Ute Schmid

Wider Vision: Enriching Convolutional Neural Networks via Alignment to External Knowledge Bases

Deep learning models suffer from opaqueness. For Convolutional Neural Networks (CNNs), current research strategies for explaining models focus on the target classes within the associated training dataset. As a result, the understanding of…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Xuehao Liu, Sarah Jane Delany, Susan McKeever

Understanding Convolutional Networks with APPLE: Automatic Patch Pattern Labeling for Explanation

With the success of deep learning, recent efforts have been focused on analyzing how learned networks make their classifications. We are interested in analyzing the network output based on the network structure and information flow through…

Machine Learning · Computer Science 2018-02-13 Sandeep Konam, Ian Quah, Stephanie Rosenthal, Manuela Veloso

Hierarchical Semantic Tree Concept Whitening for Interpretable Image Classification

With the popularity of deep neural networks (DNNs), model interpretability is becoming a critical concern. Many approaches have been developed to tackle the problem through post-hoc analysis, such as explaining how predictions are made or…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Haixing Dai, Lu Zhang, Lin Zhao, Zihao Wu, Zhengliang Liu, David Liu, Xiaowei Yu, Yanjun Lyu, Changying Li, Ninghao Liu, Tianming Liu, Dajiang Zhu

Human-Centered Concept Explanations for Neural Networks

Understanding complex machine learning models such as deep neural networks with explanations is crucial in various applications. Many explanations stem from the model perspective, and may not necessarily effectively communicate why the…

Machine Learning · Computer Science 2022-02-28 Chih-Kuan Yeh, Been Kim, Pradeep Ravikumar

Causal Learning and Explanation of Deep Neural Networks via Autoencoded Activations

Deep neural networks are complex and opaque. As they enter application in a variety of important and safety critical domains, users seek methods to explain their output predictions. We develop an approach to explaining deep neural networks…

Artificial Intelligence · Computer Science 2018-02-05 Michael Harradon, Jeff Druce, Brian Ruttenberg

Automated Natural Language Explanation of Deep Visual Neurons with Large Models

Deep neural networks have exhibited remarkable performance across a wide range of real-world tasks. However, comprehending the underlying reasons for their effectiveness remains a challenging problem. Interpreting deep neural networks…

Computer Vision and Pattern Recognition · Computer Science 2023-10-18 Chenxu Zhao, Wei Qian, Yucheng Shi, Mengdi Huai, Ninghao Liu

DeepBase: Deep Inspection of Neural Networks

Although deep learning models perform remarkably well across a range of tasks such as language translation and object recognition, it remains unclear what high-level logic, if any, they follow. Understanding this logic may lead to more…

Databases · Computer Science 2019-01-08 Thibault Sellam, Kevin Lin, Ian Yiran Huang, Yiru Chen, Michelle Yang, Carl Vondrick, Eugene Wu

Neural Activation Patterns (NAPs): Visual Explainability of Learned Concepts

A key to deciphering the inner workings of neural networks is understanding what a model has learned. Promising methods for discovering learned features are based on analyzing activation values, whereby current techniques focus on analyzing…

Machine Learning · Computer Science 2022-06-23 Alex Bäuerle, Daniel Jönsson, Timo Ropinski