Related papers: Interpretable Neural Network Decoupling

PICNN: A Pathway towards Interpretable Convolutional Neural Networks

Convolutional Neural Networks (CNNs) have exhibited great performance in discriminative feature learning for complex visual tasks. Besides discrimination power, interpretability is another important yet under-explored property for CNNs. One…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Wengang Guo , Jiayi Yang , Huilin Yin , Qijun Chen , Wei Ye

Decoupling Deep Learning for Interpretable Image Recognition

The interpretability of neural networks has recently received extensive attention. Previous prototype-based explainable networks involved prototype activation in both reasoning and interpretation processes, requiring specific explainable…

Computer Vision and Pattern Recognition · Computer Science 2022-11-22 Yitao Peng , Yihang Liu , Longzhen Yang , Lianghua He

Training Interpretable Convolutional Neural Networks by Differentiating Class-specific Filters

Convolutional neural networks (CNNs) have been successfully used in a range of tasks. However, CNNs are often viewed as "black-box" and lack of interpretability. One main reason is due to the filter-class entanglement -- an intricate…

Computer Vision and Pattern Recognition · Computer Science 2021-07-02 Haoyu Liang , Zhihao Ouyang , Yuyuan Zeng , Hang Su , Zihao He , Shu-Tao Xia , Jun Zhu , Bo Zhang

Interpretable Disentanglement of Neural Networks by Extracting Class-Specific Subnetwork

We propose a novel perspective to understand deep neural networks in an interpretable disentanglement form. For each semantic class, we extract a class-specific functional subnetwork from the original full model, with compressed structure…

Machine Learning · Computer Science 2019-10-08 Yulong Wang , Xiaolin Hu , Hang Su

Understanding learned CNN features through Filter Decoding with Substitution

In parallel with the success of CNNs to solve vision problems, there is a growing interest in developing methodologies to understand and visualize the internal representations of these networks. How the responses of a trained CNN encode the…

Computer Vision and Pattern Recognition · Computer Science 2015-11-18 Ivet Rafegas , Maria Vanrell

Interpretable Convolutional Filters with SincNet

Deep learning is currently playing a crucial role toward higher levels of artificial intelligence. This paradigm allows neural networks to learn complex and abstract representations, that are progressively obtained by combining simpler…

Audio and Speech Processing · Electrical Eng. & Systems 2019-08-12 Mirco Ravanelli , Yoshua Bengio

Disentangling Polysemantic Channels in Convolutional Neural Networks

Mechanistic interpretability is concerned with analyzing individual components in a (convolutional) neural network (CNN) and how they form larger circuits representing decision mechanisms. These investigations are challenging since CNNs…

Computer Vision and Pattern Recognition · Computer Science 2025-04-18 Robin Hesse , Jonas Fischer , Simone Schaub-Meyer , Stefan Roth

FICNN: A Framework for the Interpretation of Deep Convolutional Neural Networks

With the continue development of Convolutional Neural Networks (CNNs), there is a growing concern regarding representations that they encode internally. Analyzing these internal representations is referred to as model interpretation. While…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Hamed Behzadi-Khormouji , José Oramas

Generating Interpretable Networks using Hypernetworks

An essential goal in mechanistic interpretability to decode a network, i.e., to convert a neural network's raw weights to an interpretable algorithm. Given the difficulty of the decoding problem, progress has been made to understand the…

Machine Learning · Computer Science 2023-12-07 Isaac Liao , Ziming Liu , Max Tegmark

Towards Combinatorial Interpretability of Neural Computation

We introduce combinatorial interpretability, a methodology for understanding neural computation by analyzing the combinatorial structures in the sign-based categorization of a network's weights and biases. We demonstrate its power through…

Machine Learning · Computer Science 2025-05-07 Micah Adler , Dan Alistarh , Nir Shavit

A Disentangling Invertible Interpretation Network for Explaining Latent Representations

Neural networks have greatly boosted performance in computer vision by learning powerful representations of input data. The drawback of end-to-end training for maximal overall performance are black-box models whose hidden representations…

Computer Vision and Pattern Recognition · Computer Science 2020-04-29 Patrick Esser , Robin Rombach , Björn Ommer

Interpretable Compositional Convolutional Neural Networks

The reasonable definition of semantic interpretability presents the core challenge in explainable AI. This paper proposes a method to modify a traditional convolutional neural network (CNN) into an interpretable compositional CNN, in order…

Computer Vision and Pattern Recognition · Computer Science 2021-07-12 Wen Shen , Zhihua Wei , Shikun Huang , Binbin Zhang , Jiaqi Fan , Ping Zhao , Quanshi Zhang

Visual Interpretability for Deep Learning: a Survey

This paper reviews recent studies in understanding neural-network representations and learning neural networks with interpretable/disentangled middle-layer representations. Although deep neural networks have exhibited superior performance…

Computer Vision and Pattern Recognition · Computer Science 2018-02-08 Quanshi Zhang , Song-Chun Zhu

The SVD of Convolutional Weights: A CNN Interpretability Framework

Deep neural networks used for image classification often use convolutional filters to extract distinguishing features before passing them to a linear classifier. Most interpretability literature focuses on providing semantic meaning to…

Computer Vision and Pattern Recognition · Computer Science 2022-08-16 Brenda Praggastis , Davis Brown , Carlos Ortiz Marrero , Emilie Purvine , Madelyn Shapiro , Bei Wang

Interpreting Deep Visual Representations via Network Dissection

The success of recent deep convolutional neural networks (CNNs) depends on learning hidden representations that can summarize the important factors of variation behind the data. However, CNNs often criticized as being black boxes that lack…

Computer Vision and Pattern Recognition · Computer Science 2018-06-27 Bolei Zhou , David Bau , Aude Oliva , Antonio Torralba

Neighborhood Convolutional Network: A New Paradigm of Graph Neural Networks for Node Classification

The decoupled Graph Convolutional Network (GCN), a recent development of GCN that decouples the neighborhood aggregation and feature transformation in each convolutional layer, has shown promising performance for graph representation…

Machine Learning · Computer Science 2022-11-16 Jinsong Chen , Boyu Li , Kun He

Interpretable CNNs for Object Classification

This paper proposes a generic method to learn interpretable convolutional filters in a deep convolutional neural network (CNN) for object classification, where each interpretable filter encodes features of a specific object part. Our method…

Machine Learning · Computer Science 2020-03-13 Quanshi Zhang , Xin Wang , Ying Nian Wu , Huilin Zhou , Song-Chun Zhu

Modular Representation of Layered Neural Networks

Layered neural networks have greatly improved the performance of various applications including image processing, speech recognition, natural language processing, and bioinformatics. However, it is still difficult to discover or interpret…

Machine Learning · Statistics 2017-10-05 Chihiro Watanabe , Kaoru Hiramatsu , Kunio Kashino

Interpretable Convolutional Neural Networks

This paper proposes a method to modify traditional convolutional neural networks (CNNs) into interpretable CNNs, in order to clarify knowledge representations in high conv-layers of CNNs. In an interpretable CNN, each filter in a high…

Computer Vision and Pattern Recognition · Computer Science 2018-02-15 Quanshi Zhang , Ying Nian Wu , Song-Chun Zhu

InDeed: Interpretable image deep decomposition with guaranteed generalizability

Image decomposition aims to analyze an image into elementary components, which is essential for numerous downstream tasks and also by nature provides certain interpretability to the analysis. Deep learning can be powerful for such tasks,…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Sihan Wang , Shangqi Gao , Fuping Wu , Xiahai Zhuang