Related papers: Explaining Representation by Mutual Information

On Mutual Information Maximization for Representation Learning

Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data. This comes with several immediate…

Machine Learning · Computer Science 2020-01-24 Michael Tschannen , Josip Djolonga , Paul K. Rubenstein , Sylvain Gelly , Mario Lucic

A Measure of the Complexity of Neural Representations based on Partial Information Decomposition

In neural networks, task-relevant information is represented jointly by groups of neurons. However, the specific way in which this mutual information about the classification label is distributed among the individual neurons is not well…

Information Theory · Computer Science 2023-06-08 David A. Ehrlich , Andreas C. Schneider , Viola Priesemann , Michael Wibral , Abdullah Makkeh

Learning Disentangled Representations via Mutual Information Estimation

In this paper, we investigate the problem of learning disentangled representations. Given a pair of images sharing some attributes, we aim to create a low-dimensional representation which is split into two parts: a shared representation…

Machine Learning · Statistics 2019-12-10 Eduardo Hugo Sanchez , Mathieu Serrurier , Mathias Ortner

Learning Speaker Representations with Mutual Information

Learning good representations is of crucial importance in deep learning. Mutual Information (MI) or similar measures of statistical dependence are promising tools for learning these representations in an unsupervised way. Even though the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-09 Mirco Ravanelli , Yoshua Bengio

High Mutual Information in Representation Learning with Symmetric Variational Inference

We introduce the Mutual Information Machine (MIM), a novel formulation of representation learning, using a joint distribution over the observations and latent state in an encoder/decoder framework. Our key principles are symmetry and mutual…

Machine Learning · Statistics 2019-10-10 Micha Livne , Kevin Swersky , David J. Fleet

From Neurons to Neutrons: A Case Study in Interpretability

Mechanistic Interpretability (MI) promises a path toward fully understanding how neural networks make their predictions. Prior work demonstrates that even when trained to perform simple arithmetic, models can implement a variety of…

Machine Learning · Computer Science 2024-05-28 Ouail Kitouni , Niklas Nolte , Víctor Samuel Pérez-Díaz , Sokratis Trifinopoulos , Mike Williams

A Mutual Information Maximization Perspective of Language Representation Learning

We show state-of-the-art word representation learning methods maximize an objective function that is a lower bound on the mutual information between different parts of a word sequence (i.e., a sentence). Our formulation provides an…

Computation and Language · Computer Science 2019-11-27 Lingpeng Kong , Cyprien de Masson d'Autume , Wang Ling , Lei Yu , Zihang Dai , Dani Yogatama

Structure Learning via Mutual Information

This paper presents a novel approach to machine learning algorithm design based on information theory, specifically mutual information (MI). We propose a framework for learning and representing functional relationships in data using…

Machine Learning · Computer Science 2024-09-24 Jeremy Nixon

Decomposed Mutual Information Estimation for Contrastive Representation Learning

Recent contrastive representation learning methods rely on estimating mutual information (MI) between multiple views of an underlying context. E.g., we can derive multiple views of a given image by applying data augmentation, or we can…

Machine Learning · Computer Science 2021-06-28 Alessandro Sordoni , Nouha Dziri , Hannes Schulz , Geoff Gordon , Phil Bachman , Remi Tachet

Challenges in Mechanistically Interpreting Model Representations

Mechanistic interpretability (MI) aims to understand AI models by reverse-engineering the exact algorithms neural networks learn. Most works in MI so far have studied behaviors and capabilities that are trivial and token-aligned. However,…

Machine Learning · Computer Science 2024-07-15 Satvik Golechha , James Dao

A robust estimator of mutual information for deep learning interpretability

We develop the use of mutual information (MI), a well-established metric in information theory, to interpret the inner workings of deep learning models. To accurately estimate MI from a finite number of samples, we present GMM-MI…

Data Analysis, Statistics and Probability · Physics 2023-04-12 Davide Piras , Hiranya V. Peiris , Andrew Pontzen , Luisa Lucie-Smith , Ningyuan Guo , Brian Nord

A Mathematical Philosophy of Explanations in Mechanistic Interpretability -- The Strange Science Part I.i

Mechanistic Interpretability aims to understand neural networks through causal explanations. We argue for the Explanatory View Hypothesis: that Mechanistic Interpretability research is a principled approach to understanding models because…

Machine Learning · Computer Science 2025-05-05 Kola Ayonrinde , Louis Jaburi

A Probabilistic Representation of DNNs: Bridging Mutual Information and Generalization

Recently, Mutual Information (MI) has attracted attention in bounding the generalization error of Deep Neural Networks (DNNs). However, it is intractable to accurately estimate the MI in DNNs, thus most previous works have to relax the MI…

Machine Learning · Computer Science 2021-06-21 Xinjie Lan , Kenneth Barner

Interpretable Data Fusion for Distributed Learning: A Representative Approach via Gradient Matching

This paper introduces a representative-based approach for distributed learning that transforms multiple raw data points into a virtual representation. Unlike traditional distributed learning methods such as Federated Learning, which do not…

Machine Learning · Computer Science 2025-02-12 Mengchen Fan , Baocheng Geng , Keren Li , Xueqian Wang , Pramod K. Varshney

Tracking Equivalent Mechanistic Interpretations Across Neural Networks

Mechanistic interpretability (MI) is an emerging framework for interpreting neural networks. Given a task and model, MI aims to discover a succinct algorithmic process, an interpretation, that explains the model's decision process on that…

Machine Learning · Computer Science 2026-04-01 Alan Sun , Mariya Toneva

Mutual information neural estimation for unsupervised multi-modal registration of brain images

Many applications in image-guided surgery and therapy require fast and reliable non-linear, multi-modal image registration. Recently proposed unsupervised deep learning-based registration methods have demonstrated superior performance…

Image and Video Processing · Electrical Eng. & Systems 2022-10-07 Gerard Snaauw , Michele Sasdelli , Gabriel Maicas , Stephan Lau , Johan Verjans , Mark Jenkinson , Gustavo Carneiro

Finding Representative Interpretations on Convolutional Neural Networks

Interpreting the decision logic behind effective deep convolutional neural networks (CNN) on images complements the success of deep learning models. However, the existing methods can only interpret some specific decision logic on individual…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Peter Cho-Ho Lam , Lingyang Chu , Maxim Torgonskiy , Jian Pei , Yong Zhang , Lanjun Wang

Invariant Rationalization

Selective rationalization improves neural network interpretability by identifying a small subset of input features -- the rationale -- that best explains or supports the prediction. A typical rationalization criterion, i.e. maximum mutual…

Machine Learning · Computer Science 2020-03-24 Shiyu Chang , Yang Zhang , Mo Yu , Tommi S. Jaakkola

Maximizing Mutual Information Across Feature and Topology Views for Learning Graph Representations

Recently, maximizing mutual information has emerged as a powerful method for unsupervised graph representation learning. The existing methods are typically effective to capture information from the topology view but ignore the feature view.…

Machine Learning · Computer Science 2022-10-12 Xiaolong Fan , Maoguo Gong , Yue Wu , Hao Li

GAMI-Net: An Explainable Neural Network based on Generalized Additive Models with Structured Interactions

The lack of interpretability is an inevitable problem when using neural network models in real applications. In this paper, an explainable neural network based on generalized additive models with structured interactions (GAMI-Net) is…

Machine Learning · Statistics 2021-06-03 Zebin Yang , Aijun Zhang , Agus Sudjianto