Related papers: Explaining Representation by Mutual Information

200 papers

Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data. This comes with several immediate…

Machine Learning · Computer Science 2020-01-24 Michael Tschannen, Josip Djolonga, Paul K. Rubenstein, Sylvain Gelly, Mario Lucic

In neural networks, task-relevant information is represented jointly by groups of neurons. However, the specific way in which this mutual information about the classification label is distributed among the individual neurons is not well…

Information Theory · Computer Science 2023-06-08 David A. Ehrlich, Andreas C. Schneider, Viola Priesemann, Michael Wibral, Abdullah Makkeh

In this paper, we investigate the problem of learning disentangled representations. Given a pair of images sharing some attributes, we aim to create a low-dimensional representation which is split into two parts: a shared representation…

Machine Learning · Statistics 2019-12-10 Eduardo Hugo Sanchez, Mathieu Serrurier, Mathias Ortner

Learning good representations is of crucial importance in deep learning. Mutual Information (MI) or similar measures of statistical dependence are promising tools for learning these representations in an unsupervised way. Even though the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-09 Mirco Ravanelli, Yoshua Bengio

We introduce the Mutual Information Machine (MIM), a novel formulation of representation learning, using a joint distribution over the observations and latent state in an encoder/decoder framework. Our key principles are symmetry and mutual…

Machine Learning · Statistics 2019-10-10 Micha Livne, Kevin Swersky, David J. Fleet

Mechanistic Interpretability (MI) promises a path toward fully understanding how neural networks make their predictions. Prior work demonstrates that even when trained to perform simple arithmetic, models can implement a variety of…

Machine Learning · Computer Science 2024-05-28 Ouail Kitouni, Niklas Nolte, Víctor Samuel Pérez-Díaz, Sokratis Trifinopoulos, Mike Williams

We show state-of-the-art word representation learning methods maximize an objective function that is a lower bound on the mutual information between different parts of a word sequence (i.e., a sentence). Our formulation provides an…

Computation and Language · Computer Science 2019-11-27 Lingpeng Kong, Cyprien de Masson d'Autume, Wang Ling, Lei Yu, Zihang Dai, Dani Yogatama

This paper presents a novel approach to machine learning algorithm design based on information theory, specifically mutual information (MI). We propose a framework for learning and representing functional relationships in data using…

Machine Learning · Computer Science 2024-09-24 Jeremy Nixon

Recent contrastive representation learning methods rely on estimating mutual information (MI) between multiple views of an underlying context. E.g., we can derive multiple views of a given image by applying data augmentation, or we can…

Machine Learning · Computer Science 2021-06-28 Alessandro Sordoni, Nouha Dziri, Hannes Schulz, Geoff Gordon, Phil Bachman, Remi Tachet

Mechanistic interpretability (MI) aims to understand AI models by reverse-engineering the exact algorithms neural networks learn. Most works in MI so far have studied behaviors and capabilities that are trivial and token-aligned. However,…

Machine Learning · Computer Science 2024-07-15 Satvik Golechha, James Dao

We develop the use of mutual information (MI), a well-established metric in information theory, to interpret the inner workings of deep learning models. To accurately estimate MI from a finite number of samples, we present GMM-MI…

Data Analysis, Statistics and Probability · Physics 2023-04-12 Davide Piras, Hiranya V. Peiris, Andrew Pontzen, Luisa Lucie-Smith, Ningyuan Guo, Brian Nord

Mechanistic Interpretability aims to understand neural networks through causal explanations. We argue for the Explanatory View Hypothesis: that Mechanistic Interpretability research is a principled approach to understanding models because…

Machine Learning · Computer Science 2025-05-05 Kola Ayonrinde, Louis Jaburi

Recently, Mutual Information (MI) has attracted attention in bounding the generalization error of Deep Neural Networks (DNNs). However, it is intractable to accurately estimate the MI in DNNs, thus most previous works have to relax the MI…

Machine Learning · Computer Science 2021-06-21 Xinjie Lan, Kenneth Barner

This paper introduces a representative-based approach for distributed learning that transforms multiple raw data points into a virtual representation. Unlike traditional distributed learning methods such as Federated Learning, which do not…

Machine Learning · Computer Science 2025-02-12 Mengchen Fan, Baocheng Geng, Keren Li, Xueqian Wang, Pramod K. Varshney

Mechanistic interpretability (MI) is an emerging framework for interpreting neural networks. Given a task and model, MI aims to discover a succinct algorithmic process, an interpretation, that explains the model's decision process on that…

Machine Learning · Computer Science 2026-04-01 Alan Sun, Mariya Toneva

Many applications in image-guided surgery and therapy require fast and reliable non-linear, multi-modal image registration. Recently proposed unsupervised deep learning-based registration methods have demonstrated superior performance…

Image and Video Processing · Electrical Eng. & Systems 2022-10-07 Gerard Snaauw, Michele Sasdelli, Gabriel Maicas, Stephan Lau, Johan Verjans, Mark Jenkinson, Gustavo Carneiro

Interpreting the decision logic behind effective deep convolutional neural networks (CNN) on images complements the success of deep learning models. However, the existing methods can only interpret some specific decision logic on individual…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Peter Cho-Ho Lam, Lingyang Chu, Maxim Torgonskiy, Jian Pei, Yong Zhang, Lanjun Wang

Selective rationalization improves neural network interpretability by identifying a small subset of input features -- the rationale -- that best explains or supports the prediction. A typical rationalization criterion, i.e. maximum mutual…

Machine Learning · Computer Science 2020-03-24 Shiyu Chang, Yang Zhang, Mo Yu, Tommi S. Jaakkola

Recently, maximizing mutual information has emerged as a powerful method for unsupervised graph representation learning. The existing methods are typically effective to capture information from the topology view but ignore the feature view.…

Machine Learning · Computer Science 2022-10-12 Xiaolong Fan, Maoguo Gong, Yue Wu, Hao Li

The lack of interpretability is an inevitable problem when using neural network models in real applications. In this paper, an explainable neural network based on generalized additive models with structured interactions (GAMI-Net) is…

Machine Learning · Statistics 2021-06-03 Zebin Yang, Aijun Zhang, Agus Sudjianto