Related papers: Information Subtraction: Learning Representations …

Sequential Disentanglement by Extracting Static Information From A Single Sequence Element

One of the fundamental representation learning tasks is unsupervised sequential disentanglement, where latent codes of inputs are decomposed to a single static factor and a sequence of dynamic factors. To extract this latent information,…

Machine Learning · Computer Science 2025-10-09 Nimrod Berman , Ilan Naiman , Idan Arbiv , Gal Fadlon , Omri Azencot

Representation Unlearning: Forgetting through Information Compression

Machine unlearning seeks to remove the influence of specific training data from a model, a need driven by privacy regulations and robustness concerns. Existing approaches typically modify model parameters, but such updates can be unstable,…

Machine Learning · Computer Science 2026-05-29 Antonio Almudévar , Alfonso Ortega

Information Theoretic Perspective on Representation Learning

An information-theoretic framework is introduced to analyze last-layer embedding, focusing on learned representations for regression tasks. We define representation-rate and derive limits on the reliability with which input-output…

Information Theory · Computer Science 2026-05-27 Deborah Pereg , Michael Wand

Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

Learning disentangled representations of natural language is essential for many NLP tasks, e.g., conditional text generation, style transfer, personalized dialogue systems, etc. Similar problems have been studied extensively for other forms…

Machine Learning · Computer Science 2022-01-13 Pengyu Cheng , Martin Renqiang Min , Dinghan Shen , Christopher Malon , Yizhe Zhang , Yitong Li , Lawrence Carin

Leveraging Relational Information for Learning Weakly Disentangled Representations

Disentanglement is a difficult property to enforce in neural representations. This might be due, in part, to a formalization of the disentanglement problem that focuses too heavily on separating relevant factors of variation of the data in…

Machine Learning · Computer Science 2022-05-23 Andrea Valenti , Davide Bacciu

Interpretability with full complexity by constraining feature information

Interpretability is a pressing issue for machine learning. Common approaches to interpretable machine learning constrain interactions between features of the input, rendering the effects of those features on a model's output comprehensible…

Machine Learning · Computer Science 2023-05-11 Kieran A. Murphy , Dani S. Bassett

Nonparametric Maximum Entropy Estimation on Information Diagrams

Maximum entropy estimation is of broad interest for inferring properties of systems across many different disciplines. In this work, we significantly extend a technique we previously introduced for estimating the maximum entropy of a set of…

Data Analysis, Statistics and Probability · Physics 2016-01-05 Elliot A. Martin , Jaroslav Hlinka , Alexander Meinke , Filip Děchtěrenko , Jörn Davidsen

Compressed Predictive Information Coding

Unsupervised learning plays an important role in many fields, such as artificial intelligence, machine learning, and neuroscience. Compared to static data, methods for extracting low-dimensional structure for dynamic data are lagging. We…

Machine Learning · Computer Science 2022-03-07 Rui Meng , Tianyi Luo , Kristofer Bouchard

Disentangled Representations via Synergy Minimization

Scientists often seek simplified representations of complex systems to facilitate prediction and understanding. If the factors comprising a representation allow us to make accurate predictions about our system, but obscuring any subset of…

Machine Learning · Computer Science 2017-10-12 Greg Ver Steeg , Rob Brekelmans , Hrayr Harutyunyan , Aram Galstyan

Learning Discrete Structured Representations by Adversarially Maximizing Mutual Information

We propose learning discrete structured representations from unlabeled data by maximizing the mutual information between a structured latent variable and a target variable. Calculating mutual information is intractable in this setting. Our…

Machine Learning · Computer Science 2020-07-17 Karl Stratos , Sam Wiseman

Entropy and mutual information in models of deep neural networks

We examine a class of deep learning models with a tractable method to compute information-theoretic quantities. Our contributions are three-fold: (i) We show how entropies and mutual informations can be derived from heuristic statistical…

Machine Learning · Computer Science 2020-01-22 Marylou Gabrié , Andre Manoel , Clément Luneau , Jean Barbier , Nicolas Macris , Florent Krzakala , Lenka Zdeborová

Towards Better Understanding of Disentangled Representations via Mutual Information

Most existing works on disentangled representation learning are solely built upon an marginal independence assumption: all factors in disentangled representations should be statistically independent. This assumption is necessary but…

Machine Learning · Computer Science 2020-07-02 Xiaojiang Yang , Wendong Bi , Yitong Sun , Yu Cheng , Junchi Yan

Submodular Combinatorial Information Measures with Applications in Machine Learning

Information-theoretic quantities like entropy and mutual information have found numerous uses in machine learning. It is well known that there is a strong connection between these entropic quantities and submodularity since entropy over a…

Machine Learning · Computer Science 2021-03-04 Rishabh Iyer , Ninad Khargonkar , Jeff Bilmes , Himanshu Asnani

Disentangled Text Representation Learning with Information-Theoretic Perspective for Adversarial Robustness

Adversarial vulnerability remains a major obstacle to constructing reliable NLP systems. When imperceptible perturbations are added to raw input text, the performance of a deep learning model may drop dramatically under attacks. Recent work…

Computation and Language · Computer Science 2022-10-28 Jiahao Zhao , Wenji Mao

Entropy and Source Coding for Integer-Dimensional Singular Random Variables

Entropy and differential entropy are important quantities in information theory. A tractable extension to singular random variables-which are neither discrete nor continuous-has not been available so far. Here, we present such an extension…

Information Theory · Computer Science 2017-01-04 Günther Koliander , Georg Pichler , Erwin Riegler , Franz Hlawatsch

Information Theoretic Representation Distillation

Despite the empirical success of knowledge distillation, current state-of-the-art methods are computationally expensive to train, which makes them difficult to adopt in practice. To address this problem, we introduce two distinct…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Roy Miles , Adrian Lopez Rodriguez , Krystian Mikolajczyk

Representation Learning with Conditional Information Flow Maximization

This paper proposes an information-theoretic representation learning framework, named conditional information flow maximization, to extract noise-invariant sufficient representations for the input data and target task. It promotes the…

Machine Learning · Computer Science 2024-08-13 Dou Hu , Lingwei Wei , Wei Zhou , Songlin Hu

Learning Disentangled Representations via Mutual Information Estimation

In this paper, we investigate the problem of learning disentangled representations. Given a pair of images sharing some attributes, we aim to create a low-dimensional representation which is split into two parts: a shared representation…

Machine Learning · Statistics 2019-12-10 Eduardo Hugo Sanchez , Mathieu Serrurier , Mathias Ortner

Revisiting Disentanglement in Downstream Tasks: A Study on Its Necessity for Abstract Visual Reasoning

In representation learning, a disentangled representation is highly desirable as it encodes generative factors of data in a separable and compact pattern. Researchers have advocated leveraging disentangled representations to complete…

Computer Vision and Pattern Recognition · Computer Science 2024-03-04 Ruiqian Nai , Zixin Wen , Ji Li , Yuanzhi Li , Yang Gao

Synergy, suppression and immorality: forward differences of the entropy function

Conditional mutual information is important in the selection and interpretation of graphical models. Its empirical version is well known as a generalised likelihood ratio test and that it may be represented as a difference in entropy. We…

Methodology · Statistics 2015-01-20 Joe Whittaker , Florian Martin , Yang Xiang