Related papers: How Does Information Bottleneck Help Deep Learning…

A Critical Review of Information Bottleneck Theory and its Applications to Deep Learning

In the past decade, deep neural networks have seen unparalleled improvements that continue to impact every aspect of today's society. With the development of high performance GPUs and the availability of vast amounts of data, learning…

Machine Learning · Computer Science 2021-05-12 Mohammad Ali Alomrani

Deep Learning and the Information Bottleneck Principle

Deep Neural Networks (DNNs) are analyzed via the theoretical framework of the information bottleneck (IB) principle. We first show that any DNN can be quantified by the mutual information between the layers and the input and output…

Machine Learning · Computer Science 2015-03-10 Naftali Tishby, Noga Zaslavsky

Unpacking Information Bottlenecks: Unifying Information-Theoretic Objectives in Deep Learning

The Information Bottleneck principle offers both a mechanism to explain how deep neural networks train and generalize, as well as a regularized objective with which to train models. However, multiple competing objectives are proposed in the…

Machine Learning · Computer Science 2021-01-06 Andreas Kirsch, Clare Lyle, Yarin Gal

The Information Bottleneck Problem and Its Applications in Machine Learning

Inference capabilities of machine learning (ML) systems skyrocketed in recent years, now playing a pivotal role in various aspects of society. The goal in statistical learning is to use data to obtain simple algorithms for predicting a…

Machine Learning · Computer Science 2020-05-04 Ziv Goldfeld, Yury Polyanskiy

Perturbation Theory for the Information Bottleneck

Extracting relevant information from data is crucial for all forms of learning. The information bottleneck (IB) method formalizes this, offering a mathematically precise and conceptually appealing framework for understanding learning…

Machine Learning · Computer Science 2021-10-27 Vudtiwat Ngampruetikorn, David J. Schwab

Information Flow in Deep Neural Networks

Although deep neural networks have been immensely successful, there is no comprehensive theoretical understanding of how they work or are structured. As a result, deep networks are often seen as black boxes with unclear interpretations and…

Machine Learning · Computer Science 2022-02-22 Ravid Shwartz-Ziv

Elastic Information Bottleneck

Information bottleneck is an information-theoretic principle of representation learning that aims to learn a maximally compressed representation that preserves as much information about labels as possible. Under this principle, two…

Information Theory · Computer Science 2023-11-08 Yuyan Ni, Yanyan Lan, Ao Liu, Zhiming Ma

The Variational Deficiency Bottleneck

We introduce a bottleneck method for learning data representations based on information deficiency, rather than the more traditional information sufficiency. A variational upper bound allows us to implement this method efficiently. The…

Information Theory · Computer Science 2020-11-05 Pradeep Kr. Banerjee, Guido Montúfar

Information Bottleneck: Exact Analysis of (Quantized) Neural Networks

The information bottleneck (IB) principle has been suggested as a way to analyze deep neural networks. The learning dynamics are studied by inspecting the mutual information (MI) between the hidden layers and the input and output. Notably,…

Machine Learning · Computer Science 2022-02-15 Stephan Sloth Lorenzen, Christian Igel, Mads Nielsen

Drill the Cork of Information Bottleneck by Inputting the Most Important Data

Deep learning has become the most powerful machine learning tool in the last decade. However, how to efficiently train deep neural networks remains to be thoroughly solved. The widely used minibatch stochastic gradient descent (SGD) still…

Machine Learning · Computer Science 2021-05-18 Xinyu Peng, Jiawei Zhang, Fei-Yue Wang, Li Li

Information Bottleneck and its Applications in Deep Learning

Information Theory (IT) has been used in Machine Learning (ML) from early days of this field. In the last decade, advances in Deep Neural Networks (DNNs) have led to surprising improvements in many applications of ML. The result has been a…

Machine Learning · Computer Science 2019-04-09 Hassan Hafez-Kolahi, Shohreh Kasaei

Explaining Grokking and Information Bottleneck through Neural Collapse Emergence

The training dynamics of deep neural networks often defy expectations, even as these models form the foundation of modern machine learning. Two prominent examples are grokking, where test performance improves abruptly long after the…

Machine Learning · Computer Science 2026-01-28 Keitaro Sakamoto, Issei Sato

To Compress or Not to Compress - Self-Supervised Learning and Information Theory: A Review

Deep neural networks excel in supervised learning tasks but are constrained by the need for extensive labeled data. Self-supervised learning emerges as a promising alternative, allowing models to learn without explicit labels. Information…

Machine Learning · Computer Science 2023-11-22 Ravid Shwartz-Ziv, Yann LeCun

A Generalized Information Bottleneck Theory of Deep Learning

The Information Bottleneck (IB) principle offers a compelling theoretical framework to understand how neural networks (NNs) learn. However, its practical utility has been constrained by unresolved theoretical ambiguities and significant…

Machine Learning · Computer Science 2026-02-02 Charles Westphal, Stephen Hailes, Mirco Musolesi

Towards Information Theory-Based Discovery of Equivariances

The presence of symmetries imposes a stringent set of constraints on a system. This constrained structure allows intelligent agents interacting with such a system to drastically improve the efficiency of learning and generalization, through…

Information Theory · Computer Science 2024-10-03 Hippolyte Charvin, Nicola Catenacci Volpi, Daniel Polani

Causal learning with sufficient statistics: an information bottleneck approach

The inference of causal relationships using observational data from partially observed multivariate systems with hidden variables is a fundamental question in many scientific domains. Methods extracting causal information from conditional…

Machine Learning · Statistics 2020-10-13 Daniel Chicharro, Michel Besserve, Stefano Panzeri

Tighter Bounds on the Information Bottleneck with Application to Deep Learning

Deep Neural Nets (DNNs) learn latent representations induced by their downstream task, objective function, and other parameters. The quality of the learned representations impacts the DNN's generalization ability and the coherence of the…

Machine Learning · Computer Science 2024-02-13 Nir Weingarten, Zohar Yakhini, Moshe Butman, Ran Gilad-Bachrach

The Information Bottleneck Principle in Corporate Hierarchies

The hierarchical nature of corporate information processing is a topic of great interest in economic and management literature. Firms are characterised by a need to make complex decisions, often aggregating partial and uncertain…

Social and Information Networks · Computer Science 2022-10-27 Cameron Gordon

Multivariate Information Bottleneck

The Information bottleneck method is an unsupervised non-parametric data organization technique. Given a joint distribution P(A,B), this method constructs a new variable T that extracts partitions, or clusters, over the values of A that are…

Machine Learning · Computer Science 2013-01-14 Nir Friedman, Ori Mosenzon, Noam Slonim, Naftali Tishby

Layer-wise Learning of Stochastic Neural Networks with Information Bottleneck

Information Bottleneck (IB) is a generalization of rate-distortion theory that naturally incorporates compression and relevance trade-offs for learning. Though the original IB has been extensively studied, there has not been much…

Machine Learning · Computer Science 2019-10-08 Thanh T. Nguyen, Jaesik Choi