Related papers: IB-AdCSCNet:Adaptive Convolutional Sparse Coding N…

Adversarial Information Bottleneck

The information bottleneck (IB) principle has been adopted to explain deep learning in terms of information compression and prediction, which are balanced by a trade-off hyperparameter. How to optimize the IB principle for better robustness…

Machine Learning · Computer Science 2021-03-04 Penglong Zhai , Shihua Zhang

Elastic Information Bottleneck

Information bottleneck is an information-theoretic principle of representation learning that aims to learn a maximally compressed representation that preserves as much information about labels as possible. Under this principle, two…

Information Theory · Computer Science 2023-11-08 Yuyan Ni , Yanyan Lan , Ao Liu , Zhiming Ma

A Critical Review of Information Bottleneck Theory and its Applications to Deep Learning

In the past decade, deep neural networks have seen unparalleled improvements that continue to impact every aspect of today's society. With the development of high performance GPUs and the availability of vast amounts of data, learning…

Machine Learning · Computer Science 2021-05-12 Mohammad Ali Alomrani

Layer-wise Learning of Stochastic Neural Networks with Information Bottleneck

Information Bottleneck (IB) is a generalization of rate-distortion theory that naturally incorporates compression and relevance trade-offs for learning. Though the original IB has been extensively studied, there has not been much…

Machine Learning · Computer Science 2019-10-08 Thanh T. Nguyen , Jaesik Choi

Cauchy-Schwarz Divergence Information Bottleneck for Regression

The information bottleneck (IB) approach is popular to improve the generalization, robustness and explainability of deep neural networks. Essentially, it aims to find a minimum sufficient representation $\mathbf{t}$ by striking a trade-off…

Machine Learning · Computer Science 2024-04-30 Shujian Yu , Xi Yu , Sigurd Løkse , Robert Jenssen , Jose C. Principe

Tighter Bounds on the Information Bottleneck with Application to Deep Learning

Deep Neural Nets (DNNs) learn latent representations induced by their downstream task, objective function, and other parameters. The quality of the learned representations impacts the DNN's generalization ability and the coherence of the…

Machine Learning · Computer Science 2024-02-13 Nir Weingarten , Zohar Yakhini , Moshe Butman , Ran Gilad-Bachrach

Opportunistic Information-Bottleneck for Goal-oriented Feature Extraction and Communication

The Information Bottleneck (IB) method is an information theoretical framework to design a parsimonious and tunable feature-extraction mechanism, such that the extracted features are maximally relevant to a specific learning or inference…

Signal Processing · Electrical Eng. & Systems 2024-04-17 Francesco Binucci , Paolo Banelli , Paolo Di Lorenzo , Sergio Barbarossa

Causal Information Bottleneck Boosts Adversarial Robustness of Deep Neural Network

The information bottleneck (IB) method is a feasible defense solution against adversarial attacks in deep learning. However, this method suffers from the spurious correlation, which leads to the limitation of its further improvement of…

Machine Learning · Computer Science 2022-10-27 Huan Hua , Jun Yan , Xi Fang , Weiquan Huang , Huilin Yin , Wancheng Ge

Recognizing Predictive Substructures with Subgraph Information Bottleneck

The emergence of Graph Convolutional Network (GCN) has greatly boosted the progress of graph learning. However, two disturbing factors, noise and redundancy in graph data, and lack of interpretation for prediction results, impede further…

Machine Learning · Computer Science 2021-03-23 Junchi Yu , Tingyang Xu , Yu Rong , Yatao Bian , Junzhou Huang , Ran He

Information Bottleneck Analysis of Deep Neural Networks via Lossy Compression

The Information Bottleneck (IB) principle offers an information-theoretic framework for analyzing the training process of deep neural networks (DNNs). Its essence lies in tracking the dynamics of two mutual information (MI) values: between…

Machine Learning · Computer Science 2024-05-10 Ivan Butakov , Alexander Tolmachev , Sofia Malanchuk , Anna Neopryatnaya , Alexey Frolov , Kirill Andreev

Learning Representations for Neural Network-Based Classification Using the Information Bottleneck Principle

In this theory paper, we investigate training deep neural networks (DNNs) for classification via minimizing the information bottleneck (IB) functional. We show that the resulting optimization problem suffers from two severe issues: First,…

Machine Learning · Computer Science 2020-08-10 Rana Ali Amjad , Bernhard C. Geiger

Drill the Cork of Information Bottleneck by Inputting the Most Important Data

Deep learning has become the most powerful machine learning tool in the last decade. However, how to efficiently train deep neural networks remains to be thoroughly solved. The widely used minibatch stochastic gradient descent (SGD) still…

Machine Learning · Computer Science 2021-05-18 Xinyu Peng , Jiawei Zhang , Fei-Yue Wang , Li Li

Information Bottleneck: Exact Analysis of (Quantized) Neural Networks

The information bottleneck (IB) principle has been suggested as a way to analyze deep neural networks. The learning dynamics are studied by inspecting the mutual information (MI) between the hidden layers and the input and output. Notably,…

Machine Learning · Computer Science 2022-02-15 Stephan Sloth Lorenzen , Christian Igel , Mads Nielsen

Deep Learning and the Information Bottleneck Principle

Deep Neural Networks (DNNs) are analyzed via the theoretical framework of the information bottleneck (IB) principle. We first show that any DNN can be quantified by the mutual information between the layers and the input and output…

Machine Learning · Computer Science 2015-03-10 Naftali Tishby , Noga Zaslavsky

Goal-Oriented Communication for Edge Learning based on the Information Bottleneck

Whenever communication takes place to fulfil a goal, an effective way to encode the source data to be transmitted is to use an encoding rule that allows the receiver to meet the requirements of the goal. A formal way to identify the…

Signal Processing · Electrical Eng. & Systems 2022-11-24 Francesco Pezone , Sergio Barbarossa , Paolo Di Lorenzo

Mixture of Balanced Information Bottlenecks for Long-Tailed Visual Recognition

Deep neural networks (DNNs) have achieved significant success in various applications with large-scale and balanced data. However, data in real-world visual recognition are usually long-tailed, bringing challenges to efficient training and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Yifan Lan , Xin Cai , Jun Cheng , Shan Tan

The Information Bottleneck's Ordinary Differential Equation: First-Order Root-Tracking for the IB

The Information Bottleneck (IB) is a method of lossy compression of relevant information. Its rate-distortion (RD) curve describes the fundamental tradeoff between input compression and the preservation of relevant information embedded in…

Information Theory · Computer Science 2023-07-27 Shlomi Agmon

Nonlinear Information Bottleneck

Information bottleneck (IB) is a technique for extracting information in one random variable $X$ that is relevant for predicting another random variable $Y$. IB works by encoding $X$ in a compressed "bottleneck" random variable $M$ from…

Information Theory · Computer Science 2022-11-22 Artemy Kolchinsky , Brendan D. Tracey , David H. Wolpert

Structured IB: Improving Information Bottleneck with Structured Feature Learning

The Information Bottleneck (IB) principle has emerged as a promising approach for enhancing the generalization, robustness, and interpretability of deep neural networks, demonstrating efficacy across image segmentation, document clustering,…

Information Theory · Computer Science 2025-04-18 Hanzhe Yang , Youlong Wu , Dingzhu Wen , Yong Zhou , Yuanming Shi

IBCapsNet: Information Bottleneck Capsule Network for Noise-Robust Representation Learning

Capsule networks (CapsNets) are superior at modeling hierarchical spatial relationships but suffer from two critical limitations: high computational cost due to iterative dynamic routing and poor robustness under input corruptions. To…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Canqun Xiang , Chen Yang , Jiaoyan Zhao