Related papers: Aggregated Learning: A Vector-Quantization Approac…

200 papers

Based on the notion of information bottleneck (IB), we formulate a quantization problem called "IB quantization". We show that IB quantization is equivalent to learning based on the IB principle. Under this equivalence, the standard neural…

Machine Learning · Computer Science 2019-02-13 Hongyu Guo, Yongyi Mao, Ali Al-Bashabsheh, Richong Zhang

In this theory paper, we investigate training deep neural networks (DNNs) for classification via minimizing the information bottleneck (IB) functional. We show that the resulting optimization problem suffers from two severe issues: First,…

Machine Learning · Computer Science 2020-08-10 Rana Ali Amjad, Bernhard C. Geiger

Information Bottleneck (IB) is a generalization of rate-distortion theory that naturally incorporates compression and relevance trade-offs for learning. Though the original IB has been extensively studied, there has not been much…

Machine Learning · Computer Science 2019-10-08 Thanh T. Nguyen, Jaesik Choi

The information bottleneck (IB) principle has been suggested as a way to analyze deep neural networks. The learning dynamics are studied by inspecting the mutual information (MI) between the hidden layers and the input and output. Notably,…

Machine Learning · Computer Science 2022-02-15 Stephan Sloth Lorenzen, Christian Igel, Mads Nielsen

The Information Bottleneck (IB) principle offers a compelling theoretical framework to understand how neural networks (NNs) learn. However, its practical utility has been constrained by unresolved theoretical ambiguities and significant…

Machine Learning · Computer Science 2026-02-02 Charles Westphal, Stephen Hailes, Mirco Musolesi

Deep Neural Networks (DNNs) are analyzed via the theoretical framework of the information bottleneck (IB) principle. We first show that any DNN can be quantified by the mutual information between the layers and the input and output…

Machine Learning · Computer Science 2015-03-10 Naftali Tishby, Noga Zaslavsky

In the past decade, deep neural networks have seen unparalleled improvements that continue to impact every aspect of today's society. With the development of high performance GPUs and the availability of vast amounts of data, learning…

Machine Learning · Computer Science 2021-05-12 Mohammad Ali Alomrani

In this work, we generalize the information bottleneck (IB) approach to the multi-view learning context. The exponentially growing complexity of the optimal representation motivates the development of two novel formulations with more…

Information Theory · Computer Science 2022-09-20 Teng-Hui Huang, Aly El Gamal, Hesham El Gamal

Deep neural networks suffer from poor generalization to unseen environments when the underlying data distribution is different from that in the training set. By learning minimum sufficient representations from training data, the information…

Machine Learning · Computer Science 2021-10-13 Francesco Alesiani, Shujian Yu, Xi Yu

The information bottleneck (IB) method is a technique designed to extract meaningful information related to one random variable from another random variable, and has found extensive applications in machine learning problems. In this paper,…

Information Theory · Computer Science 2025-07-29 Lingyi Chen, Shitong Wu, Sicheng Xu, Huihui Wu, Wenyi Zhang

This paper describes a novel design of a neural network-based speech generation model for learning prosodic representation. The problem of representation learning is formulated according to the information bottleneck (IB) principle. A…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-09 Guangyan Zhang, Ying Qin, Daxin Tan, Tan Lee

The information bottleneck principle is an elegant and useful approach to representation learning. In this paper, we investigate the problem of representation learning in the context of reinforcement learning using the information…

Machine Learning · Computer Science 2019-11-14 Pei Yingjun, Hou Xinwen

We study a distributed learning problem in which Alice sends a compressed distillation of a set of training data to Bob, who uses the distilled version to best solve an associated learning problem. We formalize this as a rate-distortion…

Information Theory · Computer Science 2018-10-30 Parinaz Farajiparvar, Ahmad Beirami, Matthew Nokleby

Deep neural networks (DNNs) have achieved significant success in various applications with large-scale and balanced data. However, data in real-world visual recognition are usually long-tailed, bringing challenges to efficient training and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Yifan Lan, Xin Cai, Jun Cheng, Shan Tan

Although deep neural networks have been immensely successful, there is no comprehensive theoretical understanding of how they work or are structured. As a result, deep networks are often seen as black boxes with unclear interpretations and…

Machine Learning · Computer Science 2022-02-22 Ravid Shwartz-Ziv

Extracting relevant information from data is crucial for all forms of learning. The information bottleneck (IB) method formalizes this, offering a mathematically precise and conceptually appealing framework for understanding learning…

Machine Learning · Computer Science 2021-10-27 Vudtiwat Ngampruetikorn, David J. Schwab

The information bottleneck (IB) principle has been adopted to explain deep learning in terms of information compression and prediction, which are balanced by a trade-off hyperparameter. How to optimize the IB principle for better robustness…

Machine Learning · Computer Science 2021-03-04 Penglong Zhai, Shihua Zhang

Deep Neural Nets (DNNs) learn latent representations induced by their downstream task, objective function, and other parameters. The quality of the learned representations impacts the DNN's generalization ability and the coherence of the…

Machine Learning · Computer Science 2024-02-13 Nir Weingarten, Zohar Yakhini, Moshe Butman, Ran Gilad-Bachrach

Inference capabilities of machine learning (ML) systems skyrocketed in recent years, now playing a pivotal role in various aspects of society. The goal in statistical learning is to use data to obtain simple algorithms for predicting a…

Machine Learning · Computer Science 2020-05-04 Ziv Goldfeld, Yury Polyanskiy

Neural collapse describes the geometry of activation in the final layer of a deep neural network when it is trained beyond performance plateaus. Open questions include whether neural collapse leads to better generalization and, if so, why…

Machine Learning · Computer Science 2024-06-28 Siwei Wang, Stephanie E Palmer