English
Related papers

Related papers: How Does Information Bottleneck Help Deep Learning…

200 papers

In the past decade, deep neural networks have seen unparalleled improvements that continue to impact every aspect of today's society. With the development of high performance GPUs and the availability of vast amounts of data, learning…

Machine Learning · Computer Science 2021-05-12 Mohammad Ali Alomrani

Deep Neural Networks (DNNs) are analyzed via the theoretical framework of the information bottleneck (IB) principle. We first show that any DNN can be quantified by the mutual information between the layers and the input and output…

Machine Learning · Computer Science 2015-03-10 Naftali Tishby , Noga Zaslavsky

The Information Bottleneck principle offers both a mechanism to explain how deep neural networks train and generalize, as well as a regularized objective with which to train models. However, multiple competing objectives are proposed in the…

Machine Learning · Computer Science 2021-01-06 Andreas Kirsch , Clare Lyle , Yarin Gal

Inference capabilities of machine learning (ML) systems skyrocketed in recent years, now playing a pivotal role in various aspect of society. The goal in statistical learning is to use data to obtain simple algorithms for predicting a…

Machine Learning · Computer Science 2020-05-04 Ziv Goldfeld , Yury Polyanskiy

Extracting relevant information from data is crucial for all forms of learning. The information bottleneck (IB) method formalizes this, offering a mathematically precise and conceptually appealing framework for understanding learning…

Machine Learning · Computer Science 2021-10-27 Vudtiwat Ngampruetikorn , David J. Schwab

Although deep neural networks have been immensely successful, there is no comprehensive theoretical understanding of how they work or are structured. As a result, deep networks are often seen as black boxes with unclear interpretations and…

Machine Learning · Computer Science 2022-02-22 Ravid Shwartz-Ziv

Information bottleneck is an information-theoretic principle of representation learning that aims to learn a maximally compressed representation that preserves as much information about labels as possible. Under this principle, two…

Information Theory · Computer Science 2023-11-08 Yuyan Ni , Yanyan Lan , Ao Liu , Zhiming Ma

We introduce a bottleneck method for learning data representations based on information deficiency, rather than the more traditional information sufficiency. A variational upper bound allows us to implement this method efficiently. The…

Information Theory · Computer Science 2020-11-05 Pradeep Kr. Banerjee , Guido Montúfar

The information bottleneck (IB) principle has been suggested as a way to analyze deep neural networks. The learning dynamics are studied by inspecting the mutual information (MI) between the hidden layers and the input and output. Notably,…

Machine Learning · Computer Science 2022-02-15 Stephan Sloth Lorenzen , Christian Igel , Mads Nielsen

Deep learning has become the most powerful machine learning tool in the last decade. However, how to efficiently train deep neural networks remains to be thoroughly solved. The widely used minibatch stochastic gradient descent (SGD) still…

Machine Learning · Computer Science 2021-05-18 Xinyu Peng , Jiawei Zhang , Fei-Yue Wang , Li Li

Information Theory (IT) has been used in Machine Learning (ML) from early days of this field. In the last decade, advances in Deep Neural Networks (DNNs) have led to surprising improvements in many applications of ML. The result has been a…

Machine Learning · Computer Science 2019-04-09 Hassan Hafez-Kolahi , Shohreh Kasaei

The training dynamics of deep neural networks often defy expectations, even as these models form the foundation of modern machine learning. Two prominent examples are grokking, where test performance improves abruptly long after the…

Machine Learning · Computer Science 2026-01-28 Keitaro Sakamoto , Issei Sato

Deep neural networks excel in supervised learning tasks but are constrained by the need for extensive labeled data. Self-supervised learning emerges as a promising alternative, allowing models to learn without explicit labels. Information…

Machine Learning · Computer Science 2023-11-22 Ravid Shwartz-Ziv , Yann LeCun

The Information Bottleneck (IB) principle offers a compelling theoretical framework to understand how neural networks (NNs) learn. However, its practical utility has been constrained by unresolved theoretical ambiguities and significant…

Machine Learning · Computer Science 2026-02-02 Charles Westphal , Stephen Hailes , Mirco Musolesi

The presence of symmetries imposes a stringent set of constraints on a system. This constrained structure allows intelligent agents interacting with such a system to drastically improve the efficiency of learning and generalization, through…

Information Theory · Computer Science 2024-10-03 Hippolyte Charvin , Nicola Catenacci Volpi , Daniel Polani

The inference of causal relationships using observational data from partially observed multivariate systems with hidden variables is a fundamental question in many scientific domains. Methods extracting causal information from conditional…

Machine Learning · Statistics 2020-10-13 Daniel Chicharro , Michel Besserve , Stefano Panzeri

Deep Neural Nets (DNNs) learn latent representations induced by their downstream task, objective function, and other parameters. The quality of the learned representations impacts the DNN's generalization ability and the coherence of the…

Machine Learning · Computer Science 2024-02-13 Nir Weingarten , Zohar Yakhini , Moshe Butman , Ran Gilad-Bachrach

The hierarchical nature of corporate information processing is a topic of great interest in economic and management literature. Firms are characterised by a need to make complex decisions, often aggregating partial and uncertain…

Social and Information Networks · Computer Science 2022-10-27 Cameron Gordon

The Information bottleneck method is an unsupervised non-parametric data organization technique. Given a joint distribution P(A,B), this method constructs a new variable T that extracts partitions, or clusters, over the values of A that are…

Machine Learning · Computer Science 2013-01-14 Nir Friedman , Ori Mosenzon , Noam Slonim , Naftali Tishby

Information Bottleneck (IB) is a generalization of rate-distortion theory that naturally incorporates compression and relevance trade-offs for learning. Though the original IB has been extensively studied, there has not been much…

Machine Learning · Computer Science 2019-10-08 Thanh T. Nguyen , Jaesik Choi
‹ Prev 1 2 3 10 Next ›