Related papers: Dynamic learning rate using Mutual Information

DeepMI: A Mutual Information Based Framework For Unsupervised Deep Learning of Tasks

In this work, we propose an information theory based framework DeepMI to train deep neural networks (DNN) using Mutual Information (MI). The DeepMI framework is especially targeted but not limited to the learning of real world tasks in an…

Computer Vision and Pattern Recognition · Computer Science 2022-03-07 Ashish Kumar , Laxmidhar Behera

Mutual information neural estimation for unsupervised multi-modal registration of brain images

Many applications in image-guided surgery and therapy require fast and reliable non-linear, multi-modal image registration. Recently proposed unsupervised deep learning-based registration methods have demonstrated superior performance…

Image and Video Processing · Electrical Eng. & Systems 2022-10-07 Gerard Snaauw , Michele Sasdelli , Gabriel Maicas , Stephan Lau , Johan Verjans , Mark Jenkinson , Gustavo Carneiro

Structure Learning via Mutual Information

This paper presents a novel approach to machine learning algorithm design based on information theory, specifically mutual information (MI). We propose a framework for learning and representing functional relationships in data using…

Machine Learning · Computer Science 2024-09-24 Jeremy Nixon

Monitoring Shortcut Learning using Mutual Information

The failure of deep neural networks to generalize to out-of-distribution data is a well-known problem and raises concerns about the deployment of trained networks in safety-critical domains such as healthcare, finance and autonomous…

Machine Learning · Computer Science 2022-06-28 Mohammed Adnan , Yani Ioannou , Chuan-Yung Tsai , Angus Galloway , H. R. Tizhoosh , Graham W. Taylor

On Mutual Information Maximization for Representation Learning

Many recent methods for unsupervised or self-supervised representation learning train feature extractors by maximizing an estimate of the mutual information (MI) between different views of the data. This comes with several immediate…

Machine Learning · Computer Science 2020-01-24 Michael Tschannen , Josip Djolonga , Paul K. Rubenstein , Sylvain Gelly , Mario Lucic

Mutual information rate and bounds for it

The amount of information exchanged per unit of time between two nodes in a dynamical network or between two data sets is a powerful concept for analysing complex systems. This quantity, known as the mutual information rate (MIR), is…

Chaotic Dynamics · Physics 2015-05-27 M. S. Baptista , R. M. Rubinger , E. R. V. Junior , J. C. Sartorelli , U. Parlitz , C. Grebogi

Mutual Information Learned Classifiers: an Information-theoretic Viewpoint of Training Deep Learning Classification Systems

Deep learning systems have been reported to acheive state-of-the-art performances in many applications, and one of the keys for achieving this is the existence of well trained classifiers on benchmark datasets which can be used as backbone…

Machine Learning · Computer Science 2022-10-04 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao

Learning Curves for Mutual Information Maximization

An unsupervised learning procedure based on maximizing the mutual information between the outputs of two networks receiving different but statistically dependent inputs is analyzed (Becker and Hinton, Nature, 355, 92, 161). For a generic…

Disordered Systems and Neural Networks · Physics 2009-11-10 Robert Urbanczik

Fast Mutual Information Computation for Large Binary Datasets

Mutual Information (MI) is a powerful statistical measure that quantifies shared information between random variables, particularly valuable in high-dimensional data analysis across fields like genomics, natural language processing, and…

Machine Learning · Computer Science 2024-12-02 Andre O. Falcao

Distributed Differentially Private Mutual Information Ranking and Its Applications

Computation of Mutual Information (MI) helps understand the amount of information shared between a pair of random variables. Automated feature selection techniques based on MI ranking are regularly used to extract information from sensitive…

Cryptography and Security · Computer Science 2020-09-24 Ankit Srivastava , Samira Pouyanfar , Joshua Allen , Ken Johnston , Qida Ma

A Hessian-informed hyperparameter optimization for differential learning rate

Differential learning rate (DLR), a technique that applies different learning rates to different model parameters, has been widely used in deep learning and achieved empirical success via its various forms. For example, parameter-efficient…

Machine Learning · Computer Science 2025-05-20 Shiyun Xu , Zhiqi Bu , Yiliang Zhang , Ian Barnett

Conditional Mutual Information-Based Generalization Bound for Meta Learning

Meta-learning optimizes an inductive bias---typically in the form of the hyperparameters of a base-learning algorithm---by observing data from a finite number of related tasks. This paper presents an information-theoretic bound on the…

Machine Learning · Computer Science 2021-02-09 Arezou Rezazadeh , Sharu Theresa Jose , Giuseppe Durisi , Osvaldo Simeone

Forget the Learning Rate, Decay Loss

In the usual deep neural network optimization process, the learning rate is the most important hyper parameter, which greatly affects the final convergence effect. The purpose of learning rate is to control the stepsize and gradually reduce…

Machine Learning · Computer Science 2019-05-02 Jiakai Wei

What we learn from the learning rate

The learning rate is an information-theoretical quantity for bipartite Markov chains describing two coupled subsystems. It is defined as the rate at which transitions in the downstream subsystem tend to increase the mutual information…

Statistical Mechanics · Physics 2017-07-04 Rory A. Brittain , Nick S. Jones , Thomas E. Ouldridge

A Neural Difference-of-Entropies Estimator for Mutual Information

Estimating Mutual Information (MI), a key measure of dependence of random quantities without specific modelling assumptions, is a challenging problem in high dimensions. We propose a novel mutual information estimator based on parametrizing…

Machine Learning · Statistics 2025-10-24 Haoran Ni , Martin Lotz

Learning Speaker Representations with Mutual Information

Learning good representations is of crucial importance in deep learning. Mutual Information (MI) or similar measures of statistical dependence are promising tools for learning these representations in an unsupervised way. Even though the…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-09 Mirco Ravanelli , Yoshua Bengio

On Mutual Information Neural Estimation for Localization

Mutual information (MI) is a promising candidate measure for the assessment and optimization of localization systems, as it captures nonlinear dependencies between random variables. However, the high cost of computing MI, especially for…

Signal Processing · Electrical Eng. & Systems 2025-11-04 Sven Hinderer , Manuel Buchfink , Bin Yang

On The Utility of Conditional Generation Based Mutual Information for Characterizing Adversarial Subspaces

Recent studies have found that deep learning systems are vulnerable to adversarial examples; e.g., visually unrecognizable adversarial images can easily be crafted to result in misclassification. The robustness of neural networks has been…

Computer Vision and Pattern Recognition · Computer Science 2018-09-25 Chia-Yi Hsu , Pei-Hsuan Lu , Pin-Yu Chen , Chia-Mu Yu

Diversified Mutual Learning for Deep Metric Learning

Mutual learning is an ensemble training strategy to improve generalization by transferring individual knowledge to each other while simultaneously training multiple models. In this work, we propose an effective mutual learning method for…

Computer Vision and Pattern Recognition · Computer Science 2020-09-10 Wonpyo Park , Wonjae Kim , Kihyun You , Minsu Cho

Stochastic Mutual Information Gradient Estimation for Dimensionality Reduction Networks

Feature ranking and selection is a widely used approach in various applications of supervised dimensionality reduction in discriminative machine learning. Nevertheless there exists significant evidence on feature ranking and selection…

Machine Learning · Computer Science 2021-05-04 Ozan Ozdenizci , Deniz Erdogmus