Mutual Information Multinomial Estimation
Machine Learning
2024-08-20 v1 Information Theory
math.IT
Machine Learning
Abstract
Estimating mutual information (MI) is a fundamental yet challenging task in data science and machine learning. This work proposes a new estimator for mutual information. Our main discovery is that a preliminary estimate of the data distribution can dramatically help estimate. This preliminary estimate serves as a bridge between the joint and the marginal distribution, and by comparing with this bridge distribution we can easily obtain the true difference between the joint distributions and the marginal distributions. Experiments on diverse tasks including non-Gaussian synthetic problems with known ground-truth and real-world applications demonstrate the advantages of our method.
Cite
@article{arxiv.2408.09377,
title = {Mutual Information Multinomial Estimation},
author = {Yanzhi Chen and Zijing Ou and Adrian Weller and Yingzhen Li},
journal= {arXiv preprint arXiv:2408.09377},
year = {2024}
}