Related papers: Maximizing Multi-Information

Maximally Informative Statistics

In this paper we propose a Bayesian, information theoretic approach to dimensionality reduction. The approach is formulated as a variational principle on mutual information, and seamlessly addresses the notions of sufficiency, relevance,…

Data Analysis, Statistics and Probability · Physics 2007-05-23 David R. Wolf, Edward I. George

Expansion of the Kullback-Leibler Divergence, and a new class of information metrics

Inferring and comparing complex, multivariable probability density functions is fundamental to problems in several fields, including probabilistic learning, network theory, and data analysis. Classification and prediction are the two faces…

Information Theory · Computer Science 2017-03-30 David J. Galas, T. Gregory Dewey, James Kunert-Graf, Nikita A. Sakhanenko

Factorized Mutual Information Maximization

We investigate the sets of joint probability distributions that maximize the average multi-information over a collection of margins. These functionals serve as proxies for maximizing the multi-information of a set of variables or the mutual…

Information Theory · Computer Science 2019-06-14 Thomas Merkh, Guido Montúfar

Finding the Maximizers of the Information Divergence from an Exponential Family

This paper investigates maximizers of the information divergence from an exponential family $E$. It is shown that the $rI$-projection of a maximizer $P$ to $E$ is a convex combination of $P$ and a probability measure $P_-$ with disjoint…

Information Theory · Computer Science 2014-06-18 Johannes Rauh

Semiparametric estimation of mutual information and related criteria: optimal test of independence

We derive independence tests by means of dependence measures thresholding in a semiparametric context. Precisely, estimates of phi-mutual informations, associated to phi-divergences between a joint distribution and the product distribution…

Statistics Theory · Mathematics 2015-08-20 Amor Keziou, Philippe Regnault

Maximizing Multivariate Information with Error-Correcting Codes

Multivariate mutual information provides a conceptual framework for characterizing higher-order interactions in complex systems. Two well-known measures of multivariate information (total correlation and dual total correlation) admit a…

Information Theory · Computer Science 2018-11-28 Kyle Reing, Greg Ver Steeg, Aram Galstyan

Revisiting Chernoff Information with Likelihood Ratio Exponential Families

The Chernoff information between two probability measures is a statistical divergence measuring their deviation defined as their maximally skewed Bhattacharyya distance. Although the Chernoff information was originally introduced for…

Information Theory · Computer Science 2022-10-04 Frank Nielsen

Distributionally Robust Parametric Maximum Likelihood Estimation

We consider the parameter estimation problem of a probabilistic generative model prescribed using a natural exponential family of distributions. For this problem, the typical maximum likelihood estimator usually overfits under limited…

Machine Learning · Statistics 2020-10-13 Viet Anh Nguyen, Xuhui Zhang, Jose Blanchet, Angelos Georghiou

Robust Stochastic Outperformance under Kullback-Leibler Ambiguity

We study the worst-case probability that $Y$ outperforms a benchmark $X$ when the law of $Y$ lies in a Kullback-Leibler neighbourhood of the benchmark. The max-min problem over couplings admits a tractable dual (via optimal transport),…

Probability · Mathematics 2025-09-03 Ozan Hür

Optimally approximating exponential families

This article studies exponential families $\mathcal{E}$ on finite sets such that the information divergence $D(P\|\mathcal{E})$ of an arbitrary probability distribution from $\mathcal{E}$ is bounded by some constant $D>0$. A particular…

Statistics Theory · Mathematics 2014-06-18 Johannes Rauh

Connecting Jensen-Shannon and Kullback-Leibler Divergences: A New Bound for Representation Learning

Mutual Information (MI) is a fundamental measure of statistical dependence widely used in representation learning. While direct optimization of MI via its definition as a Kullback-Leibler divergence (KLD) is often intractable, many recent…

Machine Learning · Computer Science 2026-03-18 Reuben Dorent, Polina Golland, William Wells

Interdependent Bilateral Trade: Information vs Approximation

Welfare maximization in bilateral trade has been extensively studied in recent years. Previous literature obtained incentive-compatible approximation mechanisms only for the private values case. In this paper, we study welfare maximization…

Computer Science and Game Theory · Computer Science 2025-07-01 Shahar Dobzinski, Alon Eden, Kira Goldner, Ariel Shaulker, Thodoris Tsilivis

Distributed Estimation, Information Loss and Exponential Families

Distributed learning of probabilistic models from multiple data repositories with minimum communication is increasingly important. We study a simple communication-efficient learning framework that first calculates the local maximum…

Machine Learning · Statistics 2014-10-13 Qiang Liu, Alexander Ihler

Mutual Information in Coupled Double Quantum Dots: A Simple Analytic Model for Potential Artificial Consciousness

The integrated information theory is thought to be a key clue towards the theoretical understanding of consciousness. In this study, we propose a simple numerical model comprising a set of coupled double quantum dots, where the…

Quantum Physics · Physics 2020-09-30 Katsuaki Tanabe

Investigation of Alternative Measures for Mutual Information

Mutual information $I(X;Y)$ is a useful definition in information theory to estimate how much information the random variable $Y$ holds about the random variable $X$. One way to define the mutual information is by comparing the joint…

Information Theory · Computer Science 2022-04-14 Bulut Kuskonmaz, Jaron Skovsted Gundersen, Rafal Wisniewski

The diameter of a stochastic matrix: A new measure for sensitivity analysis in Bayesian networks

Bayesian networks are one of the most widely used classes of probabilistic models for risk management and decision support because of their interpretability and flexibility in including heterogeneous pieces of information. In any applied…

Methodology · Statistics 2024-07-08 Manuele Leonelli, Jim Q. Smith, Sophia K. Wright

MLE convergence speed to information projection of exponential family: Criterion for model dimension and sample size -- complete proof version--

For a parametric model of distributions, the closest distribution in the model to the true distribution located outside the model is considered. Measuring the closeness between two distributions with the Kullback-Leibler (K-L) divergence,…

Statistics Theory · Mathematics 2025-10-14 Yo Sheena

Visualizing probabilistic models in Minkowski space with intensive symmetrized Kullback-Leibler embedding

We show that the predicted probability distributions for any $N$-parameter statistical model taking the form of an exponential family can be explicitly and analytically embedded isometrically in a $N{+}N$-dimensional Minkowski space. That…

Statistical Mechanics · Physics 2020-08-12 Han Kheng Teoh, Katherine N. Quinn, Jaron Kent-Dobias, Colin B. Clement, Qingyang Xu, James P. Sethna

Theoretical properties of the log-concave maximum likelihood estimator of a multidimensional density

We present theoretical properties of the log-concave maximum likelihood estimator of a density based on an independent and identically distributed sample in $\mathbb{R}^d$. Our study covers both the case where the true underlying density is…

Statistics Theory · Mathematics 2009-09-01 Madeleine Cule, Richard Samworth

Relative Entropy and Statistics

Formalising the confrontation of opinions (models) to observations (data) is the task of Inferential Statistics. Information Theory provides us with a basic functional, the relative entropy (or Kullback-Leibler divergence), an asymmetrical…

Information Theory · Computer Science 2015-03-13 François Bavaud