Related papers: Sub-Gaussian Error Bounds for Hypothesis Testing

Refined Statistical Bounds for Classification Error Mismatches with Constrained Bayes Error

In statistical classification/multiple hypothesis testing and machine learning, a model distribution estimated from the training data is usually applied to replace the unknown true distribution in the Bayes decision rule, which introduces a…

Information Theory · Computer Science 2024-09-24 Zijian Yang, Vahe Eminyan, Ralf Schlüter, Hermann Ney

The Hellinger Bounds on the Kullback-Leibler Divergence and the Bernstein Norm

The Kullback-Leibler divergence, the Kullback-Leibler variation, and the Bernstein "norm" are used to quantify discrepancies among probability distributions in likelihood models such as nonparametric maximum likelihood and nonparametric…

Statistics Theory · Mathematics 2026-01-27 Tetsuya Kaji

On the Universality of the Logistic Loss Function

A loss function measures the discrepancy between the true values (observations) and their estimated fits, for a given instance of data. A loss function is said to be proper (unbiased, Fisher consistent) if the fits are defined over a unit…

Information Theory · Computer Science 2018-05-11 Amichai Painsky, Gregory W. Wornell

On the Properties of Kullback-Leibler Divergence Between Multivariate Gaussian Distributions

Kullback-Leibler (KL) divergence is one of the most important divergence measures between probability distributions. In this paper, we prove several properties of KL divergence between multivariate Gaussian distributions. First, for any two…

Information Theory · Computer Science 2023-01-24 Yufeng Zhang, Wanwei Liu, Zhenbang Chen, Ji Wang, Kenli Li

A Neural Network Algorithm for KL Divergence Estimation with Quantitative Error Bounds

Estimating the Kullback-Leibler (KL) divergence between random variables is a fundamental problem in statistical analysis. For continuous random variables, traditional information-theoretic estimators scale poorly with dimension and/or…

Machine Learning · Computer Science 2025-10-08 Mikil Foss, Andrew Lamperski

A New Estimator of Kullback-Leibler Divergence via Shannon Entropy

We examine the estimation of the Kullback-Leibler (KL) divergence and the use of the goodness-of-fit test for multivariate continuous distributions. Our starting point is the maximum entropy principle for Shannon entropy: among all…

Statistics Theory · Mathematics 2026-03-10 Mehmet Siddik Cadirci, Martin Singull

Notes on Kullback-Leibler Divergence and Likelihood

The Kullback-Leibler (KL) divergence is a fundamental equation of information theory that quantifies the proximity of two probability distributions. Although difficult to understand by examining the equation, an intuition and understanding…

Information Theory · Computer Science 2014-04-09 Jonathon Shlens

Kullback-Leibler Approximation for Probability Measures on Infinite Dimensional Spaces

In a variety of applications it is important to extract information from a probability measure $\mu$ on an infinite dimensional space. Examples include the Bayesian approach to inverse problems and (possibly conditioned) continuous time…

Probability · Mathematics 2016-06-02 Frank Pinski, Gideon Simpson, Andrew Stuart, Hendrik Weber

Taming numerical imprecision by adapting the KL divergence to negative probabilities

The Kullback-Leibler (KL) divergence is frequently used in data science. For discrete distributions on large state spaces, approximations of probability vectors may result in a few small negative entries, rendering the KL divergence…

Computation · Statistics 2024-09-04 Simon Pfahler, Peter Georg, Rudolf Schill, Maren Klever, Lars Grasedyck, Rainer Spang, Tilo Wettig

Second-Order Asymptotics of Hoeffding-Like Hypothesis Tests

We consider a binary statistical hypothesis testing problem, where $n$ independent and identically distributed random variables $Z^n$ are either distributed according to the null hypothesis $P$ or the alternate hypothesis $Q$, and only $P$…

Information Theory · Computer Science 2022-05-12 K. V. Harsha, Jithin Ravi, Tobias Koch

Samplers and Extractors for Unbounded Functions

Blasiok (SODA'18) recently introduced the notion of a subgaussian sampler, defined as an averaging sampler for approximating the mean of functions $f:\{0,1\}^m \to \mathbb{R}$ such that $f(U_m)$ has subgaussian tails, and asked for explicit…

Computational Complexity · Computer Science 2019-09-19 Rohit Agrawal

On the Robustness to Misspecification of $\alpha$-Posteriors and Their Variational Approximations

$\alpha$-posteriors and their variational approximations distort standard posterior inference by downweighting the likelihood and introducing variational approximation errors. We show that such distortions, if tuned appropriately, reduce…

Machine Learning · Statistics 2021-04-20 Marco Avella Medina, José Luis Montiel Olea, Cynthia Rush, Amilcar Velez

A New Lower Bound for Kullback-Leibler Divergence Based on Hammersley-Chapman-Robbins Bound

In this paper, we derive a useful lower bound for the Kullback-Leibler divergence (KL-divergence) based on the Hammersley-Chapman-Robbins bound (HCRB). The HCRB states that the variance of an estimator is bounded from below by the…

Statistics Theory · Mathematics 2019-11-05 Tomohiro Nishiyama

On one-sample Bayesian tests for the mean

This paper deals with a new Bayesian approach to the standard one-sample $z$- and $t$-tests. More specifically, let $x_1,\ldots,x_n$ be an independent random sample from a normal distribution with mean $\mu$ and variance $\sigma^2$. The…

Statistics Theory · Mathematics 2020-04-01 Ibrahim Abdelrazeq, Luai Al-Labadi

Risk Bounds for Mixture Density Estimation on Compact Domains via the $h$-Lifted Kullback-Leibler Divergence

We consider the problem of estimating probability density functions based on sample data, using a finite mixture of densities from some component class. To this end, we introduce the $h$-lifted Kullback-Leibler (KL) divergence as a…

Machine Learning · Statistics 2024-12-24 Mark Chiu Chong, Hien Duy Nguyen, TrungTin Nguyen

On model misspecification and KL separation for Gaussian graphical models

We establish bounds on the KL divergence between two multivariate Gaussian distributions in terms of the Hamming distance between the edge sets of the corresponding graphical models. We show that the KL divergence is bounded below by a…

Information Theory · Computer Science 2015-04-06 Varun Jog, Po-Ling Loh

Independent Gaussian Distributions Minimize the Kullback-Leibler (KL) Divergence from Independent Gaussian Distributions

This short note is on a property of the Kullback-Leibler (KL) divergence which indicates that independent Gaussian distributions minimize the KL divergence from given independent Gaussian distributions. The primary purpose of this note is…

Information Theory · Computer Science 2020-12-04 Song Fang, Quanyan Zhu

KL Divergence Between Gaussians: A Step-by-Step Derivation for the Variational Autoencoder Objective

Kullback-Leibler (KL) divergence is a fundamental concept in information theory that quantifies the discrepancy between two probability distributions. In the context of Variational Autoencoders (VAEs), it serves as a central regularization…

Machine Learning · Computer Science 2026-04-14 Andrés Muñoz, Rodrigo Ramele

From Kinetic Theory to AI: a Rediscovery of High-Dimensional Divergences and Their Properties

Selecting an appropriate divergence measure is a critical aspect of machine learning, as it directly impacts model performance. Among the most widely used, we find the Kullback-Leibler (KL) divergence, originally introduced in kinetic…

Mathematical Physics · Physics 2025-07-16 Gennaro Auricchio, Giovanni Brigati, Paolo Giudici, Giuseppe Toscani

Robust Kullback-Leibler Divergence and Universal Hypothesis Testing for Continuous Distributions

Universal hypothesis testing refers to the problem of deciding whether samples come from a nominal distribution or an unknown distribution that is different from the nominal distribution. Hoeffding's test, whose test statistic is equivalent…

Information Theory · Computer Science 2017-11-15 Pengfei Yang, Biao Chen