Related papers: Statistical Inference for Generative Models with M…

Scalable Kernel-Based Distances for Statistical Inference and Integration

Representing, comparing, and measuring the distance between probability distributions is a key task in computational statistics and machine learning. The choice of representation and the associated distance determine properties of the…

Machine Learning · Statistics 2026-02-26 Masha Naslidnyk

Optimally-Weighted Estimators of the Maximum Mean Discrepancy for Likelihood-Free Inference

Likelihood-free inference methods typically make use of a distance between simulated and real data. A common example is the maximum mean discrepancy (MMD), which has previously been used for approximate Bayesian computation, minimum…

Methodology · Statistics 2023-05-11 Ayush Bharti, Masha Naslidnyk, Oscar Key, Samuel Kaski, François-Xavier Briol

On the Optimization Landscape of Maximum Mean Discrepancy

Generative models have been successfully used for generating realistic signals. Because the likelihood function is typically intractable in most of these models, the common practice is to use "implicit" models that avoid likelihood…

Machine Learning · Computer Science 2024-05-07 Itai Alon, Amir Globerson, Ami Wiesel

Unbiased estimators for the variance of MMD estimators

The maximum mean discrepancy (MMD) is a kernel-based distance between probability distributions useful in many applications (Gretton et al. 2012), bearing a simple estimator with pleasing computational and statistical properties. Being able…

Machine Learning · Statistics 2022-11-16 Danica J. Sutherland, Namrata Deka

Generative Models and Model Criticism via Optimized Maximum Mean Discrepancy

We propose a method to optimize the representation and distinguishability of samples from two probability distributions, by maximizing the estimated power of a statistical test based on the maximum mean discrepancy (MMD). This optimized MMD…

Machine Learning · Statistics 2021-01-15 Danica J. Sutherland, Hsiao-Yu Tung, Heiko Strathmann, Soumyajit De, Aaditya Ramdas, Alex Smola, Arthur Gretton

Discrepancy-based Inference for Intractable Generative Models using Quasi-Monte Carlo

Intractable generative models are models for which the likelihood is unavailable but sampling is possible. Most approaches to parameter inference in this setting require the computation of some discrepancy between the data and the…

Computation · Statistics 2022-07-05 Ziang Niu, Johanna Meier, François-Xavier Briol

Estimation of time series by Maximum Mean Discrepancy

We define two minimum distance estimators for dependent data by minimizing some approximated Maximum Mean Discrepancy distances between the true empirical distribution of observations and their assumed (parametric) model distribution. When…

Methodology · Statistics 2026-01-19 Pierre Alquier, Jean-David Fermanian, Benjamin Poignard

Two-sample Statistics Based on Anisotropic Kernels

The paper introduces a new kernel-based Maximum Mean Discrepancy (MMD) statistic for measuring the distance between two distributions given finitely-many multivariate samples. When the distributions are locally low-dimensional, the proposed…

Machine Learning · Statistics 2018-09-03 Xiuyuan Cheng, Alexander Cloninger, Ronald R. Coifman

A Practical Guide to Sample-based Statistical Distances for Evaluating Generative Models in Science

Generative models are invaluable in many fields of science because of their ability to capture high-dimensional and complicated distributions, such as photo-realistic images, protein structures, and connectomes. How do we evaluate the…

Machine Learning · Computer Science 2024-10-11 Sebastian Bischoff, Alana Darcher, Michael Deistler, Richard Gao, Franziska Gerken, Manuel Gloeckler, Lisa Haxel, Jaivardhan Kapoor, Janne K Lappalainen, Jakob H Macke, Guy Moss, Matthijs Pals, Felix Pei, Rachel Rapp, A Erdem Sağtekin, Cornelius Schröder, Auguste Schulz, Zinovia Stefanidi, Shoji Toyota, Linda Ulmer, Julius Vetter

A Martingale Kernel Two-Sample Test

The Maximum Mean Discrepancy (MMD) is a widely used multivariate distance metric for two-sample testing. The standard MMD test statistic has an intractable null distribution typically requiring costly resampling or permutation approaches…

Methodology · Statistics 2026-02-24 Anirban Chatterjee, Aaditya Ramdas

Two Sample Testing in High Dimension via Maximum Mean Discrepancy

Maximum Mean Discrepancy (MMD) has been widely used in the areas of machine learning and statistics to quantify the distance between two distributions in the $p$-dimensional Euclidean space. The asymptotic property of the sample MMD has…

Statistics Theory · Mathematics 2023-08-29 Hanjia Gao, Xiaofeng Shao

PT-MMD: A Novel Statistical Framework for the Evaluation of Generative Systems

Stochastic-sampling-based Generative Neural Networks, such as Restricted Boltzmann Machines and Generative Adversarial Networks, are now used for applications such as denoising, image occlusion removal, pattern completion, and motion…

Machine Learning · Computer Science 2019-10-29 Alexander Potapov, Ian Colbert, Ken Kreutz-Delgado, Alexander Cloninger, Srinjoy Das

Robust and Efficient Estimation in Ordinal Response Models using the Density Power Divergence

In real life, we frequently come across data sets that involve some independent explanatory variable(s) generating a set of ordinal responses. These ordinal responses may correspond to an underlying continuous latent variable, which is…

Methodology · Statistics 2024-01-08 Arijit Pyne, Subhrajyoty Roy, Abhik Ghosh, Ayanendranath Basu

Keep it Tighter -- A Story on Analytical Mean Embeddings

Kernel techniques are among the most popular and flexible approaches in data science, allowing one to represent probability measures without loss of information under mild conditions. The resulting mapping, called mean embedding, gives rise to a…

Machine Learning · Statistics 2024-11-27 Linda Chamakh, Zoltan Szabo

Fisher Efficient Inference of Intractable Models

Maximum Likelihood Estimators (MLE) have many good properties. For example, the asymptotic variance of the MLE solution attains the asymptotic Cramér-Rao lower bound (efficiency bound), which is the minimum possible variance for…

Machine Learning · Statistics 2019-11-05 Song Liu, Takafumi Kanamori, Wittawat Jitkrittum, Yu Chen

High finite-sample efficiency and robustness based on distance-constrained maximum likelihood

Good robust estimators can be tuned to combine a high breakdown point and a specified asymptotic efficiency at a central model. This happens in regression with MM- and tau-estimators among others. However, the finite-sample efficiency of…

Statistics Theory · Mathematics 2013-11-21 Ricardo Maronna, Víctor Yohai

Optimal quantisation of probability measures using maximum mean discrepancy

Several researchers have proposed minimisation of maximum mean discrepancy (MMD) as a method to quantise probability measures, i.e., to approximate a target distribution by a representative point set. We consider sequential algorithms that…

Machine Learning · Statistics 2021-02-15 Onur Teymur, Jackson Gorham, Marina Riabiz, Chris J. Oates

Convolutional Maximum Mean Discrepancy for Inference in Noisy Data

Modern data analyses frequently encounter settings where samples of variables are contaminated by measurement error. Ignoring measurement noise can substantially degrade statistical inference, while existing correction techniques are often…

Methodology · Statistics 2026-04-15 Ritwik Vashistha, Jeff M. Phillips, Abhra Sarkar, Arya Farahi

Statistical Inference based on Bridge Divergences

M-estimators offer simple robust alternatives to the maximum likelihood estimator. Much of the robustness literature, however, has focused on the problems of location, location-scale and regression estimation rather than on estimation of…

Methodology · Statistics 2017-06-20 Arun Kumar Kuchibhotla, Somabha Mukherjee, Ayanendranath Basu

Maximum Mean Discrepancy on Exponential Windows for Online Change Detection

Detecting changes is of fundamental importance when analyzing data streams and has many applications, e.g., in predictive maintenance, fraud detection, or medicine. A principled approach to detect changes is to compare the distributions of…

Machine Learning · Computer Science 2025-02-13 Florian Kalinke, Marco Heyden, Georg Gntuni, Edouard Fouché, Klemens Böhm