Related papers: Gradient-Free Kernel Stein Discrepancy

Stochastic Stein Discrepancies

Stein discrepancies (SDs) monitor convergence and non-convergence in approximate inference when exact integration and sampling are intractable. However, the computation of a Stein discrepancy can be prohibitive if the Stein operator - often…

Machine Learning · Statistics 2020-10-26 Jackson Gorham , Anant Raj , Lester Mackey

Random Feature Stein Discrepancies

Computable Stein discrepancies have been deployed for a variety of applications, ranging from sampler selection in posterior inference to approximate Bayesian inference to goodness-of-fit testing. Existing convergence-determining Stein…

Machine Learning · Statistics 2021-10-12 Jonathan H. Huggins , Lester Mackey

On the geometry of Stein variational gradient descent

Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely…

Machine Learning · Statistics 2023-02-14 A. Duncan , N. Nuesken , L. Szpruch

Stein Points

An important task in computational statistics and machine learning is to approximate a posterior distribution $p(x)$ with an empirical measure supported on a set of representative points $\{x_i\}_{i=1}^n$. This paper focuses on methods…

Computation · Statistics 2018-06-20 Wilson Ye Chen , Lester Mackey , Jackson Gorham , François-Xavier Briol , Chris J. Oates

Stein Variational Gradient Descent as Moment Matching

Stein variational gradient descent (SVGD) is a non-parametric inference algorithm that evolves a set of particles to fit a given distribution of interest. We analyze the non-asymptotic properties of SVGD, showing that there exists a set of…

Machine Learning · Statistics 2018-10-30 Qiang Liu , Dilin Wang

Stein Variational Inference for Discrete Distributions

Gradient-based approximate inference methods, such as Stein variational gradient descent (SVGD), provide simple and general-purpose inference engines for differentiable continuous distributions. However, existing forms of SVGD cannot be…

Machine Learning · Computer Science 2020-03-03 Jun Han , Fan Ding , Xianglong Liu , Lorenzo Torresani , Jian Peng , Qiang Liu

Kernel Stein Discrepancy Descent

Among dissimilarities between probability distributions, the Kernel Stein Discrepancy (KSD) has received much interest recently. We investigate the properties of its Wasserstein gradient flow to approximate a target probability distribution…

Machine Learning · Statistics 2021-05-24 Anna Korba , Pierre-Cyril Aubin-Frankowski , Szymon Majewski , Pierre Ablin

Kernelized Complete Conditional Stein Discrepancy

Much of machine learning relies on comparing distributions with discrepancy measures. Stein's method creates discrepancy measures between two distributions that require only the unnormalized density of one and samples from the other. Stein…

Machine Learning · Statistics 2020-07-21 Raghav Singhal , Xintian Han , Saad Lahlou , Rajesh Ranganath

Stein Variational Gradient Descent: A General Purpose Bayesian Inference Algorithm

We propose a general purpose variational inference algorithm that forms a natural counterpart of gradient descent for optimization. Our method iteratively transports a set of particles to match the target distribution, by applying a form of…

Machine Learning · Statistics 2019-09-10 Qiang Liu , Dilin Wang

Gradient Estimation with Discrete Stein Operators

Gradient estimation -- approximating the gradient of an expectation with respect to the parameters of a distribution -- is central to the solution of many machine learning problems. However, when the distribution is discrete, most common…

Machine Learning · Statistics 2024-04-16 Jiaxin Shi , Yuhao Zhou , Jessica Hwang , Michalis K. Titsias , Lester Mackey

Minimum Stein Discrepancy Estimators

When maximum likelihood estimation is infeasible, one often turns to score matching, contrastive divergence, or minimum probability flow to obtain tractable parameter estimates. We provide a unifying perspective of these techniques as…

Statistics Theory · Mathematics 2022-10-07 Alessandro Barp , Francois-Xavier Briol , Andrew B. Duncan , Mark Girolami , Lester Mackey

Existence of Stein Kernels under a Spectral Gap, and Discrepancy Bound

We establish existence of Stein kernels for probability measures on $\mathbb{R}^d$ satisfying a Poincar\'e inequality, and obtain bounds on the Stein discrepancy of such measures. Applications to quantitative central limit theorems are…

Probability · Mathematics 2018-03-09 Thomas A. Courtade , Max Fathi , Ashwin Pananjady

Control functionals for Monte Carlo integration

A non-parametric extension of control variates is presented. These leverage gradient information on the sampling density to achieve substantial variance reduction. It is not required that the sampling density be normalised. The novel…

Methodology · Statistics 2016-04-05 Chris J. Oates , Mark Girolami , Nicolas Chopin

Convergence Rates for a Class of Estimators Based on Stein's Method

Gradient information on the sampling distribution can be used to reduce the variance of Monte Carlo estimators via Stein's method. An important application is that of estimating an expectation of a test function along the sample path of a…

Statistics Theory · Mathematics 2017-12-29 Chris J. Oates , Jon Cockayne , François-Xavier Briol , Mark Girolami

Higher-order Stein kernels for Gaussian approximation

We introduce higher-order Stein kernels relative to the standard Gaussian measure, which generalize the usual Stein kernels by involving higher-order derivatives of test functions. We relate the associated discrepancies to various metrics…

Probability · Mathematics 2018-12-07 Max Fathi

Probabilistic Inference and Learning with Stein's Method

This monograph provides a rigorous overview of theoretical and methodological aspects of probabilistic inference and learning with Stein's method. Recipes are provided for constructing Stein discrepancies from Stein operators and Stein…

Machine Learning · Statistics 2026-03-10 Qiang Liu , Lester Mackey , Chris Oates

Targeted Separation and Convergence with Kernel Discrepancies

Maximum mean discrepancies (MMDs) like the kernel Stein discrepancy (KSD) have grown central to a wide range of applications, including hypothesis testing, sampler selection, distribution approximation, and variational inference. In each…

Machine Learning · Statistics 2025-03-26 Alessandro Barp , Carl-Johann Simon-Gabriel , Mark Girolami , Lester Mackey

Gradient Estimators for Implicit Models

Implicit models, which allow for the generation of samples but not for point-wise evaluation of probabilities, are omnipresent in real-world problems tackled by machine learning and a hot topic of current research. Some examples include…

Machine Learning · Statistics 2018-04-27 Yingzhen Li , Richard E. Turner

Importance Sampled Stochastic Optimization for Variational Inference

Variational inference approximates the posterior distribution of a probabilistic model with a parameterized density by maximizing a lower bound for the model evidence. Modern solutions fit a flexible approximation with stochastic gradient…

Machine Learning · Statistics 2017-07-13 Joseph Sakaya , Arto Klami

Stein Variational Evolution Strategies

Stein Variational Gradient Descent (SVGD) is a highly efficient method to sample from an unnormalized probability distribution. However, the SVGD update relies on gradients of the log-density, which may not always be available. Existing…

Machine Learning · Computer Science 2026-03-13 Cornelius V. Braun , Robert T. Lange , Marc Toussaint