Related papers: Gradient Estimation with Discrete Stein Operators

Double Control Variates for Gradient Estimation in Discrete Latent Variable Models

Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the high variance of gradients. We introduce a variance reduction technique for score function estimators that makes use of double control…

Machine Learning · Statistics 2022-06-07 Michalis K. Titsias, Jiaxin Shi

Stein Variational Inference for Discrete Distributions

Gradient-based approximate inference methods, such as Stein variational gradient descent (SVGD), provide simple and general-purpose inference engines for differentiable continuous distributions. However, existing forms of SVGD cannot be…

Machine Learning · Computer Science 2020-03-03 Jun Han, Fan Ding, Xianglong Liu, Lorenzo Torresani, Jian Peng, Qiang Liu

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al.…

Machine Learning · Computer Science 2017-11-07 George Tucker, Andriy Mnih, Chris J. Maddison, Dieterich Lawson, Jascha Sohl-Dickstein

Stein Variational Rare Event Simulation

Rare event simulation and rare event probability estimation are important tasks within the analysis of systems subject to uncertainty and randomness. Simultaneously, accurately estimating rare event probabilities is an inherently difficult…

Methodology · Statistics 2024-07-18 Max Ehre, Iason Papaioannou, Daniel Straub

Gradient Estimators for Implicit Models

Implicit models, which allow for the generation of samples but not for point-wise evaluation of probabilities, are omnipresent in real-world problems tackled by machine learning and a hot topic of current research. Some examples include…

Machine Learning · Statistics 2018-04-27 Yingzhen Li, Richard E. Turner

Pathwise Gradient Variance Reduction with Control Variates in Variational Inference

Variational inference in Bayesian deep learning often involves computing the gradient of an expectation that lacks a closed-form solution. In these cases, pathwise and score-function gradient estimators are the most common approaches. The…

Machine Learning · Statistics 2024-10-10 Kenyon Ng, Susan Wei

A Spectral Approach to Gradient Estimation for Implicit Distributions

Recently there have been increasing interests in learning and inference with implicit distributions (i.e., distributions without tractable densities). To this end, we develop a gradient estimator for implicit distributions based on Stein's…

Machine Learning · Statistics 2018-06-11 Jiaxin Shi, Shengyang Sun, Jun Zhu

DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation

Computing gradients of an expectation with respect to the distributional parameters of a discrete distribution is a problem arising in many fields of science and engineering. Typically, this problem is tackled using Reinforce, which frames…

Machine Learning · Computer Science 2023-09-11 Pau Mulet Arabi, Alec Flowers, Lukas Mauch, Fabien Cardinaux

Action-dependent Control Variates for Policy Optimization via Stein's Identity

Policy gradient methods have achieved remarkable successes in solving challenging reinforcement learning problems. However, it still often suffers from the large variance issue on policy gradient estimation, which leads to poor sample…

Machine Learning · Statistics 2018-02-26 Hao Liu, Yihao Feng, Yi Mao, Dengyong Zhou, Jian Peng, Qiang Liu

Gradient Estimation Using Stochastic Computation Graphs

In a variety of problems originating in supervised, unsupervised, and reinforcement learning, the loss function is defined by an expectation over a collection of random variables, which might be part of a probabilistic model or the external…

Machine Learning · Computer Science 2016-01-06 John Schulman, Nicolas Heess, Theophane Weber, Pieter Abbeel

Gradient Estimation and Variance Reduction in Stochastic and Deterministic Models

It seems that in the current age, computers, computation, and data have an increasingly important role to play in scientific research and discovery. This is reflected in part by the rise of machine learning and artificial intelligence,…

Machine Learning · Computer Science 2024-05-15 Ronan Keane

Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation

We deal with the problem of gradient estimation for stochastic differentiable relaxations of algorithms, operators, simulators, and other non-differentiable functions. Stochastic smoothing conventionally perturbs the input of a…

Machine Learning · Computer Science 2024-10-11 Felix Petersen, Christian Borgelt, Aashwin Mishra, Stefano Ermon

Bayesian Deep Learning for Remaining Useful Life Estimation via Stein Variational Gradient Descent

A crucial task in predictive maintenance is estimating the remaining useful life of physical systems. In the last decade, deep learning has improved considerably upon traditional model-based and statistical approaches in terms of predictive…

Machine Learning · Computer Science 2024-02-05 Luca Della Libera, Jacopo Andreoli, Davide Dalle Pezze, Mirco Ravanelli, Gian Antonio Susto

Stein Variational Gradient Descent as Gradient Flow

Stein variational gradient descent (SVGD) is a deterministic sampling algorithm that iteratively transports a set of particles to approximate given distributions, based on an efficient gradient-based update that guarantees to optimally…

Machine Learning · Statistics 2017-11-15 Qiang Liu

Reducing Reparameterization Gradient Variance

Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparameterization gradients, or gradient estimates computed via the "reparameterization trick," represent a class of noisy gradients often used in…

Machine Learning · Statistics 2017-05-23 Andrew C. Miller, Nicholas J. Foti, Alexander D'Amour, Ryan P. Adams

On the geometry of Stein variational gradient descent

Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely…

Machine Learning · Statistics 2023-02-14 A. Duncan, N. Nuesken, L. Szpruch

Variance Reduction for Score Functions Using Optimal Baselines

Many problems involve the use of models which learn probability distributions or incorporate randomness in some way. In such problems, because computing the true expected gradient may be intractable, a gradient estimator is used to update…

Machine Learning · Computer Science 2022-12-29 Ronan Keane, H. Oliver Gao

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

Stochastic neurons and hard non-linearities can be useful for a number of reasons in deep learning models, but in many cases they pose a challenging problem: how to estimate the gradient of a loss function with respect to the input of such…

Machine Learning · Computer Science 2013-08-16 Yoshua Bengio, Nicholas Léonard, Aaron Courville

Stein Variational Evolution Strategies

Stein Variational Gradient Descent (SVGD) is a highly efficient method to sample from an unnormalized probability distribution. However, the SVGD update relies on gradients of the log-density, which may not always be available. Existing…

Machine Learning · Computer Science 2026-03-13 Cornelius V. Braun, Robert T. Lange, Marc Toussaint

A Rule for Gradient Estimator Selection, with an Application to Variational Inference

Stochastic gradient descent (SGD) is the workhorse of modern machine learning. Sometimes, there are many different potential gradient estimators that can be used. When so, choosing the one with the best tradeoff between cost and variance is…

Machine Learning · Computer Science 2020-10-23 Tomas Geffner, Justin Domke