English
Related papers

Related papers: Gradient Estimation with Discrete Stein Operators

200 papers

Stochastic gradient-based optimisation for discrete latent variable models is challenging due to the high variance of gradients. We introduce a variance reduction technique for score function estimators that makes use of double control…

Machine Learning · Statistics 2022-06-07 Michalis K. Titsias , Jiaxin Shi

Gradient-based approximate inference methods, such as Stein variational gradient descent (SVGD), provide simple and general-purpose inference engines for differentiable continuous distributions. However, existing forms of SVGD cannot be…

Machine Learning · Computer Science 2020-03-03 Jun Han , Fan Ding , Xianglong Liu , Lorenzo Torresani , Jian Peng , Qiang Liu

Learning in models with discrete latent variables is challenging due to high variance gradient estimators. Generally, approaches have relied on control variates to reduce the variance of the REINFORCE estimator. Recent work (Jang et al.…

Machine Learning · Computer Science 2017-11-07 George Tucker , Andriy Mnih , Chris J. Maddison , Dieterich Lawson , Jascha Sohl-Dickstein

Rare event simulation and rare event probability estimation are important tasks within the analysis of systems subject to uncertainty and randomness. Simultaneously, accurately estimating rare event probabilities is an inherently difficult…

Methodology · Statistics 2024-07-18 Max Ehre , Iason Papaioannou , Daniel Straub

Implicit models, which allow for the generation of samples but not for point-wise evaluation of probabilities, are omnipresent in real-world problems tackled by machine learning and a hot topic of current research. Some examples include…

Machine Learning · Statistics 2018-04-27 Yingzhen Li , Richard E. Turner

Variational inference in Bayesian deep learning often involves computing the gradient of an expectation that lacks a closed-form solution. In these cases, pathwise and score-function gradient estimators are the most common approaches. The…

Machine Learning · Statistics 2024-10-10 Kenyon Ng , Susan Wei

Recently there have been increasing interests in learning and inference with implicit distributions (i.e., distributions without tractable densities). To this end, we develop a gradient estimator for implicit distributions based on Stein's…

Machine Learning · Statistics 2018-06-11 Jiaxin Shi , Shengyang Sun , Jun Zhu

Computing gradients of an expectation with respect to the distributional parameters of a discrete distribution is a problem arising in many fields of science and engineering. Typically, this problem is tackled using Reinforce, which frames…

Machine Learning · Computer Science 2023-09-11 Pau Mulet Arabi , Alec Flowers , Lukas Mauch , Fabien Cardinaux

Policy gradient methods have achieved remarkable successes in solving challenging reinforcement learning problems. However, it still often suffers from the large variance issue on policy gradient estimation, which leads to poor sample…

Machine Learning · Statistics 2018-02-26 Hao Liu , Yihao Feng , Yi Mao , Dengyong Zhou , Jian Peng , Qiang Liu

In a variety of problems originating in supervised, unsupervised, and reinforcement learning, the loss function is defined by an expectation over a collection of random variables, which might be part of a probabilistic model or the external…

Machine Learning · Computer Science 2016-01-06 John Schulman , Nicolas Heess , Theophane Weber , Pieter Abbeel

It seems that in the current age, computers, computation, and data have an increasingly important role to play in scientific research and discovery. This is reflected in part by the rise of machine learning and artificial intelligence,…

Machine Learning · Computer Science 2024-05-15 Ronan Keane

We deal with the problem of gradient estimation for stochastic differentiable relaxations of algorithms, operators, simulators, and other non-differentiable functions. Stochastic smoothing conventionally perturbs the input of a…

Machine Learning · Computer Science 2024-10-11 Felix Petersen , Christian Borgelt , Aashwin Mishra , Stefano Ermon

A crucial task in predictive maintenance is estimating the remaining useful life of physical systems. In the last decade, deep learning has improved considerably upon traditional model-based and statistical approaches in terms of predictive…

Machine Learning · Computer Science 2024-02-05 Luca Della Libera , Jacopo Andreoli , Davide Dalle Pezze , Mirco Ravanelli , Gian Antonio Susto

Stein variational gradient descent (SVGD) is a deterministic sampling algorithm that iteratively transports a set of particles to approximate given distributions, based on an efficient gradient-based update that guarantees to optimally…

Machine Learning · Statistics 2017-11-15 Qiang Liu

Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparameterization gradients, or gradient estimates computed via the "reparameterization trick," represent a class of noisy gradients often used in…

Machine Learning · Statistics 2017-05-23 Andrew C. Miller , Nicholas J. Foti , Alexander D'Amour , Ryan P. Adams

Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely…

Machine Learning · Statistics 2023-02-14 A. Duncan , N. Nuesken , L. Szpruch

Many problems involve the use of models which learn probability distributions or incorporate randomness in some way. In such problems, because computing the true expected gradient may be intractable, a gradient estimator is used to update…

Machine Learning · Computer Science 2022-12-29 Ronan Keane , H. Oliver Gao

Stochastic neurons and hard non-linearities can be useful for a number of reasons in deep learning models, but in many cases they pose a challenging problem: how to estimate the gradient of a loss function with respect to the input of such…

Machine Learning · Computer Science 2013-08-16 Yoshua Bengio , Nicholas Léonard , Aaron Courville

Stein Variational Gradient Descent (SVGD) is a highly efficient method to sample from an unnormalized probability distribution. However, the SVGD update relies on gradients of the log-density, which may not always be available. Existing…

Machine Learning · Computer Science 2026-03-13 Cornelius V. Braun , Robert T. Lange , Marc Toussaint

Stochastic gradient descent (SGD) is the workhorse of modern machine learning. Sometimes, there are many different potential gradient estimators that can be used. When so, choosing the one with the best tradeoff between cost and variance is…

Machine Learning · Computer Science 2020-10-23 Tomas Geffner , Justin Domke
‹ Prev 1 2 3 10 Next ›