Related papers: Reparameterization Gradients through Acceptance-Re…

Implicit Reparameterization Gradients

By providing a simple and efficient way of computing low-variance gradients of continuous random variables, the reparameterization trick has become the technique of choice for training a variety of latent variable models. However, it is not…

Machine Learning · Computer Science 2019-01-31 Michael Figurnov , Shakir Mohamed , Andriy Mnih

Reparameterization trick for discrete variables

Low-variance gradient estimation is crucial for learning directed graphical models parameterized by neural networks, where the reparameterization trick is widely used for those with continuous variables. While this technique gives…

Machine Learning · Statistics 2016-11-07 Seiya Tokui , Issei sato

Reparameterization Gradient for Non-differentiable Models

We present a new algorithm for stochastic variational inference that targets at models with non-differentiable densities. One of the key challenges in stochastic variational inference is to come up with a low-variance estimator of the…

Machine Learning · Computer Science 2018-10-26 Wonyeol Lee , Hangyeol Yu , Hongseok Yang

Variance reduction properties of the reparameterization trick

The reparameterization trick is widely used in variational inference as it yields more accurate estimates of the gradient of the variational objective than alternative approaches such as the score function method. Although there is…

Machine Learning · Statistics 2018-12-31 Ming Xu , Matias Quiroz , Robert Kohn , Scott A. Sisson

The Generalized Reparameterization Gradient

The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to optimize the variational objective. However, this technique does not easily apply to commonly used distributions such as beta or gamma…

Machine Learning · Statistics 2016-10-20 Francisco J. R. Ruiz , Michalis K. Titsias , David M. Blei

Stochastic Backpropagation through Mixture Density Distributions

The ability to backpropagate stochastic gradients through continuous latent distributions has been crucial to the emergence of variational autoencoders and stochastic gradient variational Bayes. The key ingredient is an unbiased and…

Neural and Evolutionary Computing · Computer Science 2016-07-20 Alex Graves

Generalized Transformation-based Gradient

The reparameterization trick has become one of the most useful tools in the field of variational inference. However, the reparameterization trick is based on the standardization transformation which restricts the scope of application of…

Machine Learning · Computer Science 2020-02-26 Anbang Wu , Shuangxi Chen , Chunming Wu

Fast and Correct Gradient-Based Optimisation for Probabilistic Programming via Smoothing

We study the foundations of variational inference, which frames posterior inference as an optimisation problem, for probabilistic programming. The dominant approach for optimisation in practice is stochastic gradient descent. In particular,…

Programming Languages · Computer Science 2023-01-10 Basim Khajwal , C. -H. Luke Ong , Dominik Wagner

A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme

Reparameterization (RP) and likelihood ratio (LR) gradient estimators are used throughout machine and reinforcement learning; however, they are usually explained as simple mathematical tricks without providing any insight into their nature.…

Machine Learning · Computer Science 2019-10-16 Paavo Parmas , Masashi Sugiyama

An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients

Reinforcement learning methods for robotics are increasingly successful due to the constant development of better policy gradient techniques. A precise (low variance) and accurate (low bias) gradient estimator is crucial to face…

Machine Learning · Computer Science 2021-07-21 João Carvalho , Davide Tateo , Fabio Muratore , Jan Peters

An Analysis of Measure-Valued Derivatives for Policy Gradients

Reinforcement learning methods for robotics are increasingly successful due to the constant development of better policy gradient techniques. A precise (low variance) and accurate (low bias) gradient estimator is crucial to face…

Machine Learning · Computer Science 2022-03-09 Joao Carvalho , Jan Peters

Generalized Doubly Reparameterized Gradient Estimators

Efficient low-variance gradient estimation enabled by the reparameterization trick (RT) has been essential to the success of variational autoencoders. Doubly-reparameterized gradients (DReGs) improve on the RT for multi-sample variational…

Machine Learning · Statistics 2021-07-14 Matthias Bauer , Andriy Mnih

GO Gradient for Expectation-Based Objectives

Within many machine learning algorithms, a fundamental problem concerns efficient calculation of an unbiased gradient wrt parameters $\gammav$ for expectation-based objectives $\Ebb_{q_{\gammav} (\yv)} [f(\yv)]$. Most existing methods…

Machine Learning · Statistics 2019-01-21 Yulai Cong , Miaoyun Zhao , Ke Bai , Lawrence Carin

The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

The reparameterization trick enables optimizing large scale stochastic computation graphs via gradient descent. The essence of the trick is to refactor each stochastic node into a differentiable function of its parameters and a random…

Machine Learning · Computer Science 2017-03-07 Chris J. Maddison , Andriy Mnih , Yee Whye Teh

Use of model reparametrization to improve variational Bayes

We propose using model reparametrization to improve variational Bayes inference for hierarchical models whose variables can be classified as global (shared across observations) or local (observation specific). Posterior dependence between…

Methodology · Statistics 2021-01-28 Linda S. L. Tan

Pathwise Derivatives Beyond the Reparameterization Trick

We observe that gradients computed via the reparameterization trick are in direct correspondence with solutions of the transport equation in the formalism of optimal transport. We use this perspective to compute (approximate) pathwise…

Machine Learning · Statistics 2018-07-06 Martin Jankowiak , Fritz Obermeyer

Reducing Reparameterization Gradient Variance

Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparameterization gradients, or gradient estimates computed via the "reparameterization trick," represent a class of noisy gradients often used in…

Machine Learning · Statistics 2017-05-23 Andrew C. Miller , Nicholas J. Foti , Alexander D'Amour , Ryan P. Adams

Importance Sampled Stochastic Optimization for Variational Inference

Variational inference approximates the posterior distribution of a probabilistic model with a parameterized density by maximizing a lower bound for the model evidence. Modern solutions fit a flexible approximation with stochastic gradient…

Machine Learning · Statistics 2017-07-13 Joseph Sakaya , Arto Klami

New Tricks for Estimating Gradients of Expectations

We introduce a family of pairwise stochastic gradient estimators for gradients of expectations, which are related to the log-derivative trick, but involve pairwise interactions between samples. The simplest example of our new estimator,…

Machine Learning · Computer Science 2022-04-21 Christian J. Walder , Paul Roussel , Richard Nock , Cheng Soon Ong , Masashi Sugiyama

Reparameterizable Subset Sampling via Continuous Relaxations

Many machine learning tasks require sampling a subset of items from a collection based on a parameterized distribution. The Gumbel-softmax trick can be used to sample a single item, and allows for low-variance reparameterized gradients with…

Machine Learning · Computer Science 2021-03-02 Sang Michael Xie , Stefano Ermon