Related papers: Generalized Transformation-based Gradient

The Generalized Reparameterization Gradient

The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to optimize the variational objective. However, this technique does not easily apply to commonly used distributions such as beta or gamma…

Machine Learning · Statistics 2016-10-20 Francisco J. R. Ruiz, Michalis K. Titsias, David M. Blei

Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Variational inference using the reparameterization trick has enabled large-scale approximate Bayesian inference in complex probabilistic models, leveraging stochastic optimization to sidestep intractable expectations. The reparameterization…

Machine Learning · Statistics 2020-02-13 Christian A. Naesseth, Francisco J. R. Ruiz, Scott W. Linderman, David M. Blei

Variance reduction properties of the reparameterization trick

The reparameterization trick is widely used in variational inference as it yields more accurate estimates of the gradient of the variational objective than alternative approaches such as the score function method. Although there is…

Machine Learning · Statistics 2018-12-31 Ming Xu, Matias Quiroz, Robert Kohn, Scott A. Sisson

Implicit Reparameterization Gradients

By providing a simple and efficient way of computing low-variance gradients of continuous random variables, the reparameterization trick has become the technique of choice for training a variety of latent variable models. However, it is not…

Machine Learning · Computer Science 2019-01-31 Michael Figurnov, Shakir Mohamed, Andriy Mnih

Reparameterization trick for discrete variables

Low-variance gradient estimation is crucial for learning directed graphical models parameterized by neural networks, where the reparameterization trick is widely used for those with continuous variables. While this technique gives…

Machine Learning · Statistics 2016-11-07 Seiya Tokui, Issei Sato

Use of model reparametrization to improve variational Bayes

We propose using model reparametrization to improve variational Bayes inference for hierarchical models whose variables can be classified as global (shared across observations) or local (observation specific). Posterior dependence between…

Methodology · Statistics 2021-01-28 Linda S. L. Tan

Approximation Based Variance Reduction for Reparameterization Gradients

Flexible variational distributions improve variational inference but are harder to optimize. In this work we present a control variate that is applicable for any reparameterizable distribution with known mean and covariance matrix, e.g.…

Machine Learning · Computer Science 2020-10-26 Tomas Geffner, Justin Domke

GO Gradient for Expectation-Based Objectives

Within many machine learning algorithms, a fundamental problem concerns efficient calculation of an unbiased gradient with respect to parameters $\boldsymbol{\gamma}$ for expectation-based objectives $\mathbb{E}_{q_{\boldsymbol{\gamma}}(\boldsymbol{y})}[f(\boldsymbol{y})]$. Most existing methods…

Machine Learning · Statistics 2019-01-21 Yulai Cong, Miaoyun Zhao, Ke Bai, Lawrence Carin

On the Generalization Gap in Reparameterizable Reinforcement Learning

Understanding generalization in reinforcement learning (RL) is a significant challenge, as many common assumptions of traditional supervised learning theory do not apply. We focus on the special class of reparameterizable RL problems, where…

Machine Learning · Computer Science 2019-05-31 Huan Wang, Stephan Zheng, Caiming Xiong, Richard Socher

Reparameterization Gradient for Non-differentiable Models

We present a new algorithm for stochastic variational inference that targets models with non-differentiable densities. One of the key challenges in stochastic variational inference is to come up with a low-variance estimator of the…

Machine Learning · Computer Science 2018-10-26 Wonyeol Lee, Hangyeol Yu, Hongseok Yang

Stochastic Backpropagation through Mixture Density Distributions

The ability to backpropagate stochastic gradients through continuous latent distributions has been crucial to the emergence of variational autoencoders and stochastic gradient variational Bayes. The key ingredient is an unbiased and…

Neural and Evolutionary Computing · Computer Science 2016-07-20 Alex Graves

Generalized Doubly Reparameterized Gradient Estimators

Efficient low-variance gradient estimation enabled by the reparameterization trick (RT) has been essential to the success of variational autoencoders. Doubly-reparameterized gradients (DReGs) improve on the RT for multi-sample variational…

Machine Learning · Statistics 2021-07-14 Matthias Bauer, Andriy Mnih

The Concrete Distribution: A Continuous Relaxation of Discrete Random Variables

The reparameterization trick enables optimizing large scale stochastic computation graphs via gradient descent. The essence of the trick is to refactor each stochastic node into a differentiable function of its parameters and a random…

Machine Learning · Computer Science 2017-03-07 Chris J. Maddison, Andriy Mnih, Yee Whye Teh

Learning Model Reparametrizations: Implicit Variational Inference by Fitting MCMC distributions

We introduce a new algorithm for approximate inference that combines reparametrization, Markov chain Monte Carlo and variational methods. We construct a very flexible implicit variational distribution synthesized by an arbitrary Markov…

Machine Learning · Statistics 2017-08-07 Michalis K. Titsias

Sticking the Landing: Simple, Lower-Variance Gradient Estimators for Variational Inference

We propose a simple and general variant of the standard reparameterized gradient estimator for the variational evidence lower bound. Specifically, we remove a part of the total derivative with respect to the variational parameters that…

Machine Learning · Statistics 2017-05-30 Geoffrey Roeder, Yuhuai Wu, David Duvenaud

Reparameterization through Spatial Gradient Scaling

Reparameterization aims to improve the generalization of deep neural networks by transforming convolutional layers into equivalent multi-branched structures during training. However, there exists a gap in understanding how…

Machine Learning · Computer Science 2023-03-08 Alexander Detkov, Mohammad Salameh, Muhammad Fetrat Qharabagh, Jialin Zhang, Wei Lui, Shangling Jui, Di Niu

Variational Dropout and the Local Reparameterization Trick

We investigate a local reparameterization technique for greatly reducing the variance of stochastic gradients for variational Bayesian inference (SGVB) of a posterior over model parameters, while retaining parallelizability. This local…

Machine Learning · Statistics 2015-12-22 Diederik P. Kingma, Tim Salimans, Max Welling

Understanding the Generalization Benefit of Model Invariance from a Data Perspective

Machine learning models that are developed with invariance to certain types of data transformations have demonstrated superior generalization performance in practice. However, the underlying mechanism that explains why invariance leads to…

Machine Learning · Computer Science 2023-02-24 Sicheng Zhu, Bang An, Furong Huang

Pathwise Derivatives Beyond the Reparameterization Trick

We observe that gradients computed via the reparameterization trick are in direct correspondence with solutions of the transport equation in the formalism of optimal transport. We use this perspective to compute (approximate) pathwise…

Machine Learning · Statistics 2018-07-06 Martin Jankowiak, Fritz Obermeyer

Provable Gradient Variance Guarantees for Black-Box Variational Inference

Recent variational inference methods use stochastic gradient estimators whose variance is not well understood. Theoretical guarantees for these estimators are important to understand when these methods will or will not work. This paper…

Machine Learning · Computer Science 2019-10-29 Justin Domke