Related papers: Implicit Reparameterization Gradients

Reparameterization Gradients through Acceptance-Rejection Sampling Algorithms

Variational inference using the reparameterization trick has enabled large-scale approximate Bayesian inference in complex probabilistic models, leveraging stochastic optimization to sidestep intractable expectations. The reparameterization…

Machine Learning · Statistics 2020-02-13 Christian A. Naesseth , Francisco J. R. Ruiz , Scott W. Linderman , David M. Blei

Reparameterization trick for discrete variables

Low-variance gradient estimation is crucial for learning directed graphical models parameterized by neural networks, where the reparameterization trick is widely used for those with continuous variables. While this technique gives…

Machine Learning · Statistics 2016-11-07 Seiya Tokui , Issei sato

The Generalized Reparameterization Gradient

The reparameterization gradient has become a widely used method to obtain Monte Carlo gradients to optimize the variational objective. However, this technique does not easily apply to commonly used distributions such as beta or gamma…

Machine Learning · Statistics 2016-10-20 Francisco J. R. Ruiz , Michalis K. Titsias , David M. Blei

Reparameterization Gradient for Non-differentiable Models

We present a new algorithm for stochastic variational inference that targets at models with non-differentiable densities. One of the key challenges in stochastic variational inference is to come up with a low-variance estimator of the…

Machine Learning · Computer Science 2018-10-26 Wonyeol Lee , Hangyeol Yu , Hongseok Yang

Implicit Regularization for Group Sparsity

We study the implicit regularization of gradient descent towards structured sparsity via a novel neural reparameterization, which we call a diagonally grouped linear neural network. We show the following intriguing property of our…

Machine Learning · Statistics 2023-01-31 Jiangyuan Li , Thanh V. Nguyen , Chinmay Hegde , Raymond K. W. Wong

Pathwise Derivatives Beyond the Reparameterization Trick

We observe that gradients computed via the reparameterization trick are in direct correspondence with solutions of the transport equation in the formalism of optimal transport. We use this perspective to compute (approximate) pathwise…

Machine Learning · Statistics 2018-07-06 Martin Jankowiak , Fritz Obermeyer

Variance reduction properties of the reparameterization trick

The reparameterization trick is widely used in variational inference as it yields more accurate estimates of the gradient of the variational objective than alternative approaches such as the score function method. Although there is…

Machine Learning · Statistics 2018-12-31 Ming Xu , Matias Quiroz , Robert Kohn , Scott A. Sisson

Reducing Reparameterization Gradient Variance

Optimization with noisy gradients has become ubiquitous in statistics and machine learning. Reparameterization gradients, or gradient estimates computed via the "reparameterization trick," represent a class of noisy gradients often used in…

Machine Learning · Statistics 2017-05-23 Andrew C. Miller , Nicholas J. Foti , Alexander D'Amour , Ryan P. Adams

Learning Model Reparametrizations: Implicit Variational Inference by Fitting MCMC distributions

We introduce a new algorithm for approximate inference that combines reparametrization, Markov chain Monte Carlo and variational methods. We construct a very flexible implicit variational distribution synthesized by an arbitrary Markov…

Machine Learning · Statistics 2017-08-07 Michalis K. Titsias

Stochastic Backpropagation through Mixture Density Distributions

The ability to backpropagate stochastic gradients through continuous latent distributions has been crucial to the emergence of variational autoencoders and stochastic gradient variational Bayes. The key ingredient is an unbiased and…

Neural and Evolutionary Computing · Computer Science 2016-07-20 Alex Graves

Categorical Reparameterization with Denoising Diffusion models

Learning models with categorical variables requires optimizing expectations over discrete distributions, a setting in which stochastic gradient-based optimization is challenging due to the non-differentiability of categorical sampling. A…

Machine Learning · Computer Science 2026-02-10 Samson Gourevitch , Alain Durmus , Eric Moulines , Jimmy Olsson , Yazid Janati

Generalized Doubly Reparameterized Gradient Estimators

Efficient low-variance gradient estimation enabled by the reparameterization trick (RT) has been essential to the success of variational autoencoders. Doubly-reparameterized gradients (DReGs) improve on the RT for multi-sample variational…

Machine Learning · Statistics 2021-07-14 Matthias Bauer , Andriy Mnih

Gradient Estimators for Implicit Models

Implicit models, which allow for the generation of samples but not for point-wise evaluation of probabilities, are omnipresent in real-world problems tackled by machine learning and a hot topic of current research. Some examples include…

Machine Learning · Statistics 2018-04-27 Yingzhen Li , Richard E. Turner

Generalized Transformation-based Gradient

The reparameterization trick has become one of the most useful tools in the field of variational inference. However, the reparameterization trick is based on the standardization transformation which restricts the scope of application of…

Machine Learning · Computer Science 2020-02-26 Anbang Wu , Shuangxi Chen , Chunming Wu

Implicit Gradient Regularization

Gradient descent can be surprisingly good at optimizing deep neural networks without overfitting and without explicit regularization. We find that the discrete steps of gradient descent implicitly regularize models by penalizing gradient…

Machine Learning · Computer Science 2022-07-20 David G. T. Barrett , Benoit Dherin

On Training Implicit Models

This paper focuses on training implicit models of infinite layers. Specifically, previous works employ implicit differentiation and solve the exact gradient for the backward propagation. However, is it necessary to compute such an exact but…

Machine Learning · Computer Science 2022-01-13 Zhengyang Geng , Xin-Yu Zhang , Shaojie Bai , Yisen Wang , Zhouchen Lin

GO Gradient for Expectation-Based Objectives

Within many machine learning algorithms, a fundamental problem concerns efficient calculation of an unbiased gradient wrt parameters $\gammav$ for expectation-based objectives $\Ebb_{q_{\gammav} (\yv)} [f(\yv)]$. Most existing methods…

Machine Learning · Statistics 2019-01-21 Yulai Cong , Miaoyun Zhao , Ke Bai , Lawrence Carin

An Empirical Analysis of Measure-Valued Derivatives for Policy Gradients

Reinforcement learning methods for robotics are increasingly successful due to the constant development of better policy gradient techniques. A precise (low variance) and accurate (low bias) gradient estimator is crucial to face…

Machine Learning · Computer Science 2021-07-21 João Carvalho , Davide Tateo , Fabio Muratore , Jan Peters

Generalized Gumbel-Softmax Gradient Estimator for Generic Discrete Random Variables

Estimating the gradients of stochastic nodes in stochastic computational graphs is one of the crucial research questions in the deep generative modeling community, which enables the gradient descent optimization on neural network…

Machine Learning · Computer Science 2023-02-23 Weonyoung Joo , Dongjun Kim , Seungjae Shin , Il-Chul Moon

A Spectral Approach to Gradient Estimation for Implicit Distributions

Recently there have been increasing interests in learning and inference with implicit distributions (i.e., distributions without tractable densities). To this end, we develop a gradient estimator for implicit distributions based on Stein's…

Machine Learning · Statistics 2018-06-11 Jiaxin Shi , Shengyang Sun , Jun Zhu