Related papers: Kernelized Wasserstein Natural Gradient

Stochastic Optimization for Regularized Wasserstein Estimators

Optimal transport is a foundational problem in optimization, that allows to compare probability distributions while taking into account geometric aspects. Its optimal objective value, the Wasserstein distance, provides an important loss…

Machine Learning · Computer Science 2020-02-21 Marin Ballu , Quentin Berthet , Francis Bach

Natural gradient via optimal transport

We study a natural Wasserstein gradient flow on manifolds of probability distributions with discrete sample spaces. We derive the Riemannian structure for the probability simplex from the dynamical formulation of the Wasserstein distance on…

Optimization and Control · Mathematics 2021-04-19 Wuchen Li , Guido Montufar

Efficient Natural Gradient Descent Methods for Large-Scale PDE-Based Optimization Problems

We propose efficient numerical schemes for implementing the natural gradient descent (NGD) for a broad range of metric spaces with applications to PDE-based optimization problems. Our technique represents the natural gradient direction as a…

Optimization and Control · Mathematics 2023-01-12 Levon Nurbekyan , Wanzhou Lei , Yunan Yang

Wasserstein Proximal of GANs

We introduce a new method for training generative adversarial networks by applying the Wasserstein-2 metric proximal on the generators. The approach is based on Wasserstein information geometry. It defines a parametrization invariant…

Machine Learning · Computer Science 2021-02-16 Alex Tong Lin , Wuchen Li , Stanley Osher , Guido Montufar

Optimal transport natural gradient for statistical manifolds with continuous sample space

We study the Wasserstein natural gradient in parametric statistical models with continuous sample spaces. Our approach is to pull back the $L^2$-Wasserstein metric tensor in the probability density space to a parameter space, equipping the…

Optimization and Control · Mathematics 2024-08-20 Yifan Chen , Wuchen Li

Optimal Neural Network Approximation of Wasserstein Gradient Direction via Convex Optimization

The computation of Wasserstein gradient direction is essential for posterior sampling problems and scientific computing. The approximation of the Wasserstein gradient with finite samples requires solving a variational problem. We study the…

Machine Learning · Computer Science 2022-05-27 Yifei Wang , Peng Chen , Mert Pilanci , Wuchen Li

Unregularized limit of stochastic gradient method for Wasserstein distributionally robust optimization

Wasserstein distributionally robust optimization offers a framework for model fitting in machine learning under potential shifts in the data distribution. We study a regularized variant of this problem in which entropic smoothing produces a…

Optimization and Control · Mathematics 2026-05-28 Tam Le

On the geometry of Stein variational gradient descent

Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely…

Machine Learning · Statistics 2023-02-14 A. Duncan , N. Nuesken , L. Szpruch

Optimising Distributions with Natural Gradient Surrogates

Natural gradient methods have been used to optimise the parameters of probability distributions in a variety of settings, often resulting in fast-converging procedures. Unfortunately, for many distributions of interest, computing the…

Machine Learning · Statistics 2024-05-28 Jonathan So , Richard E. Turner

Wasserstein Gradient Flow over Variational Parameter Space for Variational Inference

Variational inference (VI) can be cast as an optimization problem in which the variational parameters are tuned to closely align a variational distribution with the true posterior. The optimization task can be approached through vanilla…

Machine Learning · Computer Science 2025-04-24 Dai Hai Nguyen , Tetsuya Sakurai , Hiroshi Mamitsuka

Quantum statistical learning via Quantum Wasserstein natural gradient

In this article, we introduce a new approach towards the statistical learning problem $\operatorname{argmin}_{\rho(\theta) \in \mathcal P_{\theta}} W_{Q}^2 (\rho_{\star},\rho(\theta))$ to approximate a target quantum state $\rho_{\star}$ by…

Mathematical Physics · Physics 2021-02-03 Simon Becker , Wuchen Li

Application of gradient descent algorithms based on geodesic distances

In this paper, the Riemannian gradient algorithm and the natural gradient algorithm are applied to solve descent direction problems on the manifold of positive definite Hermitian matrices, where the geodesic distance is considered as the…

Optimization and Control · Mathematics 2021-06-01 Xiaomin Duan , Huafei Sun , Linyu Peng

Efficient Wasserstein Natural Gradients for Reinforcement Learning

A novel optimization approach is proposed for application to policy gradient methods and evolution strategies for reinforcement learning (RL). The procedure uses a computationally efficient Wasserstein natural gradient (WNG) descent that…

Machine Learning · Computer Science 2021-03-19 Ted Moskovitz , Michael Arbel , Ferenc Huszar , Arthur Gretton

Fast Sinkhorn I: An O(N) algorithm for the Wasserstein-1 metric

The Wasserstein metric is broadly used in optimal transport for comparing two probabilistic distributions, with successful applications in various fields such as machine learning, signal processing, seismic inversion, etc. Nevertheless, the…

Optimization and Control · Mathematics 2022-02-22 Qichen Liao , Jing Chen , Zihao Wang , Bo Bai , Shi Jin , Hao Wu

Fast yet Simple Natural-Gradient Descent for Variational Inference in Complex Models

Bayesian inference plays an important role in advancing machine learning, but faces computational challenges when applied to complex models such as deep neural networks. Variational inference circumvents these challenges by formulating…

Machine Learning · Statistics 2018-08-03 Mohammad Emtiyaz Khan , Didrik Nielsen

Learning Wasserstein Embeddings

The Wasserstein distance received a lot of attention recently in the community of machine learning, especially for its principled way of comparing distributions. It has found numerous applications in several hard problems, such as domain…

Machine Learning · Statistics 2017-10-23 Nicolas Courty , Rémi Flamary , Mélanie Ducoffe

Multiple Wasserstein Gradient Descent Algorithm for Multi-Objective Distributional Optimization

We address the optimization problem of simultaneously minimizing multiple objective functionals over a family of probability distributions. This type of Multi-Objective Distributional Optimization commonly arises in machine learning and…

Machine Learning · Computer Science 2025-05-27 Dai Hai Nguyen , Hiroshi Mamitsuka , Atsuyoshi Nakamura

A kernel method for the learning of Wasserstein geometric flows

Wasserstein gradient and Hamiltonian flows have emerged as essential tools for modeling complex dynamics in the natural sciences, with applications ranging from partial differential equations (PDEs) and optimal transport to quantum…

Numerical Analysis · Mathematics 2025-11-11 Jianyu Hu , Juan-Pablo Ortega , Daiying Yin

Kernel Wasserstein Distance

The Wasserstein distance is a powerful metric based on the theory of optimal transport. It gives a natural measure of the distance between two distributions with a wide range of applications. In contrast to a number of the common…

Machine Learning · Computer Science 2021-02-16 Jung Hun Oh , Maryam Pouryahya , Aditi Iyer , Aditya P. Apte , Allen Tannenbaum , Joseph O. Deasy

The Cramer Distance as a Solution to Biased Wasserstein Gradients

The Wasserstein probability metric has received much attention from the machine learning community. Unlike the Kullback-Leibler divergence, which strictly measures change in probability, the Wasserstein metric reflects the underlying…

Machine Learning · Computer Science 2017-06-01 Marc G. Bellemare , Ivo Danihelka , Will Dabney , Shakir Mohamed , Balaji Lakshminarayanan , Stephan Hoyer , Rémi Munos