Related papers: Wasserstein Gradient Boosting: A Framework for Dis…

Proximal boosting: aggregating weak learners to minimize non-differentiable losses

Gradient boosting is a prediction method that iteratively combines weak learners to produce a complex and accurate model. From an optimization point of view, the learning procedure of gradient boosting mimics a gradient descent on a…

Machine Learning · Computer Science 2022-11-30 Erwan Fouillen , Claire Boyer , Maxime Sangnier

Unregularized limit of stochastic gradient method for Wasserstein distributionally robust optimization

Wasserstein distributionally robust optimization offers a framework for model fitting in machine learning under potential shifts in the data distribution. We study a regularized variant of this problem in which entropic smoothing produces a…

Optimization and Control · Mathematics 2026-05-28 Tam Le

Wasserstein Generative Learning of Conditional Distribution

Conditional distribution is a fundamental quantity for describing the relationship between a response and a predictor. We propose a Wasserstein generative approach to learning a conditional distribution. The proposed approach uses a…

Machine Learning · Computer Science 2021-12-21 Shiao Liu , Xingyu Zhou , Yuling Jiao , Jian Huang

Boost Like a (Var)Pro: Trust-Region Gradient Boosting via Variable Projection

Gradient boosting, a method of building additive ensembles from weak learners, has established itself as a practical and theoretically-motivated approach to approximate functions, especially using decision tree weak learners. Comparable…

Machine Learning · Computer Science 2026-03-26 Abhijit Chowdhary , Elizabeth Newman , Deepanshu Verma

Wasserstein Regression as a Variational Approximation of Probabilistic Trajectories through the Bernstein Basis

This paper considers the problem of regression over distributions, which is becoming increasingly important in machine learning. Existing approaches often ignore the geometry of the probability space or are computationally expensive. To…

Machine Learning · Computer Science 2025-10-31 Maksim Maslov , Alexander Kugaevskikh , Matthew Ivanov

Gradient Boosting for Linear Mixed Models

Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current…

Methodology · Statistics 2020-11-03 Colin Griesbach , Benjamin Säfken , Elisabeth Waldmann

Gradient Boosting on Stochastic Data Streams

Boosting is a popular ensemble algorithm that generates more powerful learners by linearly combining base models from a simpler hypothesis class. In this work, we investigate the problem of adapting batch gradient boosting for minimizing…

Machine Learning · Computer Science 2017-03-02 Hanzhang Hu , Wen Sun , Arun Venkatraman , Martial Hebert , J. Andrew Bagnell

Wasserstein Transfer Learning

Transfer learning is a powerful paradigm for leveraging knowledge from source domains to enhance learning in a target domain. However, traditional transfer learning approaches often focus on scalar or multivariate data within Euclidean…

Machine Learning · Computer Science 2025-10-24 Kaicheng Zhang , Sinian Zhang , Doudou Zhou , Yidong Zhou

Sequential Training of Neural Networks with Gradient Boosting

This paper presents a novel technique based on gradient boosting to train the final layers of a neural network (NN). Gradient boosting is an additive expansion algorithm in which a series of models are trained sequentially to approximate a…

Machine Learning · Computer Science 2023-05-05 Seyedsaman Emami , Gonzalo Martínez-Muñoz

A Wasserstein Minimum Velocity Approach to Learning Unnormalized Models

Score matching provides an effective approach to learning flexible unnormalized models, but its scalability is limited by the need to evaluate a second-order derivative. In this paper, we present a scalable approximation to a general family…

Machine Learning · Statistics 2020-02-19 Ziyu Wang , Shuyu Cheng , Yueru Li , Jun Zhu , Bo Zhang

Wasserstein Generative Regression

In this paper, we propose a new and unified approach for nonparametric regression and conditional distribution learning. Our approach simultaneously estimates a regression function and a conditional generator using a generative learning…

Machine Learning · Statistics 2023-06-28 Shanshan Song , Tong Wang , Guohao Shen , Yuanyuan Lin , Jian Huang

Wasserstein Dropout

Despite of its importance for safe machine learning, uncertainty quantification for neural networks is far from being solved. State-of-the-art approaches to estimate neural uncertainties are often hybrid, combining parametric models with…

Machine Learning · Computer Science 2021-12-03 Joachim Sicking , Maram Akila , Maximilian Pintz , Tim Wirtz , Asja Fischer , Stefan Wrobel

Soft Gradient Boosting with Learnable Feature Transforms for Sequential Regression

We propose a soft gradient boosting framework for sequential regression that embeds a learnable linear feature transform within the boosting procedure. At each boosting iteration, we train a soft decision tree and learn a linear input…

Machine Learning · Computer Science 2025-09-17 Huseyin Karaca , Suleyman Serdar Kozat

Wasserstein Distributionally Robust Optimization: Theory and Applications in Machine Learning

Many decision problems in science, engineering and economics are affected by uncertain parameters whose distribution is only indirectly observable through samples. The goal of data-driven decision-making is to learn a decision from finitely…

Machine Learning · Statistics 2024-11-05 Daniel Kuhn , Peyman Mohajerin Esfahani , Viet Anh Nguyen , Soroosh Shafieezadeh-Abadeh

Learning with Differentially Private (Sliced) Wasserstein Gradients

In this work, we introduce a novel framework for privately optimizing objectives that rely on Wasserstein distances between data-dependent empirical measures. Our main theoretical contribution is, based on an explicit formulation of the…

Machine Learning · Computer Science 2025-05-22 David Rodríguez-Vítores , Clément Lalanne , Jean-Michel Loubes

Fast Distributionally Robust Learning with Variance Reduced Min-Max Optimization

Distributionally robust supervised learning (DRSL) is emerging as a key paradigm for building reliable machine learning systems for real-world applications -- reflecting the need for classifiers and predictive models that are robust to the…

Machine Learning · Computer Science 2022-01-26 Yaodong Yu , Tianyi Lin , Eric Mazumdar , Michael I. Jordan

Wasserstein Variational Inference

This paper introduces Wasserstein variational inference, a new form of approximate Bayesian inference based on optimal transport theory. Wasserstein variational inference uses a new family of divergences that includes both f-divergences and…

Machine Learning · Statistics 2018-06-05 Luca Ambrogioni , Umut Güçlü , Yağmur Güçlütürk , Max Hinne , Eric Maris , Marcel A. J. van Gerven

Bridging the Gap Between Variational Inference and Wasserstein Gradient Flows

Variational inference is a technique that approximates a target distribution by optimizing within the parameter space of variational families. On the other hand, Wasserstein gradient flows describe optimization within the space of…

Machine Learning · Statistics 2023-11-01 Mingxuan Yi , Song Liu

Gradient tree boosting with random output projections for multi-label classification and multi-output regression

In many applications of supervised learning, multiple classification or regression outputs have to be predicted jointly. We consider several extensions of gradient boosting to address such problems. We first propose a straightforward…

Machine Learning · Statistics 2019-05-21 Arnaud Joly , Louis Wehenkel , Pierre Geurts

Generalized Boosting Algorithms for Convex Optimization

Boosting is a popular way to derive powerful learners from simpler hypothesis classes. Following previous work (Mason et al., 1999; Friedman, 2000) on general boosting frameworks, we analyze gradient-based descent algorithms for boosting…

Machine Learning · Computer Science 2012-02-15 Alexander Grubb , J. Andrew Bagnell