Related papers: Robust SGLD algorithm for solving non-convex distr…

Non-Convex Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

Stochastic Gradient Langevin Dynamics (SGLD) is a powerful algorithm for optimizing a non-convex objective, where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates towards a global…

Optimization and Control · Mathematics 2020-06-04 Yuanhan Hu , Xiaoyu Wang , Xuefeng Gao , Mert Gurbuzbalaban , Lingjiong Zhu

Nested Stochastic Algorithm for Generalized Sinkhorn distance-Regularized Distributionally Robust Optimization

Distributionally robust optimization (DRO) is a powerful technique to train robust models against data distribution shift. This paper aims to solve regularized nonconvex DRO problems, where the uncertainty set is modeled by a so-called…

Optimization and Control · Mathematics 2025-06-30 Yufeng Yang , Yi Zhou , Zhaosong Lu

Stochastic Constrained DRO with a Complexity Independent of Sample Size

Distributionally Robust Optimization (DRO), as a popular method to train robust models against distribution shift between training and test sets, has received tremendous attention in recent years. In this paper, we propose and analyze…

Machine Learning · Computer Science 2023-08-17 Qi Qi , Jiameng Lyu , Kung sik Chan , Er Wei Bai , Tianbao Yang

Large-Scale Non-convex Stochastic Constrained Distributionally Robust Optimization

Distributionally robust optimization (DRO) is a powerful framework for training robust models against data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing…

Machine Learning · Statistics 2024-04-02 Qi Zhang , Yi Zhou , Ashley Prater-Bennette , Lixin Shen , Shaofeng Zou

A Hitting Time Analysis of Stochastic Gradient Langevin Dynamics

We study the Stochastic Gradient Langevin Dynamics (SGLD) algorithm for non-convex optimization. The algorithm performs stochastic gradient descent, where in each step it injects appropriately scaled Gaussian noise to the update. We analyze…

Machine Learning · Computer Science 2018-04-10 Yuchen Zhang , Percy Liang , Moses Charikar

Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis

Distributionally robust optimization (DRO) is a widely-used approach to learn models that are robust against distribution shift. Compared with the standard optimization setting, the objective function in DRO is more difficult to optimize,…

Machine Learning · Computer Science 2021-10-27 Jikai Jin , Bohang Zhang , Haiyang Wang , Liwei Wang

Stochastic Approximation Approaches to Group Distributionally Robust Optimization and Beyond

This paper investigates group distributionally robust optimization (GDRO) with the goal of learning a model that performs well over $m$ different distributions. First, we formulate GDRO as a stochastic convex-concave saddle-point problem,…

Machine Learning · Computer Science 2024-11-21 Lijun Zhang , Haomin Bai , Peng Zhao , Tianbao Yang , Zhi-Hua Zhou

A fully data-driven approach to minimizing CVaR for portfolio of assets via SGLD with discontinuous updating

A new approach in stochastic optimization via the use of stochastic gradient Langevin dynamics (SGLD) algorithms, which is a variant of stochastic gradient decent (SGD) methods, allows us to efficiently approximate global minimizers of…

Portfolio Management · Quantitative Finance 2020-07-06 Sotirios Sabanis , Ying Zhang

Non-convex learning via Stochastic Gradient Langevin Dynamics: a nonasymptotic analysis

Stochastic Gradient Langevin Dynamics (SGLD) is a popular variant of Stochastic Gradient Descent, where properly scaled isotropic Gaussian noise is added to an unbiased estimate of the gradient at each iteration. This modest change allows…

Machine Learning · Computer Science 2017-06-06 Maxim Raginsky , Alexander Rakhlin , Matus Telgarsky

Improved Convergence Rate of Stochastic Gradient Langevin Dynamics with Variance Reduction and its Application to Optimization

The stochastic gradient Langevin Dynamics is one of the most fundamental algorithms to solve sampling problems and non-convex optimization appearing in several machine learning applications. Especially, its variance reduced versions have…

Machine Learning · Computer Science 2022-11-22 Yuri Kinoshita , Taiji Suzuki

On stochastic gradient Langevin dynamics with dependent data streams: the fully non-convex case

We consider the problem of sampling from a target distribution, which is \emph {not necessarily logconcave}, in the context of empirical risk minimization and stochastic optimization as presented in Raginsky et al. (2017). Non-asymptotic…

Statistics Theory · Mathematics 2021-02-03 Ngoc Huy Chau , Éric Moulines , Miklos Rásonyi , Sotirios Sabanis , Ying Zhang

Efficient Stochastic Gradient Descent for Learning with Distributionally Robust Optimization

Distributionally robust optimization (DRO) problems are increasingly seen as a viable method to train machine learning models for improved model generalization. These min-max formulations, however, are more difficult to solve. We therefore…

Machine Learning · Statistics 2020-11-03 Soumyadip Ghosh , Mark Squillante , Ebisa Wollega

A Contour Stochastic Gradient Langevin Dynamics Algorithm for Simulations of Multi-modal Distributions

We propose an adaptively weighted stochastic gradient Langevin dynamics algorithm (SGLD), so-called contour stochastic gradient Langevin dynamics (CSGLD), for Bayesian learning in big data statistics. The proposed algorithm is essentially a…

Machine Learning · Statistics 2022-05-24 Wei Deng , Guang Lin , Faming Liang

Robust Stochastic Optimization via Gradient Quantile Clipping

We introduce a clipping strategy for Stochastic Gradient Descent (SGD) which uses quantiles of the gradient norm as clipping thresholds. We prove that this new strategy provides a robust and efficient optimization algorithm for smooth…

Machine Learning · Statistics 2024-10-15 Ibrahim Merad , Stéphane Gaïffas

Hybrid Stochastic Gradient Descent Algorithms for Stochastic Nonconvex Optimization

We introduce a hybrid stochastic estimator to design stochastic gradient algorithms for solving stochastic optimization problems. Such a hybrid estimator is a convex combination of two existing biased and unbiased estimators and leads to…

Optimization and Control · Mathematics 2019-05-16 Quoc Tran-Dinh , Nhan H. Pham , Dzung T. Phan , Lam M. Nguyen

Non-Reversible Langevin Algorithms for Constrained Sampling

We consider the constrained sampling problem where the goal is to sample from a target distribution on a constrained domain. We propose skew-reflected non-reversible Langevin dynamics (SRNLD), a continuous-time stochastic differential…

Machine Learning · Computer Science 2025-04-16 Hengrong Du , Qi Feng , Changwei Tu , Xiaoyu Wang , Lingjiong Zhu

Global Convergence of Langevin Dynamics Based Algorithms for Nonconvex Optimization

We present a unified framework to analyze the global convergence of Langevin dynamics based algorithms for nonconvex finite-sum optimization with $n$ component functions. At the core of our analysis is a direct analysis of the ergodicity of…

Machine Learning · Statistics 2020-10-20 Pan Xu , Jinghui Chen , Difan Zou , Quanquan Gu

Distributed Stochastic Optimization via Adaptive SGD

Stochastic convex optimization algorithms are the most popular way to train machine learning models on large-scale data. Scaling up the training process of these models is crucial, but the most popular algorithm, Stochastic Gradient Descent…

Machine Learning · Statistics 2018-10-30 Ashok Cutkosky , Robert Busa-Fekete

Distributionally Robust Groupwise Regularization Estimator

Regularized estimators in the context of group variables have been applied successfully in model and feature selection in order to preserve interpretability. We formulate a Distributionally Robust Optimization (DRO) problem which recovers…

Statistics Theory · Mathematics 2017-05-12 Jose Blanchet , Yang Kang

Faster Convergence of Stochastic Gradient Langevin Dynamics for Non-Log-Concave Sampling

We provide a new convergence analysis of stochastic gradient Langevin dynamics (SGLD) for sampling from a class of distributions that can be non-log-concave. At the core of our approach is a novel conductance analysis of SGLD using an…

Machine Learning · Computer Science 2021-02-24 Difan Zou , Pan Xu , Quanquan Gu