English
Related papers

Related papers: Robust SGLD algorithm for solving non-convex distr…

200 papers

Stochastic Gradient Langevin Dynamics (SGLD) is a powerful algorithm for optimizing a non-convex objective, where a controlled and properly scaled Gaussian noise is added to the stochastic gradients to steer the iterates towards a global…

Optimization and Control · Mathematics 2020-06-04 Yuanhan Hu , Xiaoyu Wang , Xuefeng Gao , Mert Gurbuzbalaban , Lingjiong Zhu

Distributionally robust optimization (DRO) is a powerful technique to train robust models against data distribution shift. This paper aims to solve regularized nonconvex DRO problems, where the uncertainty set is modeled by a so-called…

Optimization and Control · Mathematics 2025-06-30 Yufeng Yang , Yi Zhou , Zhaosong Lu

Distributionally Robust Optimization (DRO), as a popular method to train robust models against distribution shift between training and test sets, has received tremendous attention in recent years. In this paper, we propose and analyze…

Machine Learning · Computer Science 2023-08-17 Qi Qi , Jiameng Lyu , Kung sik Chan , Er Wei Bai , Tianbao Yang

Distributionally robust optimization (DRO) is a powerful framework for training robust models against data distribution shifts. This paper focuses on constrained DRO, which has an explicit characterization of the robustness level. Existing…

Machine Learning · Statistics 2024-04-02 Qi Zhang , Yi Zhou , Ashley Prater-Bennette , Lixin Shen , Shaofeng Zou

We study the Stochastic Gradient Langevin Dynamics (SGLD) algorithm for non-convex optimization. The algorithm performs stochastic gradient descent, where in each step it injects appropriately scaled Gaussian noise to the update. We analyze…

Machine Learning · Computer Science 2018-04-10 Yuchen Zhang , Percy Liang , Moses Charikar

Distributionally robust optimization (DRO) is a widely-used approach to learn models that are robust against distribution shift. Compared with the standard optimization setting, the objective function in DRO is more difficult to optimize,…

Machine Learning · Computer Science 2021-10-27 Jikai Jin , Bohang Zhang , Haiyang Wang , Liwei Wang

This paper investigates group distributionally robust optimization (GDRO) with the goal of learning a model that performs well over $m$ different distributions. First, we formulate GDRO as a stochastic convex-concave saddle-point problem,…

Machine Learning · Computer Science 2024-11-21 Lijun Zhang , Haomin Bai , Peng Zhao , Tianbao Yang , Zhi-Hua Zhou

A new approach in stochastic optimization via the use of stochastic gradient Langevin dynamics (SGLD) algorithms, which is a variant of stochastic gradient decent (SGD) methods, allows us to efficiently approximate global minimizers of…

Portfolio Management · Quantitative Finance 2020-07-06 Sotirios Sabanis , Ying Zhang

Stochastic Gradient Langevin Dynamics (SGLD) is a popular variant of Stochastic Gradient Descent, where properly scaled isotropic Gaussian noise is added to an unbiased estimate of the gradient at each iteration. This modest change allows…

Machine Learning · Computer Science 2017-06-06 Maxim Raginsky , Alexander Rakhlin , Matus Telgarsky

The stochastic gradient Langevin Dynamics is one of the most fundamental algorithms to solve sampling problems and non-convex optimization appearing in several machine learning applications. Especially, its variance reduced versions have…

Machine Learning · Computer Science 2022-11-22 Yuri Kinoshita , Taiji Suzuki

We consider the problem of sampling from a target distribution, which is \emph {not necessarily logconcave}, in the context of empirical risk minimization and stochastic optimization as presented in Raginsky et al. (2017). Non-asymptotic…

Statistics Theory · Mathematics 2021-02-03 Ngoc Huy Chau , Éric Moulines , Miklos Rásonyi , Sotirios Sabanis , Ying Zhang

Distributionally robust optimization (DRO) problems are increasingly seen as a viable method to train machine learning models for improved model generalization. These min-max formulations, however, are more difficult to solve. We therefore…

Machine Learning · Statistics 2020-11-03 Soumyadip Ghosh , Mark Squillante , Ebisa Wollega

We propose an adaptively weighted stochastic gradient Langevin dynamics algorithm (SGLD), so-called contour stochastic gradient Langevin dynamics (CSGLD), for Bayesian learning in big data statistics. The proposed algorithm is essentially a…

Machine Learning · Statistics 2022-05-24 Wei Deng , Guang Lin , Faming Liang

We introduce a clipping strategy for Stochastic Gradient Descent (SGD) which uses quantiles of the gradient norm as clipping thresholds. We prove that this new strategy provides a robust and efficient optimization algorithm for smooth…

Machine Learning · Statistics 2024-10-15 Ibrahim Merad , Stéphane Gaïffas

We introduce a hybrid stochastic estimator to design stochastic gradient algorithms for solving stochastic optimization problems. Such a hybrid estimator is a convex combination of two existing biased and unbiased estimators and leads to…

Optimization and Control · Mathematics 2019-05-16 Quoc Tran-Dinh , Nhan H. Pham , Dzung T. Phan , Lam M. Nguyen

We consider the constrained sampling problem where the goal is to sample from a target distribution on a constrained domain. We propose skew-reflected non-reversible Langevin dynamics (SRNLD), a continuous-time stochastic differential…

Machine Learning · Computer Science 2025-04-16 Hengrong Du , Qi Feng , Changwei Tu , Xiaoyu Wang , Lingjiong Zhu

We present a unified framework to analyze the global convergence of Langevin dynamics based algorithms for nonconvex finite-sum optimization with $n$ component functions. At the core of our analysis is a direct analysis of the ergodicity of…

Machine Learning · Statistics 2020-10-20 Pan Xu , Jinghui Chen , Difan Zou , Quanquan Gu

Stochastic convex optimization algorithms are the most popular way to train machine learning models on large-scale data. Scaling up the training process of these models is crucial, but the most popular algorithm, Stochastic Gradient Descent…

Machine Learning · Statistics 2018-10-30 Ashok Cutkosky , Robert Busa-Fekete

Regularized estimators in the context of group variables have been applied successfully in model and feature selection in order to preserve interpretability. We formulate a Distributionally Robust Optimization (DRO) problem which recovers…

Statistics Theory · Mathematics 2017-05-12 Jose Blanchet , Yang Kang

We provide a new convergence analysis of stochastic gradient Langevin dynamics (SGLD) for sampling from a class of distributions that can be non-log-concave. At the core of our approach is a novel conductance analysis of SGLD using an…

Machine Learning · Computer Science 2021-02-24 Difan Zou , Pan Xu , Quanquan Gu
‹ Prev 1 2 3 10 Next ›