Related papers: Variance-based regularization with convex objectiv…

Stochastic dual averaging methods using variance reduction techniques for regularized empirical risk minimization problems

We consider a composite convex minimization problem associated with regularized empirical risk minimization, which often arises in machine learning. We propose two new stochastic gradient methods that are based on stochastic dual averaging…

Optimization and Control · Mathematics 2016-03-09 Tomoya Murata , Taiji Suzuki

Un-regularizing: approximate proximal point and faster stochastic algorithms for empirical risk minimization

We develop a family of accelerated stochastic algorithms that minimize sums of convex functions. Our algorithms improve upon the fastest running time for empirical risk minimization (ERM), and in particular linear least-squares regression,…

Machine Learning · Statistics 2015-06-25 Roy Frostig , Rong Ge , Sham M. Kakade , Aaron Sidford

Analytic approach to variance optimization under an $\ell_1$ constraint

The optimization of the variance supplemented by a budget constraint and an asymmetric $\ell_1$ regularizer is carried out analytically by the replica method borrowed from the theory of disordered systems. The asymmetric regularizer allows…

Portfolio Management · Quantitative Finance 2018-07-16 Imre Kondor , Gábor Papp , Fabio Caccioli

Enhanced Balancing of Bias-Variance Tradeoff in Stochastic Estimation: A Minimax Perspective

Biased stochastic estimators, such as finite-differences for noisy gradient estimation, often contain parameters that need to be properly chosen to balance impacts from the bias and the variance. While the optimal order of these parameters…

Methodology · Statistics 2019-02-14 Henry Lam , Xinyu Zhang , Xuhui Zhang

Instance-optimal stochastic convex optimization: Can we improve upon sample-average and robust stochastic approximation?

We study the unconstrained minimization of a smooth and strongly convex population loss function under a stochastic oracle that introduces both additive and multiplicative noise; this is a canonical and widely-studied setting that arises…

Optimization and Control · Mathematics 2026-03-27 Liwei Jiang , Ashwin Pananjady

Beyond Least-Squares: Fast Rates for Regularized Empirical Risk Minimization through Self-Concordance

We consider learning methods based on the regularization of a convex empirical risk by a squared Hilbertian norm, a setting that includes linear predictors and non-linear predictors through positive-definite kernels. In order to go beyond…

Machine Learning · Computer Science 2019-06-19 Ulysse Marteau-Ferey , Dmitrii Ostrovskii , Francis Bach , Alessandro Rudi

A Stochastic Subgradient Method for Distributionally Robust Non-Convex Learning

We consider a distributionally robust formulation of stochastic optimization problems arising in statistical learning, where robustness is with respect to uncertainty in the underlying data distribution. Our formulation builds on…

Optimization and Control · Mathematics 2021-06-09 Mert Gürbüzbalaban , Andrzej Ruszczyński , Landi Zhu

On variance reduction for stochastic smooth convex optimization with multiplicative noise

We propose dynamic sampled stochastic approximation (SA) methods for stochastic optimization with a heavy-tailed distribution (with finite 2nd moment). The objective is the sum of a smooth convex function with a convex regularizer.…

Optimization and Control · Mathematics 2017-05-26 Alejandro Jofré , Philip Thompson

Doubly Accelerated Stochastic Variance Reduced Dual Averaging Method for Regularized Empirical Risk Minimization

In this paper, we develop a new accelerated stochastic gradient method for efficiently solving the convex regularized empirical risk minimization problem in mini-batch settings. The use of mini-batches is becoming a golden standard in the…

Optimization and Control · Mathematics 2017-09-20 Tomoya Murata , Taiji Suzuki

Regularization via Mass Transportation

The goal of regression and classification methods in supervised learning is to minimize the empirical risk, that is, the expectation of some loss function quantifying the prediction error under the empirical distribution. When facing scarce…

Optimization and Control · Mathematics 2019-07-15 Soroosh Shafieezadeh-Abadeh , Daniel Kuhn , Peyman Mohajerin Esfahani

Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization

Algorithmic reproducibility measures the deviation in outputs of machine learning algorithms upon minor changes in the training process. Previous work suggests that first-order methods would need to trade-off convergence rate (gradient…

Machine Learning · Computer Science 2024-01-11 Liang Zhang , Junchi Yang , Amin Karbasi , Niao He

A new analytical approach to consistency and overfitting in regularized empirical risk minimization

This work considers the problem of binary classification: given training data $x_1, \dots, x_n$ from a certain population, together with associated labels $y_1,\dots, y_n \in \left\{0,1 \right\}$, determine the best label for an element $x$…

Statistics Theory · Mathematics 2016-07-04 Nicolas Garcia Trillos , Ryan Murray

Consistency of Distributionally Robust Risk- and Chance-Constrained Optimization under Wasserstein Ambiguity Sets

We study stochastic optimization problems with chance and risk constraints, where in the latter, risk is quantified in terms of the conditional value-at-risk (CVaR). We consider the distributionally robust versions of these problems, where…

Optimization and Control · Mathematics 2020-12-17 Ashish Cherukuri , Ashish R. Hota

On a Class of Optimal Reinsurance Problems

De Finetti's optimal reinsurance is a set of contracts, one for each risk in a portfolio, that caps the retained aggregate variance to a pre-specified level while minimizing total expected loss. The premiums are determined using the…

Optimization and Control · Mathematics 2026-03-03 N. D. Shyamalkumar , Tianrun Wang

Variance Regularization for Accelerating Stochastic Optimization

While nowadays most gradient-based optimization methods focus on exploring the high-dimensional geometric features, the random error accumulated in a stochastic version of any algorithm implementation has not been stressed yet. In this…

Machine Learning · Computer Science 2020-08-14 Tong Yang , Long Sha , Pengyu Hong

General risk measures for robust machine learning

A wide array of machine learning problems are formulated as the minimization of the expectation of a convex loss function on some parameter space. Since the probability distribution of the data of interest is usually unknown, it is is often…

Optimization and Control · Mathematics 2019-05-27 Emilie Chouzenoux , Henri Gérard , Jean-Christophe Pesquet

Stability and Sample-based Approximations of Composite Stochastic Optimization Problems

Optimization under uncertainty and risk is indispensable in many practical situations. Our paper addresses stability of optimization problems using composite risk functionals which are subjected to measure perturbations. Our main focus is…

Optimization and Control · Mathematics 2022-01-06 Darinka Dentcheva , Yang Lin , Spiridon Penev

Analytic solution to variance optimization with no short-selling

A large portfolio of independent returns is optimized under the variance risk measure with a ban on short positions. The no-short selling constraint acts as an asymmetric $\ell_1$ regularizer, setting some of the portfolio weights to zero…

Portfolio Management · Quantitative Finance 2018-01-17 Imre Kondor , Gábor Papp , Fabio Caccioli

On the relations of stochastic convex optimization problems with empirical risk minimization problems on $p$-norm balls

In this paper, we consider convex stochastic optimization problems arising in machine learning applications (e.g., risk minimization) and mathematical statistics (e.g., maximum likelihood estimation). There are two main approaches to solve…

Optimization and Control · Mathematics 2022-03-03 Darina Dvinskikh , Vitali Pirau , Alexander Gasnikov

Nonconvex Variance Reduced Optimization with Arbitrary Sampling

We provide the first importance sampling variants of variance reduced algorithms for empirical risk minimization with non-convex loss functions. In particular, we analyze non-convex versions of SVRG, SAGA and SARAH. Our methods have the…

Optimization and Control · Mathematics 2019-02-01 Samuel Horváth , Peter Richtárik