Related papers: Stochastic optimization with arbitrary recurrent d…

An optimally fast objective-function-free minimization algorithm using random subspaces

An algorithm for unconstrained non-convex optimization is described, which does not evaluate the objective function and in which minimization is carried out, at each iteration, within a randomly selected subspace. It is shown that this…

Optimization and Control · Mathematics 2025-01-31 S. Bellavia , S. Gratton , B. Morini , Ph. L. Toint

Stochastic Majorization-Minimization Algorithms for Large-Scale Optimization

Majorization-minimization algorithms consist of iteratively minimizing a majorizing surrogate of an objective function. Because of its simplicity and its wide applicability, this principle has been very popular in statistics and in signal…

Machine Learning · Statistics 2013-09-11 Julien Mairal

Optimal Rates for Robust Stochastic Convex Optimization

Machine learning algorithms in high-dimensional settings are highly susceptible to the influence of even a small fraction of structured outliers, making robust optimization techniques essential. In particular, within the…

Machine Learning · Computer Science 2025-04-25 Changyu Gao , Andrew Lowy , Xingyu Zhou , Stephen J. Wright

Stochastic regularized majorization-minimization with weakly convex and multi-convex surrogates

Stochastic majorization-minimization (SMM) is a class of stochastic optimization algorithms that proceed by sampling new data points and minimizing a recursive average of surrogate functions of an objective function. The surrogates are…

Optimization and Control · Mathematics 2023-03-22 Hanbaek Lyu

Stochastic Learning under Random Reshuffling with Constant Step-sizes

In empirical risk optimization, it has been observed that stochastic gradient implementations that rely on random reshuffling of the data achieve better performance than implementations that rely on sampling the data uniformly. Recent works…

Machine Learning · Computer Science 2019-01-30 Bicheng Ying , Kun Yuan , Stefan Vlaski , Ali H. Sayed

Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

We analyze the convergence rates of stochastic gradient algorithms for smooth finite-sum minimax optimization and show that, for many such algorithms, sampling the data points without replacement leads to faster convergence compared to…

Optimization and Control · Mathematics 2022-10-11 Aniket Das , Bernhard Schölkopf , Michael Muehlebach

Stochastic optimization and sparse statistical recovery: An optimal algorithm for high dimensions

We develop and analyze stochastic optimization algorithms for problems in which the expected loss is strongly convex, and the optimum is (approximately) sparse. Previous approaches are able to exploit only one of these two structures,…

Machine Learning · Statistics 2012-07-19 Alekh Agarwal , Sahand Negahban , Martin J. Wainwright

Instance-Optimality in I/O-Efficient Sampling and Sequential Estimation

Suppose we have a memory storing $0$s and $1$s and we want to estimate the frequency of $1$s by sampling. We want to do this I/O-efficiently, exploiting that each read gives a block of $B$ bits at unit cost; not just one bit. If the input…

Data Structures and Algorithms · Computer Science 2024-10-21 Shyam Narayanan , Václav Rozhoň , Jakub Tětek , Mikkel Thorup

On the Convergence Analysis of an Inexact Preconditioned Stochastic Model-Based Algorithm

This paper focuses on investigating an inexact stochastic model-based optimization algorithm that integrates preconditioning techniques for solving stochastic composite optimization problems. The proposed framework unifies and extends the…

Optimization and Control · Mathematics 2025-12-12 Chenglong Bao , Yancheng Yuan , Shulan Zhu

Mirror Descent Strikes Again: Optimal Stochastic Convex Optimization under Infinite Noise Variance

We study stochastic convex optimization under infinite noise variance. Specifically, when the stochastic gradient is unbiased and has uniformly bounded $(1+\kappa)$-th moment, for some $\kappa \in (0,1]$, we quantify the convergence rate of…

Machine Learning · Statistics 2022-02-24 Nuri Mert Vural , Lu Yu , Krishnakumar Balasubramanian , Stanislav Volgushev , Murat A. Erdogdu

Distributed stochastic proximal algorithm with random reshuffling for non-smooth finite-sum optimization

The non-smooth finite-sum minimization is a fundamental problem in machine learning. This paper develops a distributed stochastic proximal-gradient algorithm with random reshuffling to solve the finite-sum minimization over time-varying…

Optimization and Control · Mathematics 2022-10-11 Xia Jiang , Xianlin Zeng , Jian Sun , Jie Chen , Lihua Xie

Accelerated First-Order Optimization under Nonlinear Constraints

We exploit analogies between first-order algorithms for constrained optimization and non-smooth dynamical systems to design a new class of accelerated first-order algorithms for constrained optimization. Unlike Frank-Wolfe or projected…

Optimization and Control · Mathematics 2025-05-02 Michael Muehlebach , Michael I. Jordan

Assortment Optimization and the Sample Average Approximation

We consider a simple approach to solving assortment optimization under the random utility maximization model. The approach uses Monte-Carlo simulation to construct a ranking-based choice model that serves as a proxy for the true choice…

Optimization and Control · Mathematics 2025-10-02 Hassaan Khalid , Bradley Sturt

New nonasymptotic convergence rates of stochastic proximal pointalgorithm for convex optimization problems

Large sectors of the recent optimization literature focused in the last decade on the development of optimal stochastic first order schemes for constrained convex models under progressively relaxed assumptions. Stochastic proximal point is…

Optimization and Control · Mathematics 2020-05-05 Andrei Patrascu

Stochastic optimization with decision-dependent distributions

Stochastic optimization problems often involve data distributions that change in reaction to the decision variables. This is the case for example when members of the population respond to a deployed classifier by manipulating their features…

Optimization and Control · Mathematics 2020-12-15 Dmitriy Drusvyatskiy , Lin Xiao

Alternating minimization for generalized rank one matrix sensing: Sharp predictions from a random initialization

We consider the problem of estimating the factors of a rank-$1$ matrix with i.i.d. Gaussian, rank-$1$ measurements that are nonlinearly transformed and corrupted by noise. Considering two prototypical choices for the nonlinearity, we study…

Optimization and Control · Mathematics 2024-10-02 Kabir Aladin Chandrasekher , Mengqi Lou , Ashwin Pananjady

Randomized Smoothing for Stochastic Optimization

We analyze convergence rates of stochastic optimization procedures for non-smooth convex optimization problems. By combining randomized smoothing techniques with accelerated gradient methods, we obtain convergence rates of stochastic…

Optimization and Control · Mathematics 2012-04-10 John C. Duchi , Peter L. Bartlett , Martin J. Wainwright

Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization

Stochastic gradient methods for machine learning and optimization problems are usually analyzed assuming data points are sampled \emph{with} replacement. In practice, however, sampling \emph{without} replacement is very common, easier to…

Machine Learning · Computer Science 2016-10-18 Ohad Shamir

Stochastic Nonconvex Optimization with Large Minibatches

We study stochastic optimization of nonconvex loss functions, which are typical objectives for training neural networks. We propose stochastic approximation algorithms which optimize a series of regularized, nonlinearized losses on large…

Machine Learning · Computer Science 2019-03-12 Weiran Wang , Nathan Srebro

Improved Analysis and Rates for Variance Reduction under Without-replacement Sampling Orders

When applying a stochastic algorithm, one must choose an order to draw samples. The practical choices are without-replacement sampling orders, which are empirically faster and more cache-friendly than uniform-iid-sampling but often have…

Machine Learning · Computer Science 2021-10-28 Xinmeng Huang , Kun Yuan , Xianghui Mao , Wotao Yin