Related papers: Incremental Without Replacement Sampling in Noncon…

On the Batch Size Selection in Stochastic Gradient Methods Using No-Replacement Sampling

Recent stochastic gradient methods that have appeared in the literature base their efficiency and global convergence properties on a suitable control of the variance of the gradient batch estimate. This control is typically achieved by…

Optimization and Control · Mathematics 2025-06-11 Marco Boresta , Alberto De Santis , Stefano Lucidi

Incremental Sampling Without Replacement for Sequence Models

Sampling is a fundamental technique, and sampling without replacement is often desirable when duplicate samples are not beneficial. Within machine learning, sampling is useful for generating diverse outputs from a trained model. We present…

Machine Learning · Computer Science 2021-07-21 Kensen Shi , David Bieber , Charles Sutton

Sampling without Replacement Leads to Faster Rates in Finite-Sum Minimax Optimization

We analyze the convergence rates of stochastic gradient algorithms for smooth finite-sum minimax optimization and show that, for many such algorithms, sampling the data points without replacement leads to faster convergence compared to…

Optimization and Control · Mathematics 2022-10-11 Aniket Das , Bernhard Schölkopf , Michael Muehlebach

Without-Replacement Sampling for Stochastic Gradient Methods: Convergence Results and Application to Distributed Optimization

Stochastic gradient methods for machine learning and optimization problems are usually analyzed assuming data points are sampled \emph{with} replacement. In practice, however, sampling \emph{without} replacement is very common, easier to…

Machine Learning · Computer Science 2016-10-18 Ohad Shamir

Gradient Sampling Methods for Nonsmooth Optimization

This paper reviews the gradient sampling methodology for solving nonsmooth, nonconvex optimization problems. An intuitively straightforward gradient sampling algorithm is stated and its convergence properties are summarized. Throughout this…

Optimization and Control · Mathematics 2018-05-01 James V. Burke , Frank E. Curtis , Adrian S. Lewis , Michael L. Overton , Lucas E. A. Simões

Provably Faster Algorithms for Bilevel Optimization via Without-Replacement Sampling

Bilevel Optimization has experienced significant advancements recently with the introduction of new efficient algorithms. Mirroring the success in single-level optimization, stochastic gradient-based algorithms are widely used in bilevel…

Optimization and Control · Mathematics 2024-11-12 Junyi Li , Heng Huang

Improved Optimization of Finite Sums with Minibatch Stochastic Variance Reduced Proximal Iterations

We present novel minibatch stochastic optimization methods for empirical risk minimization problems, the methods efficiently leverage variance reduced first-order and sub-sampled higher-order information to accelerate the convergence speed.…

Optimization and Control · Mathematics 2017-10-12 Jialei Wang , Tong Zhang

Fast Stochastic Methods for Nonsmooth Nonconvex Optimization

We analyze stochastic algorithms for optimizing nonconvex, nonsmooth finite-sum problems, where the nonconvex part is smooth and the nonsmooth part is convex. Surprisingly, unlike the smooth case, our knowledge of this fundamental problem…

Optimization and Control · Mathematics 2016-05-24 Sashank J. Reddi , Suvrit Sra , Barnabas Poczos , Alex Smola

Nonconvex Variance Reduced Optimization with Arbitrary Sampling

We provide the first importance sampling variants of variance reduced algorithms for empirical risk minimization with non-convex loss functions. In particular, we analyze non-convex versions of SVRG, SAGA and SARAH. Our methods have the…

Optimization and Control · Mathematics 2019-02-01 Samuel Horváth , Peter Richtárik

Data Sampling Strategies in Stochastic Algorithms for Empirical Risk Minimization

Gradient descent methods and especially their stochastic variants have become highly popular in the last decade due to their efficiency on big data optimization problems. In this thesis we present the development of data sampling strategies…

Optimization and Control · Mathematics 2018-04-03 Dominik Csiba

Effective Proximal Methods for Non-convex Non-smooth Regularized Learning

Sparse learning is a very important tool for mining useful information and patterns from high dimensional data. Non-convex non-smooth regularized learning problems play essential roles in sparse learning, and have drawn extensive attentions…

Machine Learning · Computer Science 2020-10-22 Guannan Liang , Qianqian Tong , Jiahao Ding , Miao Pan , Jinbo Bi

Incremental Methods for Weakly Convex Optimization

Incremental methods are widely utilized for solving finite-sum optimization problems in machine learning and signal processing. In this paper, we study a family of incremental methods -- including incremental subgradient, incremental…

Optimization and Control · Mathematics 2022-12-26 Xiao Li , Zhihui Zhu , Anthony Man-Cho So , Jason D Lee

A Stochastic Derivative-Free Optimization Method with Importance Sampling: Theory and Learning to Control

We consider the problem of unconstrained minimization of a smooth objective function in $\R^n$ in a setting where only function evaluations are possible. While importance sampling is one of the most popular techniques used by machine…

Optimization and Control · Mathematics 2020-04-03 Adel Bibi , El Houcine Bergou , Ozan Sener , Bernard Ghanem , Peter Richtárik

Doubly Accelerated Stochastic Variance Reduced Dual Averaging Method for Regularized Empirical Risk Minimization

In this paper, we develop a new accelerated stochastic gradient method for efficiently solving the convex regularized empirical risk minimization problem in mini-batch settings. The use of mini-batches is becoming a golden standard in the…

Optimization and Control · Mathematics 2017-09-20 Tomoya Murata , Taiji Suzuki

A Stochastic Composite Gradient Method with Incremental Variance Reduction

We consider the problem of minimizing the composition of a smooth (nonconvex) function and a smooth vector mapping, where the inner mapping is in the form of an expectation over some random variable or a finite sum. We propose a stochastic…

Optimization and Control · Mathematics 2019-06-26 Junyu Zhang , Lin Xiao

Beyond Smoothness and Convexity: Optimization via sampling

This work explores a novel perspective on solving nonconvex and nonsmooth optimization problems by leveraging sampling based methods. Instead of treating the objective function purely through traditional (often deterministic) optimization…

Optimization and Control · Mathematics 2025-05-21 Nahom Seyoum , Haoxiang You

Analysis of nonsmooth stochastic approximation: the differential inclusion approach

In this paper we address the convergence of stochastic approximation when the functions to be minimized are not convex and nonsmooth. We show that the "mean-limit" approach to the convergence which leads, for smooth problems, to the ODE…

Optimization and Control · Mathematics 2018-05-08 Szymon Majewski , Błażej Miasojedow , Eric Moulines

Mini-batch stochastic gradient descent with dynamic sample sizes

We focus on solving constrained convex optimization problems using mini-batch stochastic gradient descent. Dynamic sample size rules are presented which ensure a descent direction with high probability. Empirical results from two…

Optimization and Control · Mathematics 2017-08-03 Michael R. Metel

Improved Analysis and Rates for Variance Reduction under Without-replacement Sampling Orders

When applying a stochastic algorithm, one must choose an order to draw samples. The practical choices are without-replacement sampling orders, which are empirically faster and more cache-friendly than uniform-iid-sampling but often have…

Machine Learning · Computer Science 2021-10-28 Xinmeng Huang , Kun Yuan , Xianghui Mao , Wotao Yin

Subgradient sampling for nonsmooth nonconvex minimization

Risk minimization for nonsmooth nonconvex problems naturally leads to first-order sampling or, by an abuse of terminology, to stochastic subgradient descent. We establish the convergence of this method in the path-differentiable case and…

Optimization and Control · Mathematics 2024-07-24 Jérôme Bolte , Tam Le , Edouard Pauwels