Related papers: SARAH: A Novel Method for Machine Learning Problem…

Stochastic Recursive Gradient Algorithm for Nonconvex Optimization

In this paper, we study and analyze the mini-batch version of StochAstic Recursive grAdient algoritHm (SARAH), a method employing the stochastic recursive gradient, for solving empirical loss minimization for the case of nonconvex losses.…

Machine Learning · Statistics 2017-05-23 Lam M. Nguyen , Jie Liu , Katya Scheinberg , Martin Takáč

Random-reshuffled SARAH does not need a full gradient computations

The StochAstic Recursive grAdient algoritHm (SARAH) algorithm is a variance reduced variant of the Stochastic Gradient Descent (SGD) algorithm that needs a gradient of the objective function from time to time. In this paper, we remove the…

Machine Learning · Computer Science 2024-01-17 Aleksandr Beznosikov , Martin Takáč

Inexact SARAH Algorithm for Stochastic Optimization

We develop and analyze a variant of the SARAH algorithm, which does not require computation of the exact gradient. Thus this new method can be applied to general expectation minimization problems rather than only finite sum problems. While…

Optimization and Control · Mathematics 2020-08-28 Lam M. Nguyen , Katya Scheinberg , Martin Takáč

AI-SARAH: Adaptive and Implicit Stochastic Recursive Gradient Methods

We present AI-SARAH, a practical variant of SARAH. As a variant of SARAH, this algorithm employs the stochastic recursive gradient yet adjusts step-size based on local geometry. AI-SARAH implicitly computes step-size and efficiently…

Machine Learning · Computer Science 2022-02-02 Zheng Shi , Abdurakhmon Sadiev , Nicolas Loizou , Peter Richtárik , Martin Takáč

SARAH-based Variance-reduced Algorithm for Stochastic Finite-sum Cocoercive Variational Inequalities

Variational inequalities are a broad formalism that encompasses a vast number of applications. Motivated by applications in machine learning and beyond, stochastic methods are of great importance. In this paper we consider the problem of…

Optimization and Control · Mathematics 2023-09-26 Aleksandr Beznosikov , Alexander Gasnikov

ZeroSARAH: Efficient Nonconvex Finite-Sum Optimization with Zero Full Gradient Computation

We propose ZeroSARAH -- a novel variant of the variance-reduced method SARAH (Nguyen et al., 2017) -- for minimizing the average of a large number of nonconvex functions $\frac{1}{n}\sum_{i=1}^{n}f_i(x)$. To the best of our knowledge, in…

Machine Learning · Computer Science 2021-10-12 Zhize Li , Slavomír Hanzely , Peter Richtárik

Accelerating Mini-batch SARAH by Step Size Rules

StochAstic Recursive grAdient algoritHm (SARAH), originally proposed for convex optimization and also proven to be effective for general nonconvex optimization, has received great attention due to its simple recursive framework for updating…

Machine Learning · Computer Science 2019-06-21 Zhuang Yang , Zengping Chen , Cheng Wang

On Biased Stochastic Gradient Estimation

We present a uniform analysis of biased stochastic gradient methods for minimizing convex, strongly convex, and non-convex composite objectives, and identify settings where bias is useful in stochastic gradient estimation. The framework we…

Optimization and Control · Mathematics 2020-02-28 Derek Driggs , Jingwei Liang , Carola-Bibiane Schönlieb

Variance Reduction Methods Do Not Need to Compute Full Gradients: Improved Efficiency through Shuffling

Stochastic optimization algorithms are widely used for machine learning with large-scale data. However, their convergence often suffers from non-vanishing variance. Variance Reduction (VR) methods, such as SVRG and SARAH, address this issue…

Machine Learning · Computer Science 2026-01-12 Daniil Medyakov , Gleb Molodtsov , Savelii Chezhegov , Alexey Rebrikov , Aleksandr Beznosikov

Stochastic Reweighted Gradient Descent

Despite the strong theoretical guarantees that variance-reduced finite-sum optimization algorithms enjoy, their applicability remains limited to cases where the memory overhead they introduce (SAG/SAGA), or the periodic full gradient…

Optimization and Control · Mathematics 2021-03-24 Ayoub El Hanchi , David A. Stephens

ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization

We propose a new stochastic first-order algorithmic framework to solve stochastic composite nonconvex optimization problems that covers both finite-sum and expectation settings. Our algorithms rely on the SARAH estimator introduced in…

Optimization and Control · Mathematics 2019-04-01 Nhan H. Pham , Lam M. Nguyen , Dzung T. Phan , Quoc Tran-Dinh

Minimizing Finite Sums with the Stochastic Average Gradient

We propose the stochastic average gradient (SAG) method for optimizing the sum of a finite number of smooth convex functions. Like stochastic gradient (SG) methods, the SAG method's iteration cost is independent of the number of terms in…

Optimization and Control · Mathematics 2016-05-12 Mark Schmidt , Nicolas Le Roux , Francis Bach

Adjusted Shuffling SARAH: Advancing Complexity Analysis via Dynamic Gradient Weighting

In this paper, we propose Adjusted Shuffling SARAH, a novel algorithm that integrates shuffling strategies into the recursive SARAH framework using a dynamic weighting mechanism to enhance exploration. We analyze the algorithm under two…

Optimization and Control · Mathematics 2026-05-28 Duc Toan Nguyen , Trang H. Tran , Lam M. Nguyen

An Optimal Hybrid Variance-Reduced Algorithm for Stochastic Composite Nonconvex Optimization

In this note we propose a new variant of the hybrid variance-reduced proximal gradient method in [7] to solve a common stochastic composite nonconvex optimization problem under standard assumptions. We simply replace the independent…

Optimization and Control · Mathematics 2020-08-21 Deyi Liu , Lam M. Nguyen , Quoc Tran-Dinh

A Retrospective Approximation Approach for Smooth Stochastic Optimization

Stochastic Gradient (SG) is the defacto iterative technique to solve stochastic optimization (SO) problems with a smooth (non-convex) objective $f$ and a stochastic first-order oracle. SG's attractiveness is due in part to its simplicity of…

Optimization and Control · Mathematics 2024-03-08 David Newton , Raghu Bollapragada , Raghu Pasupathy , Nung Kwan Yip

A Stochastic Gradient Method with an Exponential Convergence Rate for Finite Training Sets

We propose a new stochastic gradient method for optimizing the sum of a finite set of smooth functions, where the sum is strongly convex. While standard stochastic gradient methods converge at sublinear rates for this problem, the proposed…

Optimization and Control · Mathematics 2013-03-12 Nicolas Le Roux , Mark Schmidt , Francis Bach

Stochastic Conjugate Frameworks for Nonconvex and Nonsmooth Optimization

We introduce two new stochastic conjugate frameworks for a class of nonconvex and possibly also nonsmooth optimization problems. These frameworks are built upon Stochastic Recursive Gradient Algorithm (SARAH) and we thus refer to them as…

Optimization and Control · Mathematics 2023-10-23 Jiangshan Wang , Zheng Peng

SAGA and Restricted Strong Convexity

SAGA is a fast incremental gradient method on the finite sum problem and its effectiveness has been tested on a vast of applications. In this paper, we analyze SAGA on a class of non-strongly convex and non-convex statistical problem such…

Machine Learning · Statistics 2017-02-28 Chao Qu , Yan Li , Huan Xu

Probabilistic Guarantees of Stochastic Recursive Gradient in Non-Convex Finite Sum Problems

This paper develops a new dimension-free Azuma-Hoeffding type bound on summation norm of a martingale difference sequence with random individual bounds. With this novel result, we provide high-probability bounds for the gradient norm…

Machine Learning · Statistics 2024-01-31 Yanjie Zhong , Jiaqi Li , Soumendra Lahiri

SUCAG: Stochastic Unbiased Curvature-aided Gradient Method for Distributed Optimization

We propose and analyze a new stochastic gradient method, which we call Stochastic Unbiased Curvature-aided Gradient (SUCAG), for finite sum optimization problems. SUCAG constitutes an unbiased total gradient tracking technique that uses…

Optimization and Control · Mathematics 2018-10-30 Hoi-To Wai , Nikolaos M. Freris , Angelia Nedic , Anna Scaglione