Related papers: SDCA without Duality

SDCA without Duality, Regularization, and Individual Convexity

Stochastic Dual Coordinate Ascent is a popular method for solving regularized loss minimization for the case of convex losses. We describe variants of SDCA that do not require explicit regularization and do not rely on duality. We prove…

Machine Learning · Computer Science 2016-05-24 Shai Shalev-Shwartz

Stochastic Dual Coordinate Ascent Methods for Regularized Loss Minimization

Stochastic Gradient Descent (SGD) has become popular for solving large scale supervised machine learning optimization problems such as SVM, due to their strong theoretical guarantees. While the closely related Dual Coordinate Ascent (DCA)…

Machine Learning · Statistics 2015-03-20 Shai Shalev-Shwartz , Tong Zhang

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent

Stochastic dual coordinate ascent (SDCA) is an effective technique for solving regularized loss minimization problems in machine learning. This paper considers an extension of SDCA under the mini-batch setting that is often used in…

Machine Learning · Statistics 2013-05-14 Shai Shalev-Shwartz , Tong Zhang

Stochastic Dual Coordinate Ascent with Adaptive Probabilities

This paper introduces AdaSDCA: an adaptive variant of stochastic dual coordinate ascent (SDCA) for solving the regularized empirical risk minimization problems. Our modification consists in allowing the method adaptively change the…

Optimization and Control · Mathematics 2015-03-02 Dominik Csiba , Zheng Qu , Peter Richtárik

Linear convergence of SDCA in statistical estimation

In this paper, we consider stochastic dual coordinate (SDCA) {\em without} strongly convex assumption or convex assumption. We show that SDCA converges linearly under mild conditions termed restricted strong convexity. This covers a wide…

Machine Learning · Statistics 2017-04-04 Chao Qu , Huan Xu

Distributed Mini-Batch SDCA

We present an improved analysis of mini-batched stochastic dual coordinate ascent for regularized empirical loss minimization (i.e. SVM and SVM-type objectives). Our analysis allows for flexible sampling schemes, including where data is…

Machine Learning · Computer Science 2015-07-31 Martin Takáč , Peter Richtárik , Nathan Srebro

Analysis of Distributed Stochastic Dual Coordinate Ascent

In \citep{Yangnips13}, the author presented distributed stochastic dual coordinate ascent (DisDCA) algorithms for solving large-scale regularized loss minimization. Extraordinary performances have been observed and reported for the…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-03-25 Tianbao Yang , Shenghuo Zhu , Rong Jin , Yuanqing Lin

Adaptive Stochastic Dual Coordinate Ascent for Conditional Random Fields

This work investigates the training of conditional random fields (CRFs) via the stochastic dual coordinate ascent (SDCA) algorithm of Shalev-Shwartz and Zhang (2016). SDCA enjoys a linear convergence rate and a strong empirical performance…

Machine Learning · Statistics 2018-07-11 Rémi Le Priol , Alexandre Piché , Simon Lacoste-Julien

Hybrid-DCA: A Double Asynchronous Approach for Stochastic Dual Coordinate Ascent

In prior works, stochastic dual coordinate ascent (SDCA) has been parallelized in a multi-core environment where the cores communicate through shared memory, or in a multi-processor distributed memory environment where the processors…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-03 Soumitra Pal , Tingyang Xu , Tianbao Yang , Sanguthevar Rajasekaran , Jinbo Bi

Dual Free Adaptive Minibatch SDCA for Empirical Risk Minimization

In this paper we develop an adaptive dual free Stochastic Dual Coordinate Ascent (adfSDCA) algorithm for regularized empirical risk minimization problems. This is motivated by the recent work on dual free SDCA of Shalev-Shwartz (2016). The…

Optimization and Control · Mathematics 2018-01-26 Xi He , Rachael Tappenden , Martin Takac

Online Dual Coordinate Ascent Learning

The stochastic dual coordinate-ascent (S-DCA) technique is a useful alternative to the traditional stochastic gradient-descent algorithm for solving large-scale optimization problems due to its scalability to large data sets and strong…

Optimization and Control · Mathematics 2016-02-25 Bicheng Ying , Kun Yuan , Ali H. Sayed

Dual Averaging Converges for Nonconvex Smooth Stochastic Optimization

Dual averaging and gradient descent with their stochastic variants stand as the two canonical recipe books for first-order optimization: Every modern variant can be viewed as a descendant of one or the other. In the convex regime, these…

Optimization and Control · Mathematics 2025-05-28 Tuo Liu , El Mehdi Saad , Wojciech Kotłowski , Francesco Orabona

Alternating minimization and alternating descent over nonconvex sets

We analyze the performance of alternating minimization for loss functions optimized over two variables, where each variable may be restricted to lie in some potentially nonconvex constraint set. This type of setting arises naturally in…

Optimization and Control · Mathematics 2019-02-26 Wooseok Ha , Rina Foygel Barber

Proximal Stochastic Dual Coordinate Ascent

We introduce a proximal version of dual coordinate ascent method. We demonstrate how the derived algorithmic framework can be used for numerous regularized loss minimization problems, including $\ell_1$ regularization and structured output…

Machine Learning · Statistics 2012-11-13 Shai Shalev-Shwartz , Tong Zhang

On the convergence rate of the boosted Difference-of-Convex Algorithm (DCA)

The difference-of-convex algorithm (DCA) is a well-established nonlinear programming technique that solves successive convex optimization problems. These sub-problems are obtained from the difference-of-convex~(DC) decompositions of the…

Optimization and Control · Mathematics 2026-02-20 Hadi Abbaszadehpeivasti , Etienne de Klerk , Adrien Taylor

Stochastic Dual Ascent for Solving Linear Systems

We develop a new randomized iterative algorithm---stochastic dual ascent (SDA)---for finding the projection of a given vector onto the solution space of a linear system. The method is dual in nature: with the dual being a non-strongly…

Numerical Analysis · Mathematics 2016-01-29 Robert Mansel Gower , Peter Richtarik

DSA: Decentralized Double Stochastic Averaging Gradient Algorithm

This paper considers convex optimization problems where nodes of a network have access to summands of a global objective. Each of these local objectives is further assumed to be an average of a finite set of functions. The motivation for…

Optimization and Control · Mathematics 2015-06-16 Aryan Mokhtari , Alejandro Ribeiro

Convergence of dual ascent in non-convex/non-differentiable optimization

We revisit the classical dual ascent algorithm for minimization of convex functionals in the presence of linear constraints, and give convergence results which apply even for non-convex functionals. We describe limit points in terms of the…

Optimization and Control · Mathematics 2016-09-22 Fredrik Andersson , Marcus Carlsson , Carl Olsson

A Local Convergence Theory for the Stochastic Gradient Descent Method in Non-Convex Optimization With Non-isolated Local Minima

Loss functions with non-isolated minima have emerged in several machine learning problems, creating a gap between theory and practice. In this paper, we formulate a new type of local convexity condition that is suitable to describe the…

Machine Learning · Computer Science 2022-05-31 Taehee Ko , Xiantao Li

Accelerated Proximal Stochastic Dual Coordinate Ascent for Regularized Loss Minimization

We introduce a proximal version of the stochastic dual coordinate ascent method and show how to accelerate the method using an inner-outer iteration procedure. We analyze the runtime of the framework and obtain rates that improve…

Machine Learning · Statistics 2013-10-09 Shai Shalev-Shwartz , Tong Zhang