Related papers: Accelerating Greedy Coordinate Descent Methods

Accelerated Stochastic Greedy Coordinate Descent by Soft Thresholding Projection onto Simplex

In this paper we study the well-known greedy coordinate descent (GCD) algorithm to solve $\ell_1$-regularized problems and improve GCD by the two popular strategies: Nesterov's acceleration and stochastic optimization. Firstly, we propose a…

Optimization and Control · Mathematics 2017-06-27 Chaobing Song, Shaobo Cui, Yong Jiang, Shu-Tao Xia

Accelerated Coordinate Descent with Arbitrary Sampling and Best Rates for Minibatches

Accelerated coordinate descent is a widely popular optimization algorithm due to its efficiency on large-dimensional problems. It achieves state-of-the-art complexity on an important class of empirical risk minimization problems. In this…

Optimization and Control · Mathematics 2018-10-01 Filip Hanzely, Peter Richtárik

Efficient Greedy Coordinate Descent for Composite Problems

Coordinate descent with random coordinate selection is the current state of the art for many large scale optimization problems. However, greedy selection of the steepest coordinate on smooth problems can yield convergence rates independent…

Optimization and Control · Mathematics 2018-10-17 Sai Praneeth Karimireddy, Anastasia Koloskova, Sebastian U. Stich, Martin Jaggi

Greedy Block Coordinate Descent for Large Scale Gaussian Process Regression

We propose a variable decomposition algorithm, greedy block coordinate descent (GBCD), in order to make dense Gaussian process regression practical for large scale problems. GBCD breaks a large scale optimization into a series of small…

Machine Learning · Computer Science 2012-06-18 Liefeng Bo, Cristian Sminchisescu

Greedy coordinate descent method on non-negative quadratic programming

The coordinate descent (CD) method has recently become popular for solving very large-scale problems, partly due to its simple update, low memory requirement, and fast convergence. In this paper, we explore the greedy CD on solving…

Optimization and Control · Mathematics 2020-12-14 Chenyu Wu, Yangyang Xu

Acceleration for Compressed Gradient Descent in Distributed and Federated Optimization

Due to the high communication cost in distributed and federated learning problems, methods relying on compression of communicated messages are becoming increasingly popular. While in other contexts the best performing gradient-type methods…

Optimization and Control · Mathematics 2020-06-29 Zhize Li, Dmitry Kovalev, Xun Qian, Peter Richtárik

Let's Make Block Coordinate Descent Converge Faster: Faster Greedy Rules, Message-Passing, Active-Set Complexity, and Superlinear Convergence

Block coordinate descent (BCD) methods are widely used for large-scale numerical optimization because of their cheap iteration costs, low memory requirements, amenability to parallelization, and ability to exploit problem structure. Three…

Optimization and Control · Mathematics 2022-08-02 Julie Nutini, Issam Laradji, Mark Schmidt

Approximate Steepest Coordinate Descent

We propose a new selection rule for the coordinate selection in coordinate descent methods for huge-scale optimization. The efficiency of this novel scheme is provably better than the efficiency of uniformly random selection, and can reach…

Machine Learning · Computer Science 2017-06-27 Sebastian U. Stich, Anant Raj, Martin Jaggi

Provably Accelerated Decentralized Gradient Method Over Unbalanced Directed Graphs

We consider the decentralized optimization problem, where a network of $n$ agents aims to collaboratively minimize the average of their individual smooth and convex objective functions through peer-to-peer communication in a directed graph.…

Optimization and Control · Mathematics 2023-12-07 Zhuoqing Song, Lei Shi, Shi Pu, Ming Yan

Accelerated Gradient Descent Escapes Saddle Points Faster than Gradient Descent

Nesterov's accelerated gradient descent (AGD), an instance of the general family of "momentum methods", provably achieves faster convergence rate than gradient descent (GD) in the convex setting. However, whether these methods are superior…

Machine Learning · Computer Science 2017-11-29 Chi Jin, Praneeth Netrapalli, Michael I. Jordan

Feature Clustering for Accelerating Parallel Coordinate Descent

Large-scale L1-regularized loss minimization problems arise in high-dimensional applications such as compressed sensing and high-dimensional supervised learning, including classification and regression problems. High-performance algorithms…

Machine Learning · Statistics 2012-12-19 Chad Scherrer, Ambuj Tewari, Mahantesh Halappanavar, David Haglin

Stochastic Spectral and Conjugate Descent Methods

The state-of-the-art methods for solving optimization problems in big dimensions are variants of randomized coordinate descent (RCD). In this paper we introduce a fundamentally new type of acceleration strategy for RCD based on the…

Optimization and Control · Mathematics 2018-02-13 Dmitry Kovalev, Eduard Gorbunov, Elnur Gasanov, Peter Richtárik

Fully Asynchronous Stochastic Coordinate Descent: A Tight Lower Bound on the Parallelism Achieving Linear Speedup

We seek tight bounds on the viable parallelism in asynchronous implementations of coordinate descent that achieve linear speedup. We focus on asynchronous coordinate descent (ACD) algorithms on convex functions which consist of the sum of…

Optimization and Control · Mathematics 2020-08-04 Yun Kuen Cheung, Richard Cole, Yixin Tao

Efficiency of Coordinate Descent Methods For Structured Nonconvex Optimization

Novel coordinate descent (CD) methods are proposed for minimizing nonconvex functions consisting of three terms: (i) a continuously differentiable term, (ii) a simple convex term, and (iii) a concave and continuous term. First, by extending…

Optimization and Control · Mathematics 2019-09-15 Qi Deng, Chenghao Lan

Adaptive Accelerated Gradient Descent Methods for Convex Optimization

This work proposes A$^2$GD, a novel adaptive accelerated gradient descent method for convex and composite optimization. Smoothness and convexity constants are updated via Lyapunov analysis. Inspired by stability analysis in ODE solvers, the…

Optimization and Control · Mathematics 2026-02-10 Zeyi Xu, Long Chen

Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling

Accelerated coordinate descent is widely used in optimization due to its cheap per-iteration cost and scalability to large-scale problems. Up to a primal-dual transformation, it is also the same as accelerated stochastic gradient descent…

Optimization and Control · Mathematics 2016-05-30 Zeyuan Allen-Zhu, Zheng Qu, Peter Richtárik, Yang Yuan

Asynchronous Delay-Aware Accelerated Proximal Coordinate Descent for Nonconvex Nonsmooth Problems

Nonconvex and nonsmooth problems have recently attracted considerable attention in machine learning. However, developing efficient methods for the nonconvex and nonsmooth optimization problems with certain performance guarantee remains a…

Optimization and Control · Mathematics 2019-02-07 Ehsan Kazemi, Liqiang Wang

Amortized Analysis on Asynchronous Gradient Descent

Gradient descent is an important class of iterative algorithms for minimizing convex functions. Classically, gradient descent has been a sequential and synchronous process. Distributed and asynchronous variants of gradient descent have been…

Optimization and Control · Mathematics 2014-12-02 Yun Kuen Cheung, Richard Cole

Accelerating Rescaled Gradient Descent: Fast Optimization of Smooth Functions

We present a family of algorithms, called descent algorithms, for optimizing convex and non-convex functions. We also introduce a new first-order algorithm, called rescaled gradient descent (RGD), and show that RGD achieves a faster…

Optimization and Control · Mathematics 2020-01-07 Ashia Wilson, Lester Mackey, Andre Wibisono

Analyzing and Improving Greedy 2-Coordinate Updates for Equality-Constrained Optimization via Steepest Descent in the 1-Norm

We consider minimizing a smooth function subject to a summation constraint over its variables. By exploiting a connection between the greedy 2-coordinate update for this problem and equality-constrained steepest descent in the 1-norm, we…

Optimization and Control · Mathematics 2023-07-04 Amrutha Varshini Ramesh, Aaron Mishkin, Mark Schmidt, Yihan Zhou, Jonathan Wilder Lavington, Jennifer She