English
Related papers

Related papers: Accelerating Greedy Coordinate Descent Methods

200 papers

In this paper we study the well-known greedy coordinate descent (GCD) algorithm to solve $\ell_1$-regularized problems and improve GCD by the two popular strategies: Nesterov's acceleration and stochastic optimization. Firstly, we propose a…

Optimization and Control · Mathematics 2017-06-27 Chaobing Song , Shaobo Cui , Yong Jiang , Shu-Tao Xia

Accelerated coordinate descent is a widely popular optimization algorithm due to its efficiency on large-dimensional problems. It achieves state-of-the-art complexity on an important class of empirical risk minimization problems. In this…

Optimization and Control · Mathematics 2018-10-01 Filip Hanzely , Peter Richtárik

Coordinate descent with random coordinate selection is the current state of the art for many large scale optimization problems. However, greedy selection of the steepest coordinate on smooth problems can yield convergence rates independent…

Optimization and Control · Mathematics 2018-10-17 Sai Praneeth Karimireddy , Anastasia Koloskova , Sebastian U. Stich , Martin Jaggi

We propose a variable decomposition algorithm -greedy block coordinate descent (GBCD)- in order to make dense Gaussian process regression practical for large scale problems. GBCD breaks a large scale optimization into a series of small…

Machine Learning · Computer Science 2012-06-18 Liefeng Bo , Cristian Sminchisescu

The coordinate descent (CD) method has recently become popular for solving very large-scale problems, partly due to its simple update, low memory requirement, and fast convergence. In this paper, we explore the greedy CD on solving…

Optimization and Control · Mathematics 2020-12-14 Chenyu Wu , Yangyang Xu

Due to the high communication cost in distributed and federated learning problems, methods relying on compression of communicated messages are becoming increasingly popular. While in other contexts the best performing gradient-type methods…

Optimization and Control · Mathematics 2020-06-29 Zhize Li , Dmitry Kovalev , Xun Qian , Peter Richtárik

Block coordinate descent (BCD) methods are widely used for large-scale numerical optimization because of their cheap iteration costs, low memory requirements, amenability to parallelization, and ability to exploit problem structure. Three…

Optimization and Control · Mathematics 2022-08-02 Julie Nutini , Issam Laradji , Mark Schmidt

We propose a new selection rule for the coordinate selection in coordinate descent methods for huge-scale optimization. The efficiency of this novel scheme is provably better than the efficiency of uniformly random selection, and can reach…

Machine Learning · Computer Science 2017-06-27 Sebastian U. Stich , Anant Raj , Martin Jaggi

We consider the decentralized optimization problem, where a network of $n$ agents aims to collaboratively minimize the average of their individual smooth and convex objective functions through peer-to-peer communication in a directed graph.…

Optimization and Control · Mathematics 2023-12-07 Zhuoqing Song , Lei Shi , Shi Pu , Ming Yan

Nesterov's accelerated gradient descent (AGD), an instance of the general family of "momentum methods", provably achieves faster convergence rate than gradient descent (GD) in the convex setting. However, whether these methods are superior…

Machine Learning · Computer Science 2017-11-29 Chi Jin , Praneeth Netrapalli , Michael I. Jordan

Large-scale L1-regularized loss minimization problems arise in high-dimensional applications such as compressed sensing and high-dimensional supervised learning, including classification and regression problems. High-performance algorithms…

Machine Learning · Statistics 2012-12-19 Chad Scherrer , Ambuj Tewari , Mahantesh Halappanavar , David Haglin

The state-of-the-art methods for solving optimization problems in big dimensions are variants of randomized coordinate descent (RCD). In this paper we introduce a fundamentally new type of acceleration strategy for RCD based on the…

Optimization and Control · Mathematics 2018-02-13 Dmitry Kovalev , Eduard Gorbunov , Elnur Gasanov , Peter Richtárik

We seek tight bounds on the viable parallelism in asynchronous implementations of coordinate descent that achieves linear speedup. We focus on asynchronous coordinate descent (ACD) algorithms on convex functions which consist of the sum of…

Optimization and Control · Mathematics 2020-08-04 Yun Kuen Cheung , Richard Cole , Yixin Tao

Novel coordinate descent (CD) methods are proposed for minimizing nonconvex functions consisting of three terms: (i) a continuously differentiable term, (ii) a simple convex term, and (iii) a concave and continuous term. First, by extending…

Optimization and Control · Mathematics 2019-09-15 Qi Deng , Chenghao Lan

This work proposes A$^2$GD, a novel adaptive accelerated gradient descent method for convex and composite optimization. Smoothness and convexity constants are updated via Lyapunov analysis. Inspired by stability analysis in ODE solvers, the…

Optimization and Control · Mathematics 2026-02-10 Zeyi Xu , Long Chen

Accelerated coordinate descent is widely used in optimization due to its cheap per-iteration cost and scalability to large-scale problems. Up to a primal-dual transformation, it is also the same as accelerated stochastic gradient descent…

Optimization and Control · Mathematics 2016-05-30 Zeyuan Allen-Zhu , Zheng Qu , Peter Richtárik , Yang Yuan

Nonconvex and nonsmooth problems have recently attracted considerable attention in machine learning. However, developing efficient methods for the nonconvex and nonsmooth optimization problems with certain performance guarantee remains a…

Optimization and Control · Mathematics 2019-02-07 Ehsan Kazemi , Liqiang Wang

Gradient descent is an important class of iterative algorithms for minimizing convex functions. Classically, gradient descent has been a sequential and synchronous process. Distributed and asynchronous variants of gradient descent have been…

Optimization and Control · Mathematics 2014-12-02 Yun Kuen Cheung , Richard Cole

We present a family of algorithms, called descent algorithms, for optimizing convex and non-convex functions. We also introduce a new first-order algorithm, called rescaled gradient descent (RGD), and show that RGD achieves a faster…

Optimization and Control · Mathematics 2020-01-07 Ashia Wilson , Lester Mackey , Andre Wibisono

We consider minimizing a smooth function subject to a summation constraint over its variables. By exploiting a connection between the greedy 2-coordinate update for this problem and equality-constrained steepest descent in the 1-norm, we…

Optimization and Control · Mathematics 2023-07-04 Amrutha Varshini Ramesh , Aaron Mishkin , Mark Schmidt , Yihan Zhou , Jonathan Wilder Lavington , Jennifer She
‹ Prev 1 2 3 10 Next ›