Related papers: Distributed Coordinate Descent Method for Learning…

Hybrid Coordinate Descent for Efficient Neural Network Learning Using Line Search and Gradient Descent

This paper presents a novel coordinate descent algorithm leveraging a combination of one-directional line search and gradient information for parameter updates for a squared error loss function. Each parameter undergoes updates determined…

Machine Learning · Computer Science 2024-08-05 Yen-Che Hsiao , Abhishek Dutta

Faster Coordinate Descent via Adaptive Importance Sampling

Coordinate descent methods employ random partial updates of decision variables in order to solve huge-scale convex optimization problems. In this work, we introduce new adaptive rules for the random selection of their updates. By adaptive,…

Machine Learning · Computer Science 2017-03-08 Dmytro Perekrestenko , Volkan Cevher , Martin Jaggi

Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization

Hashing method maps similar data to binary hashcodes with smaller hamming distance, and it has received a broad attention due to its low storage cost and fast retrieval speed. However, the existing limitations make the present algorithms…

Computer Vision and Pattern Recognition · Computer Science 2016-09-29 Shifeng Zhang , Jianmin Li , Jinma Guo , Bo Zhang

A Hybrid Method of Combinatorial Search and Coordinate Descent for Discrete Optimization

Discrete optimization is a central problem in mathematical optimization with a broad range of applications, among which binary optimization and sparse optimization are two common ones. However, these problems are NP-hard and thus difficult…

Optimization and Control · Mathematics 2018-11-26 Ganzhao Yuan , Li Shen , Wei-Shi Zheng

Fast Distributed Coordinate Descent for Non-Strongly Convex Losses

We propose an efficient distributed randomized coordinate descent method for minimizing regularized non-strongly convex loss functions. The method attains the optimal $O(1/k^2)$ convergence rate, where $k$ is the iteration counter. The core…

Optimization and Control · Mathematics 2014-07-29 Olivier Fercoq , Zheng Qu , Peter Richtárik , Martin Takáč

Exact Coordinate Descent for High-Dimensional Regularized Huber Regression

We develop an exact coordinate descent algorithm for high-dimensional regularized Huber regression. In contrast to composite gradient descent methods, our algorithm fully exploits the advantages of coordinate descent when the underlying…

Methodology · Statistics 2025-10-16 Younghoon Kim , Po-Ling Loh , Sumanta Basu

Coordinate Descent Algorithms

Coordinate descent algorithms solve optimization problems by successively performing approximate minimization along coordinate directions or coordinate hyperplanes. They have been used in applications for many years, and their popularity…

Optimization and Control · Mathematics 2015-02-18 Stephen J. Wright

Multi-Tier Federated Learning for Vertically Partitioned Data

We consider decentralized model training in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical…

Machine Learning · Computer Science 2021-02-09 Anirban Das , Stacy Patterson

Efficient Stochastic Gradient Descent for Learning with Distributionally Robust Optimization

Distributionally robust optimization (DRO) problems are increasingly seen as a viable method to train machine learning models for improved model generalization. These min-max formulations, however, are more difficult to solve. We therefore…

Machine Learning · Statistics 2020-11-03 Soumyadip Ghosh , Mark Squillante , Ebisa Wollega

Hydra: A Peer to Peer Distributed Training & Data Collection Framework

The world needs diverse and unbiased data to train deep learning models. Currently data comes from a variety of sources that are unmoderated to a large extent. The outcomes of training neural networks with unverified data yields biased…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-11-27 Vaibhav Mathur , Karanbir Chahal

Large Scale Kernel Learning using Block Coordinate Descent

We demonstrate that distributed block coordinate descent can quickly solve kernel regression and classification problems with millions of data points. Armed with this capability, we conduct a thorough comparison between the full kernel, the…

Machine Learning · Computer Science 2016-02-18 Stephen Tu , Rebecca Roelofs , Shivaram Venkataraman , Benjamin Recht

Cross-Silo Federated Learning for Multi-Tier Networks with Vertical and Horizontal Data Partitioning

We consider federated learning in tiered communication networks. Our network model consists of a set of silos, each holding a vertical partition of the data. Each silo contains a hub and a set of clients, with the silo's vertical data shard…

Machine Learning · Computer Science 2024-04-26 Anirban Das , Timothy Castiglia , Shiqiang Wang , Stacy Patterson

A block coordinate descent optimizer for classification problems exploiting convexity

Second-order optimizers hold intriguing potential for deep learning, but suffer from increased cost and sensitivity to the non-convexity of the loss surface as compared to gradient-based approaches. We introduce a coordinate descent method…

Machine Learning · Computer Science 2020-06-19 Ravi G. Patel , Nathaniel A. Trask , Mamikon A. Gulian , Eric C. Cyr

Towards Solving Large-scale Expensive Optimization Problems Efficiently Using Coordinate Descent Algorithm

Many real-world problems are categorized as large-scale problems, and metaheuristic algorithms as an alternative method to solve large-scale problem; they need the evaluation of many candidate solutions to tackle them prior to their…

Neural and Evolutionary Computing · Computer Science 2020-09-14 Shahryar Rahnamayan , Seyed Jalaleddin Mousavirad

Hybrid-DCA: A Double Asynchronous Approach for Stochastic Dual Coordinate Ascent

In prior works, stochastic dual coordinate ascent (SDCA) has been parallelized in a multi-core environment where the cores communicate through shared memory, or in a multi-processor distributed memory environment where the processors…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-03 Soumitra Pal , Tingyang Xu , Tianbao Yang , Sanguthevar Rajasekaran , Jinbo Bi

Consensus Driven Learning

As the complexity of our neural network models grow, so too do the data and computation requirements for successful training. One proposed solution to this problem is training on a distributed network of computational devices, thus…

Machine Learning · Computer Science 2020-05-22 Kyle Crandall , Dustin Webb

Parallel Coordinate Descent Methods for Big Data Optimization

In this work we show that randomized (block) coordinate descent methods can be accelerated by parallelization when applied to the problem of minimizing the sum of a partially separable smooth convex function and a simple separable convex…

Optimization and Control · Mathematics 2013-11-27 Peter Richtárik , Martin Takáč

Cogradient Descent for Dependable Learning

Conventional gradient descent methods compute the gradients for multiple variables through the partial derivative. Treating the coupled variables independently while ignoring the interaction, however, leads to an insufficient optimization…

Machine Learning · Computer Science 2021-06-22 Runqi Wang , Baochang Zhang , Li'an Zhuo , Qixiang Ye , David Doermann

An efficient distributed learning algorithm based on effective local functional approximations

Scalable machine learning over big data is an important problem that is receiving a lot of attention in recent years. On popular distributed environments such as Hadoop running on a cluster of commodity machines, communication costs are…

Machine Learning · Computer Science 2015-03-18 Dhruv Mahajan , Nikunj Agrawal , S. Sathiya Keerthi , S. Sundararajan , Leon Bottou

Machine learning approach to chance-constrained problems: An algorithm based on the stochastic gradient descent

We consider chance-constrained problems with discrete random distribution. We aim for problems with a large number of scenarios. We propose a novel method based on the stochastic gradient descent method which performs updates of the…

Optimization and Control · Mathematics 2019-05-28 Lukáš Adam , Martin Branda