Related papers: Optimizing the Optimal Weighted Average: Efficient…

A Distributed Frank-Wolfe Algorithm for Communication-Efficient Sparse Learning

Learning sparse combinations is a frequent theme in machine learning. In this paper, we study its associated optimization problem in the distributed setting where the elements to be combined are not centrally located but spread over a…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-25 Aurélien Bellet, Yingyu Liang, Alireza Bagheri Garakani, Maria-Florina Balcan, Fei Sha

High-Dimensional Distributed Sparse Classification with Scalable Communication-Efficient Global Updates

As the size of datasets used in statistical learning continues to grow, distributed training of models has attracted increasing attention. These methods partition the data and exploit parallelism to reduce memory and runtime, but suffer…

Machine Learning · Computer Science 2024-07-10 Fred Lu, Ryan R. Curtin, Edward Raff, Francis Ferraro, James Holt

Distributed Learning with Sparse Communications by Identification

In distributed optimization for large-scale learning, a major performance limitation comes from the communications between the different entities. When computations are performed by workers on local data while a coordinator machine…

Optimization and Control · Mathematics 2020-06-26 Dmitry Grishchenko, Franck Iutzeler, Jérôme Malick, Massih-Reza Amini

CoCoA: A General Framework for Communication-Efficient Distributed Optimization

The scale of modern datasets necessitates the development of efficient distributed optimization methods for machine learning. We present a general-purpose framework for distributed computing environments, CoCoA, that has an efficient…

Machine Learning · Computer Science 2018-10-11 Virginia Smith, Simone Forte, Chenxin Ma, Martin Takac, Michael I. Jordan, Martin Jaggi

Communication-efficient distributed eigenspace estimation

Distributed computing is a standard way to scale up machine learning and data science algorithms to process large amounts of data. In such settings, avoiding communication amongst machines is paramount for achieving high performance. Rather…

Machine Learning · Statistics 2021-05-04 Vasileios Charisopoulos, Austin R. Benson, Anil Damle

Communication-Efficient Distributed Optimization of Self-Concordant Empirical Loss

We consider distributed convex optimization problems originated from sample average approximation of stochastic optimization, or empirical risk minimization in machine learning. We assume that each machine in the distributed computing…

Optimization and Control · Mathematics 2015-01-05 Yuchen Zhang, Lin Xiao

Debiased distributed learning for sparse partial linear models in high dimensions

Although various distributed machine learning schemes have been proposed recently for pure linear models and fully nonparametric models, little attention has been paid to distributed optimization for semi-parametric models with…

Machine Learning · Statistics 2019-11-05 Shaogao Lv, Heng Lian

Trainable Weight Averaging: Accelerating Training and Improving Generalization

Weight averaging is a widely used technique for accelerating training and improving the generalization of deep neural networks (DNNs). While existing approaches like stochastic weight averaging (SWA) rely on pre-set weighting schemes, they…

Machine Learning · Computer Science 2025-02-11 Tao Li, Zhehao Huang, Yingwen Wu, Zhengbao He, Qinghua Tao, Xiaolin Huang, Chih-Jen Lin

Communication Lower Bounds for Statistical Estimation Problems via a Distributed Data Processing Inequality

We study the tradeoff between the statistical error and communication cost of distributed statistical estimation problems in high dimensions. In the distributed sparse Gaussian mean estimation problem, each of the $m$ machines receives $n$…

Machine Learning · Computer Science 2016-05-11 Mark Braverman, Ankit Garg, Tengyu Ma, Huy L. Nguyen, David P. Woodruff

Adding vs. Averaging in Distributed Primal-Dual Optimization

Distributed optimization methods for large-scale machine learning suffer from a communication bottleneck. It is difficult to reduce this bottleneck while still efficiently and accurately aggregating partial work from different machines. In…

Machine Learning · Computer Science 2015-07-06 Chenxin Ma, Virginia Smith, Martin Jaggi, Michael I. Jordan, Peter Richtárik, Martin Takáč

Accuracy Analysis for Distributed Weighted Least-Squares Estimation in Finite Steps and Loopy Networks

Distributed parameter estimation for large-scale systems is an active research problem. The goal is to derive a distributed algorithm in which each agent obtains a local estimate of its own subset of the global parameter vector, based on…

Multiagent Systems · Computer Science 2018-06-26 Tianju Sui, Damián Marelli, Minyue Fu, Renquan Lu

Distributed Sparse Linear Regression under Communication Constraints

In multiple domains, statistical tasks are performed in distributed settings, with data split among several end machines that are connected to a fusion center. In various applications, the end machines have limited bandwidth and power, and…

Machine Learning · Computer Science 2026-01-05 Rodney Fonseca, Boaz Nadler

Efficient Protocols for Distributed Classification and Optimization

In distributed learning, the goal is to perform a learning task over data distributed across multiple nodes with minimal (expensive) communication. Prior work (Daume III et al., 2012) proposes a general model that bounds the communication…

Machine Learning · Computer Science 2012-04-17 Hal Daume, Jeff M. Phillips, Avishek Saha, Suresh Venkatasubramanian

Gradient Sparsification for Communication-Efficient Distributed Optimization

Modern large scale machine learning applications require stochastic optimization algorithms to be implemented on distributed computational architectures. A key bottleneck is the communication overhead for exchanging information such as…

Machine Learning · Computer Science 2017-10-31 Jianqiao Wangni, Jialei Wang, Ji Liu, Tong Zhang

Linearly Convergent Algorithm with Variance Reduction for Distributed Stochastic Optimization

This paper considers a distributed stochastic strongly convex optimization, where agents connected over a network aim to cooperatively minimize the average of all agents' local cost functions. Due to the stochasticity of gradient estimation…

Optimization and Control · Mathematics 2020-02-17 Jinlong Lei, Peng Yi, Jie Chen, Yiguang Hong

Efficient Distributed Estimation of Inverse Covariance Matrices

In distributed systems, communication is a major concern due to issues such as its vulnerability or efficiency. In this paper, we are interested in estimating sparse inverse covariance matrices when samples are distributed into different…

Methodology · Statistics 2016-10-04 Jesús Arroyo, Elizabeth Hou

Communication-efficient Distributed Sparse Linear Discriminant Analysis

We propose a communication-efficient distributed estimation method for sparse linear discriminant analysis (LDA) in the high dimensional regime. Our method distributes the data of size $N$ into $m$ machines, and estimates a local sparse LDA…

Machine Learning · Statistics 2016-10-18 Lu Tian, Quanquan Gu

Distributed Logistic Regression for Massive Data with Rare Events

Large-scale rare events data are commonly encountered in practice. To tackle the massive rare events data, we propose a novel distributed estimation method for logistic regression in a distributed system. For a distributed framework, we…

Methodology · Statistics 2023-04-06 Xuetong Li, Xuening Zhu, Hansheng Wang

Locally Convex Sparse Learning over Networks

We consider a distributed learning setup where a sparse signal is estimated over a network. Our main interest is to save communication resource for information exchange over the network and reduce processing time. Each node of the network…

Machine Learning · Statistics 2018-04-03 Ahmed Zaki, Saikat Chatterjee, Partha P. Mitra, Lars K. Rasmussen

An efficient distributed learning algorithm based on effective local functional approximations

Scalable machine learning over big data is an important problem that has received a lot of attention in recent years. On popular distributed environments such as Hadoop running on a cluster of commodity machines, communication costs are…

Machine Learning · Computer Science 2015-03-18 Dhruv Mahajan, Nikunj Agrawal, S. Sathiya Keerthi, S. Sundararajan, Leon Bottou