Related papers: Communication Efficient Distributed Agnostic Boost…

Breathe before Speaking: Efficient Information Dissemination Despite Noisy, Limited and Anonymous Communication

Distributed computing models typically assume reliable communication between processors. While such assumptions often hold for engineered networks, e.g., due to underlying error correction protocols, their relevance to biological systems,…

Distributed, Parallel, and Cluster Computing · Computer Science · 2015-06-29 · Ofer Feinerman, Bernhard Haeupler, Amos Korman

Revisiting Agnostic Boosting

Boosting is a key method in statistical learning, allowing for converting weak learners into strong ones. While well studied in the realizable case, the statistical properties of weak-to-strong learning remain less understood in the…

Machine Learning · Computer Science · 2026-01-01 · Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice, Yuxin Sun

A Resilient Distributed Boosting Algorithm

Given a learning task where the data is distributed among several parties, communication is one of the fundamental resources which the parties would like to minimize. We present a distributed boosting algorithm which is resilient to a…

Machine Learning · Computer Science · 2022-06-14 · Yuval Filmus, Idan Mehalel, Shay Moran

Communication-Aware Collaborative Learning

Algorithms for noiseless collaborative PAC learning have been analyzed and optimized in recent years with respect to sample complexity. In this paper, we study collaborative PAC learning with the goal of reducing communication cost at…

Machine Learning · Computer Science · 2020-12-22 · Avrim Blum, Shelby Heinecke, Lev Reyzin

Smoothed Agnostic Learning of Halfspaces over the Hypercube

Agnostic learning of Boolean halfspaces is a fundamental problem in computational learning theory, but it is known to be computationally hard even for weak learning. Recent work [CKKMK24] proposed smoothed analysis as a way to bypass such…

Machine Learning · Computer Science · 2025-11-25 · Yiwen Kou, Raghu Meka

Sample-Efficient Agnostic Boosting

The theory of boosting provides a computational framework for aggregating approximate weak learning algorithms, which perform marginally better than a random predictor, into an accurate strong learner. In the realizable case, the success of…

Machine Learning · Computer Science · 2024-11-01 · Udaya Ghai, Karan Singh

Sample-Optimal Agnostic Boosting with Unlabeled Data

Boosting provides a practical and provably effective framework for constructing accurate learning algorithms from inaccurate rules of thumb. It extends the promise of sample-efficient learning to settings where direct Empirical Risk…

Machine Learning · Computer Science · 2025-03-07 · Udaya Ghai, Karan Singh

Communication-efficient Variance-reduced Stochastic Gradient Descent

We consider the problem of communication efficient distributed optimization where multiple nodes exchange important algorithm information in every iteration to solve large problems. In particular, we focus on the stochastic variance-reduced…

Machine Learning · Computer Science · 2020-03-16 · Hossein S. Ghadikolaei, Sindri Magnusson

Boosting Classifiers with Noisy Inference

We present a principled framework to address resource allocation for realizing boosting algorithms on substrates with communication or computation noise. Boosting classifiers (e.g., AdaBoost) make a final decision via a weighted vote from…

Machine Learning · Computer Science · 2020-10-28 · Yongjune Kim, Yuval Cassuto, Lav R. Varshney

Sample-Near-Optimal Agnostic Boosting with Improved Running Time

Boosting is a powerful method that turns weak learners, which perform only slightly better than random guessing, into strong learners with high accuracy. While boosting is well understood in the classic setting, it is less so in the…

Machine Learning · Computer Science · 2026-02-04 · Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice

Learning to Abstain From Uninformative Data

Learning and decision-making in domains with naturally high noise-to-signal ratio, such as Finance or Healthcare, is often challenging, while the stakes are very high. In this paper, we study the problem of learning and acting under a…

Machine Learning · Computer Science · 2023-09-26 · Yikai Zhang, Songzhu Zheng, Mina Dalirrooyfard, Pengxiang Wu, Anderson Schneider, Anant Raj, Yuriy Nevmyvaka, Chao Chen

Communication-efficient Distributed Cooperative Learning with Compressed Beliefs

We study the problem of distributed cooperative learning, where a group of agents seeks to agree on a set of hypotheses that best describes a sequence of private observations. In the scenario where the set of hypotheses is large, we propose…

Machine Learning · Computer Science · 2021-09-22 · Mohammad Taha Toghani, César A. Uribe

Accelerated Methods with Compressed Communications for Distributed Optimization Problems under Data Similarity

In recent years, as data and problem sizes have increased, distributed learning has become an essential tool for training high-performance models. However, the communication bottleneck, especially for high-dimensional data, is a challenge.…

Optimization and Control · Mathematics · 2025-04-28 · Dmitry Bylinkin, Aleksandr Beznosikov

Distributed Learning, Communication Complexity and Privacy

We consider the problem of PAC-learning from distributed data and analyze fundamental communication complexity questions involved. We provide general upper and lower bounds on the amount of communication needed to learn well, showing that…

Machine Learning · Computer Science · 2012-05-28 · Maria-Florina Balcan, Avrim Blum, Shai Fine, Yishay Mansour

Communication-Efficient Agnostic Federated Averaging

In distributed learning settings such as federated learning, the training algorithm can be potentially biased towards different clients. Mohri et al. (2019) proposed a domain-agnostic learning algorithm, where the model is optimized for any…

Machine Learning · Computer Science · 2021-06-17 · Jae Ro, Mingqing Chen, Rajiv Mathews, Mehryar Mohri, Ananda Theertha Suresh

Making Asynchronous Distributed Computations Robust to Noise

We consider the problem of making distributed computations robust to noise, in particular to worst-case (adversarial) corruptions of messages. We give a general distributed interactive coding scheme which simulates any asynchronous…

Data Structures and Algorithms · Computer Science · 2017-02-27 · Keren Censor-Hillel, Ran Gelles, Bernhard Haeupler

Communication-Efficient Algorithms For Distributed Optimization

This thesis is concerned with the design of distributed algorithms for solving optimization problems. We consider networks where each node has exclusive access to a cost function, and design algorithms that make all nodes cooperate to find…

Optimization and Control · Mathematics · 2013-12-03 · João F. C. Mota

Protocols for Learning Classifiers on Distributed Data

We consider the problem of learning classifiers for labeled data that has been distributed across several nodes. Our goal is to find a single classifier, with small approximation error, across all datasets while minimizing the communication…

Machine Learning · Statistics · 2012-03-06 · Hal Daume, Jeff M. Phillips, Avishek Saha, Suresh Venkatasubramanian

Distributed Inference with Sparse and Quantized Communication

We consider the problem of distributed inference where agents in a network observe a stream of private signals generated by an unknown state, and aim to uniquely identify this state from a finite set of hypotheses. We focus on scenarios…

Systems and Control · Electrical Eng. & Systems · 2021-09-01 · Aritra Mitra, John A. Richards, Saurabh Bagchi, Shreyas Sundaram

Distributed Nonparametric Estimation under Communication Constraints

In the era of big data, it is necessary to split extremely large data sets across multiple computing nodes and construct estimators using the distributed data. When designing distributed estimators, it is desirable to minimize the amount of…

Statistics Theory · Mathematics · 2022-04-25 · Azeem Zaman, Botond Szabó