Related papers: A Communication-Efficient Distributed Algorithm fo…

Heterogeneity-aware and communication-efficient distributed statistical inference

In multicenter research, individual-level data are often protected against sharing across sites. To overcome the barrier of data sharing, many distributed algorithms, which only require sharing aggregated information, have been developed.…

Methodology · Statistics 2021-03-25 Rui Duan , Yang Ning , Yong Chen

Communication-Efficient Large-Scale Distributed Deep Learning: A Comprehensive Survey

With the rapid growth in the volume of data sets, models, and devices in the domain of deep learning, there is increasing attention on large-scale distributed deep learning. In contrast to traditional distributed deep learning, the…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-10 Feng Liang , Zhen Zhang , Haifeng Lu , Victor C. M. Leung , Yanyi Guo , Xiping Hu

Privacy Preserving Analytics on Distributed Medical Data

Objective: To enable privacy-preserving learning of high quality generative and discriminative machine learning models from distributed electronic health records. Methods and Results: We describe general and scalable strategy to build…

Cryptography and Security · Computer Science 2018-06-19 Marina Blanton , Ah Reum Kang , Subhadeep Karan , Jaroslaw Zola

Towards Efficient Scheduling of Federated Mobile Devices under Computational and Statistical Heterogeneity

Originated from distributed learning, federated learning enables privacy-preserved collaboration on a new abstracted level by sharing the model parameters only. While the current research mainly focuses on optimizing learning algorithms and…

Machine Learning · Computer Science 2020-09-17 Cong Wang , Yuanyuan Yang , Pengzhan Zhou

A Framework for Hybrid Collective Inference in Distributed Sensor Networks

With the ever-increasing range of applications of Internet in Things (IoT) and sensor networks, challenges are emerging in various categories of classification tasks. Applications such as vehicular networking, UAV swarm coordination and…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-01 Andrew Nash , Dirk Pesch , Krishnendu Guha

CEDAR: Communication Efficient Distributed Analysis for Regressions

Electronic health records (EHRs) offer great promises for advancing precision medicine and, at the same time, present significant analytical challenges. Particularly, it is often the case that patient-level data in EHRs cannot be shared…

Methodology · Statistics 2022-07-04 Changgee Chang , Zhiqi Bu , Qi Long

Communication-Efficient Distributed Estimation and Inference for Cox's Model

Motivated by multi-center biomedical studies that cannot share individual data due to privacy and ownership concerns, we develop communication-efficient iterative distributed algorithms for estimation and inference in the high-dimensional…

Methodology · Statistics 2024-06-25 Pierre Bayle , Jianqing Fan , Zhipeng Lou

Collaborative causal inference with a distributed data-sharing management

Data sharing barriers are paramount challenges arising from multicenter clinical trials where multiple data sources are stored in a distributed fashion at different local study sites. Merging such data sources into a common data storage for…

Methodology · Statistics 2022-04-05 Mengtong Hu , Xu Shi , Peter X. -K. Song

A communication efficient distributed learning framework for smart environments

Due to the pervasive diffusion of personal mobile and IoT devices, many ``smart environments'' (e.g., smart cities and smart factories) will be, among others, generators of huge amounts of data. Currently, this is typically achieved through…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-09-28 Lorenzo Valerio , Andrea Passarella , Marco Conti

Learning over inherently distributed data

The recent decades have seen a surge of interests in distributed computing. Existing work focus primarily on either distributed computing platforms, data query tools, or, algorithms to divide big data and conquer at individual machines etc.…

Machine Learning · Statistics 2019-08-01 Donghui Yan , Ying Xu

Distributed Estimation and Learning over Heterogeneous Networks

We consider several estimation and learning problems that networked agents face when making decisions given their uncertainty about an unknown variable. Our methods are designed to efficiently deal with heterogeneity in both size and…

Applications · Statistics 2016-11-11 M. Amin Rahimian , Ali Jadbabaie

Distributed Dictionary Learning

The paper studies distributed Dictionary Learning (DL) problems where the learning task is distributed over a multi-agent network with time-varying (nonsymmetric) connectivity. This formulation is relevant, for instance, in big-data…

Optimization and Control · Mathematics 2016-12-23 Amir Daneshmand , Gesualdo Scutari , Francisco Facchinei

Delay-Tolerant Augmented-Consensus-based Distributed Directed Optimization

Distributed optimization finds applications in large-scale machine learning, data processing and classification over multi-agent networks. In real-world scenarios, the communication network of agents may encounter latency that may affect…

Systems and Control · Electrical Eng. & Systems 2025-10-06 Mohammadreza Doostmohammadian , Narahari Kasagatta Ramesh , Alireza Aghasi

A Comparative Study of Federated Learning Aggregation Strategies under Homogeneous and Heterogeneous Data Distributions

Federated Learning has emerged as a transformative paradigm for collaborative machine learning across distributed environments. However, its performance is strongly influenced by the aggregation strategy used to combine local model updates…

Machine Learning · Computer Science 2026-05-13 Antonios Makris , Christos Dousis , Emmanouil Kritharakis , Stavros Bouras , Konstantinos Tserpes

Communication-Efficient and Interoperable Distributed Learning

Collaborative learning across heterogeneous model architectures presents significant challenges in ensuring interoperability and preserving privacy. We propose a communication-efficient distributed learning framework that supports model…

Machine Learning · Computer Science 2025-09-30 Mounssif Krouka , Mehdi Bennis

Federated Causal Inference in Heterogeneous Observational Data

We are interested in estimating the effect of a treatment applied to individuals at multiple sites, where data is stored locally for each site. Due to privacy constraints, individual-level data cannot be shared across sites; the sites may…

Machine Learning · Computer Science 2023-04-04 Ruoxuan Xiong , Allison Koenecke , Michael Powell , Zhu Shen , Joshua T. Vogelstein , Susan Athey

Distributed Parameter Estimation via Pseudo-likelihood

Estimating statistical models within sensor networks requires distributed algorithms, in which both data and computation are distributed across the nodes of the network. We propose a general approach for distributed learning based on…

Machine Learning · Computer Science 2012-07-03 Qiang Liu , Alexander Ihler

Distributed Online Big Data Classification Using Context Information

Distributed, online data mining systems have emerged as a result of applications requiring analysis of large amounts of correlated and high-dimensional data produced by multiple distributed data sources. We propose a distributed online data…

Machine Learning · Computer Science 2013-07-03 Cem Tekin , Mihaela van der Schaar

Federated Learning in Distributed Medical Databases: Meta-Analysis of Large-Scale Subcortical Brain Data

At this moment, databanks worldwide contain brain images of previously unimaginable numbers. Combined with developments in data science, these massive data provide the potential to better understand the genetic underpinnings of brain…

Machine Learning · Statistics 2025-01-30 Santiago Silva , Boris Gutman , Eduardo Romero , Paul M Thompson , Andre Altmann , Marco Lorenzi

On a Distributed Approach for Density-based Clustering

Efficient extraction of useful knowledge from these data is still a challenge, mainly when the data is distributed, heterogeneous and of different quality depending on its corresponding local infrastructure. To reduce the overhead cost,…

Databases · Computer Science 2017-04-17 Nhien-An Le-Khac , M-Tahar Kechadi