English

A Communication Efficient Collaborative Learning Framework for Distributed Features

Machine Learning 2020-08-03 v6 Artificial Intelligence Machine Learning

Abstract

We introduce a collaborative learning framework allowing multiple parties having different sets of attributes about the same user to jointly build models without exposing their raw data or model parameters. In particular, we propose a Federated Stochastic Block Coordinate Descent (FedBCD) algorithm, in which each party conducts multiple local updates before each communication to effectively reduce the number of communication rounds among parties, a principal bottleneck for collaborative learning problems. We analyze theoretically the impact of the number of local updates and show that when the batch size, sample size, and the local iterations are selected appropriately, within TT iterations, the algorithm performs O(T)\mathcal{O}(\sqrt{T}) communication rounds and achieves some O(1/T)\mathcal{O}(1/\sqrt{T}) accuracy (measured by the average of the gradient norm squared). The approach is supported by our empirical evaluations on a variety of tasks and datasets, demonstrating advantages over stochastic gradient descent (SGD) approaches.

Keywords

Cite

@article{arxiv.1912.11187,
  title  = {A Communication Efficient Collaborative Learning Framework for Distributed Features},
  author = {Yang Liu and Yan Kang and Xinwei Zhang and Liping Li and Yong Cheng and Tianjian Chen and Mingyi Hong and Qiang Yang},
  journal= {arXiv preprint arXiv:1912.11187},
  year   = {2020}
}

Comments

This paper is published at the 2nd International Workshop on Federated Learning for Data Privacy and Confidentiality, in Conjunction with NeurIPS 2019 (FL-NeurIPS 19): https://nips.cc/Conferences/2019/ScheduleMultitrack?event=13202

R2 v1 2026-06-23T12:55:21.336Z