A Communication Efficient Collaborative Learning Framework for Distributed Features
Abstract
We introduce a collaborative learning framework allowing multiple parties having different sets of attributes about the same user to jointly build models without exposing their raw data or model parameters. In particular, we propose a Federated Stochastic Block Coordinate Descent (FedBCD) algorithm, in which each party conducts multiple local updates before each communication to effectively reduce the number of communication rounds among parties, a principal bottleneck for collaborative learning problems. We analyze theoretically the impact of the number of local updates and show that when the batch size, sample size, and the local iterations are selected appropriately, within iterations, the algorithm performs communication rounds and achieves some accuracy (measured by the average of the gradient norm squared). The approach is supported by our empirical evaluations on a variety of tasks and datasets, demonstrating advantages over stochastic gradient descent (SGD) approaches.
Cite
@article{arxiv.1912.11187,
title = {A Communication Efficient Collaborative Learning Framework for Distributed Features},
author = {Yang Liu and Yan Kang and Xinwei Zhang and Liping Li and Yong Cheng and Tianjian Chen and Mingyi Hong and Qiang Yang},
journal= {arXiv preprint arXiv:1912.11187},
year = {2020}
}
Comments
This paper is published at the 2nd International Workshop on Federated Learning for Data Privacy and Confidentiality, in Conjunction with NeurIPS 2019 (FL-NeurIPS 19): https://nips.cc/Conferences/2019/ScheduleMultitrack?event=13202