English

Optimistic Concurrency Control for Distributed Unsupervised Learning

Machine Learning 2013-07-31 v1 Artificial Intelligence Distributed, Parallel, and Cluster Computing

Abstract

Research on distributed machine learning algorithms has focused primarily on one of two extremes - algorithms that obey strict concurrency constraints or algorithms that obey few or no such constraints. We consider an intermediate alternative in which algorithms optimistically assume that conflicts are unlikely and if conflicts do arise a conflict-resolution protocol is invoked. We view this "optimistic concurrency control" paradigm as particularly appropriate for large-scale machine learning algorithms, particularly in the unsupervised setting. We demonstrate our approach in three problem areas: clustering, feature learning and online facility location. We evaluate our methods via large-scale experiments in a cluster computing environment.

Keywords

Cite

@article{arxiv.1307.8049,
  title  = {Optimistic Concurrency Control for Distributed Unsupervised Learning},
  author = {Xinghao Pan and Joseph E. Gonzalez and Stefanie Jegelka and Tamara Broderick and Michael I. Jordan},
  journal= {arXiv preprint arXiv:1307.8049},
  year   = {2013}
}

Comments

25 pages, 5 figures

R2 v1 2026-06-22T01:00:36.992Z