English

Flexible Queueing Architectures

Probability 2017-02-07 v2 Networking and Internet Architecture Performance

Abstract

We study a multi-server model with nn flexible servers and nn queues, connected through a bipartite graph, where the level of flexibility is captured by the graph's average degree, dnd_n. Applications in content replication in data centers, skill-based routing in call centers, and flexible supply chains are among our main motivations. We focus on the scaling regime where the system size nn tends to infinity, while the overall traffic intensity stays fixed. We show that a large capacity region and an asymptotically vanishing queueing delay are simultaneously achievable even under limited flexibility (dnnd_n \ll n). Our main results demonstrate that, when dnlnnd_n\gg \ln n, a family of expander-graph-based flexibility architectures has a capacity region that is within a constant factor of the maximum possible, while simultaneously ensuring a diminishing queueing delay for all arrival rate vectors in the capacity region. Our analysis is centered around a new class of virtual-queue-based scheduling policies that rely on dynamically constructed job-to-server assignments on the connectivity graph. For comparison, we also analyze a natural family of modular architectures, which is simpler but has provably weaker performance.

Keywords

Cite

@article{arxiv.1505.07648,
  title  = {Flexible Queueing Architectures},
  author = {John N. Tsitsiklis and Kuang Xu},
  journal= {arXiv preprint arXiv:1505.07648},
  year   = {2017}
}

Comments

Revised October 2016. A preliminary version of this paper appeared at the 2013 ACM Sigmetrics conference; the performance of the architectures proposed in the current paper is significantly better than the one in the conference version

R2 v1 2026-06-22T09:43:02.833Z