Related papers: Optimal Hyper-Scalable Load Balancing with a Stric…

Scalable Load Balancing Algorithms in Networked Systems

A fundamental challenge in large-scale networked systems viz., data centers and cloud networks is to distribute tasks to a pool of servers, using minimal instantaneous state information, while providing excellent delay performance. In this…

Probability · Mathematics 2018-09-07 Debankur Mukherjee

Optimal Service Elasticity in Large-Scale Distributed Systems

A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent user-perceived performance in the presence of uncertain and…

Probability · Mathematics 2017-06-23 Debankur Mukherjee , Souvik Dhara , Sem Borst , Johan S. H. van Leeuwaarden

We Are Impatient: Algorithms for Geographically Distributed Load Balancing with (Almost) Arbitrary Load Functions

In geographically-distributed systems, communication latencies are non-negligible. The perceived processing time of a request is thus composed of the time needed to route the request to the server and the true processing time. Once a…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-11 Piotr Skowron , Krzysztof Rzadca

Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems

We consider a discrete-time parallel service system consisting of $n$ heterogeneous single server queues with infinite capacity. Jobs arrive to the system as an i.i.d. process with rate proportional to $n$, and must be immediately…

Performance · Computer Science 2025-05-30 Yishun Luo , Martin Zubeldia

Load Balancing Using Sparse Communication

Load balancing across parallel servers is an important class of congestion control problems that arises in service systems. An effective load balancer relies heavily on accurate, real-time congestion information to make routing decisions.…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-24 Gal Mendelson , Xu Kuang

Scalable load balancing in networked systems: A survey of recent advances

The basic load balancing scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one of $N$ single-server queues. We discuss recent advances on scalable load balancing schemes which provide favorable…

Probability · Mathematics 2022-08-12 Mark van der Boor , Sem C. Borst , Johan S. H. van Leeuwaarden , Debankur Mukherjee

Stability and Heavy-traffic Delay Optimality of General Load Balancing Policies in Heterogeneous Service Systems

We consider a load balancing system consisting of $n$ single-server queues working in parallel, with heterogeneous service rates. Jobs arrive to a central dispatcher, which has to dispatch them to one of the queues immediately upon arrival.…

Performance · Computer Science 2025-10-17 Yishun Luo , Martin Zubeldia

On Universal Scaling of Distributed Queues under Load Balancing

This paper considers the steady-state performance of load balancing algorithms in a many-server system with distributed queues. The system has $N$ servers, and each server maintains a local queue with buffer size $b-1,$ i.e. a server can…

Probability · Mathematics 2019-12-30 Xin Liu , Lei Ying

Throughput-Optimal Load Balancing for Intra Datacenter Networks

Traffic load-balancing in datacenters alleviates hot spots and improves network utilization. In this paper, a stable in-network load-balancing algorithm is developed in the setting of software-defined networking. A control plane configures…

Networking and Internet Architecture · Computer Science 2016-12-07 Sucha Supittayapornpong , Michael J. Neely

Self-Learning Threshold-Based Load Balancing

We consider a large-scale service system where incoming tasks have to be instantaneously dispatched to one out of many parallel server pools. The user-perceived performance degrades with the number of concurrent tasks and the dispatcher…

Performance · Computer Science 2023-09-12 Diego Goldsztajn , Sem C. Borst , Johan S. H. van Leeuwaarden , Debankur Mukherjee , Philip A. Whiting

Hyper-Scalable JSQ with Sparse Feedback

Load balancing algorithms play a vital role in enhancing performance in data centers and cloud networks. Due to the massive size of these systems, scalability challenges, and especially the communication overhead associated with load…

Probability · Mathematics 2019-03-07 Mark van der Boor , Sem Borst , Johan van Leeuwaarden

Scalable Load Balancing in the Presence of Heterogeneous Servers

Heterogeneity is becoming increasingly ubiquitous in modern large-scale computer systems. Developing good load balancing policies for systems whose resources have varying speeds is crucial in achieving low response times. Indeed, how best…

Performance · Computer Science 2020-06-26 Kristen Gardner , Jazeem Abdul Jaleel , Alexander Wickeham , Sherwin Doroudi

Sparse Mean Field Load Balancing in Large Localized Queueing Systems

Scalable load balancing algorithms are of great interest in cloud networks and data centers, necessitating the use of tractable techniques to compute optimal load balancing policies for good performance. However, most existing scalable…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-25 Anam Tahir , Kai Cui , Heinz Koeppl

Learning Mean-Field Control for Delayed Information Load Balancing in Large Queuing Systems

Recent years have seen a great increase in the capacity and parallel processing power of data centers and cloud services. To fully utilize the said distributed systems, optimal load balancing for parallel queuing architectures must be…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-10 Anam Tahir , Kai Cui , Heinz Koeppl

Balancing Work and Size with Bounded Buffers

We consider the fundamental problem of managing a bounded size queue buffer where traffic consists of packets of varying size, where each packet requires several rounds of processing before it can be transmitted from the queue buffer. The…

Networking and Internet Architecture · Computer Science 2013-09-06 Kirill Kogan , Alejandro Lopez-Ortiz , Sergey I. Nikolenko , Gabriel Scalosub , Michael Segal

Scalable Distributed Job Processing with Dynamic Load Balancing

We present here a cost effective framework for a robust scalable and distributed job processing system that adapts to the dynamic computing needs easily with efficient load balancing for heterogeneous systems. The design is such that each…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-06-07 Putti Srinivasrao , V. P. C. Rao , A. Govardhan , Ambika Prasad Mohanty

Scalable Load Balancing in Networked Systems: Universality Properties and Stochastic Coupling Methods

We present an overview of scalable load balancing algorithms which provide favorable delay performance in large-scale systems, and yet only require minimal implementation overhead. Aimed at a broad audience, the paper starts with an…

Performance · Computer Science 2017-12-25 Mark van der Boor , Sem C. Borst , Johan S. H. van Leeuwaarden , Debankur Mukherjee

Performance of Load Balancers with Bounded Maximum Queue Length in case of Non-Exponential Job Sizes

In large-scale distributed systems, balancing the load in an efficient way is crucial in order to achieve low latency. Recently, some load balancing policies have been suggested which are able to achieve a bounded maximum queue length in…

Performance · Computer Science 2022-01-12 Tim Hellemans , Grzegorz Kielanski , Benny Van Houdt

Stochastic Coordination in Heterogeneous Load Balancing Systems

Current-day data centers and high-volume cloud services employ a broad set of heterogeneous servers. In such settings, client requests typically arrive at multiple entry points, and dispatching them to servers is an urgent distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-27 Guy Goren , Shay Vargaftik , Yoram Moses

Delay-Optimal Scheduling for Queueing Systems with Switching Overhead

We study the scheduling polices for asymptotically optimal delay in queueing systems with switching overhead. Such systems consist of a single server that serves multiple queues, and some capacity is lost whenever the server switches to…

Performance · Computer Science 2017-01-17 Ping-Chun Hsieh , I-Hong Hou , Xi Liu