English
Related papers

Related papers: Hyper-Scalable JSQ with Sparse Feedback

200 papers

The basic load balancing scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one of $N$ single-server queues. We discuss recent advances on scalable load balancing schemes which provide favorable…

We present an overview of scalable load balancing algorithms which provide favorable delay performance in large-scale systems, and yet only require minimal implementation overhead. Aimed at a broad audience, the paper starts with an…

Performance · Computer Science 2017-12-25 Mark van der Boor , Sem C. Borst , Johan S. H. van Leeuwaarden , Debankur Mukherjee

Nowadays, the efficiency and even the feasibility of traditional load-balancing policies are challenged by the rapid growth of cloud infrastructure and the increasing levels of server heterogeneity. In such heterogeneous systems with many…

Networking and Internet Architecture · Computer Science 2020-03-10 Shay Vargaftik , Isaac Keslassy , Ariel Orda

Load balancing algorithms play a crucial role in delivering robust application performance in data centers and cloud networks. Recently, strong interest has emerged in Join-the-Idle-Queue (JIQ) algorithms, which rely on tokens issued by…

Probability · Mathematics 2017-06-06 Mark van der Boor , Sem Borst , Johan van Leeuwaarden

The Join-the-Shortest-Queue (JSQ) load-balancing scheme is known to minimise the average delay of jobs in homogeneous systems consisting of identical servers. However, it performs poorly in heterogeneous systems where servers have different…

Probability · Mathematics 2022-10-04 Sanidhay Bhambay , Arpan Mukhopadhyay

We consider a system of $N$ parallel single-server queues with unit exponential service rates and a single dispatcher where tasks arrive as a Poisson process of rate $\lambda(N)$. When a task arrives, the dispatcher assigns it to a server…

Load balancing across parallel servers is an important class of congestion control problems that arises in service systems. An effective load balancer relies heavily on accurate, real-time congestion information to make routing decisions.…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-24 Gal Mendelson , Xu Kuang

Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such as cloud networks and data centers. A fundamental challenge in the design of load balancing algorithms is to achieve an optimal trade-off…

Performance · Computer Science 2020-12-16 Mark van der Boor , Sem Borst , Johan van Leeuwaarden

We consider a system of $N$ parallel queues with identical exponential service rates and a single dispatcher where tasks arrive as a Poisson process. When a task arrives, the dispatcher always assigns it to an idle server, if there is any,…

Probability · Mathematics 2016-12-14 D. Mukherjee , S. C. Borst , J. S. H. van Leeuwaarden , P. A. Whiting

The Join-the-Shortest-Queue (JSQ) load balancing scheme is known to minimise the average response time of jobs in homogeneous systems with identical servers. However, for {\em heterogeneous} systems with servers having different processing…

Probability · Mathematics 2024-10-22 Sanidhay Bhambay , Burak Büke , Arpan Mukhopadhyay

A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent user-perceived performance in the presence of uncertain and…

Probability · Mathematics 2017-06-23 Debankur Mukherjee , Souvik Dhara , Sem Borst , Johan S. H. van Leeuwaarden

Recent years have seen a great increase in the capacity and parallel processing power of data centers and cloud services. To fully utilize the said distributed systems, optimal load balancing for parallel queuing architectures must be…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-10 Anam Tahir , Kai Cui , Heinz Koeppl

Scalable load balancing algorithms are of great interest in cloud networks and data centers, necessitating the use of tractable techniques to compute optimal load balancing policies for good performance. However, most existing scalable…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-25 Anam Tahir , Kai Cui , Heinz Koeppl

We study the steady-state delay performance of load balancing in large-scale systems with heterogeneous servers in the heavy-traffic regimes. The system consists of $N$ servers, each with a local buffer of size $b-1$, serving jobs in the…

Probability · Mathematics 2026-02-27 Xin Liu , Lei Ying

Join-the-shortest queue (JSQ) and its variants have often been used in solving load balancing problems. The aim of such policies is to minimize the average system occupation, e.g., the customer's system time. In this work we extend the…

Networking and Internet Architecture · Computer Science 2025-02-11 Andrea Fox , Francesco De Pellegrini , Eitan Altman , Arnob Ghosh , Ness Shroff

Load balancing plays a crucial role in many large scale systems. Several different load balancing principles have been proposed in the literature, such as Join-Shortest-Queue (JSQ) and its variations, or Join-Below-Threshold. We provide a…

Probability · Mathematics 2023-07-11 Illés Horváth , Márton Mészáros

We analyse Join-the-Shortest-Queue in a contemporary scaling regime known as the Non-Degenerate Slowdown regime. Join-the-Shortest-Queue (JSQ) is a classical load balancing policy for queueing systems with multiple parallel servers.…

Probability · Mathematics 2018-02-13 Varun Gupta , Neil Walton

This paper considers the steady-state performance of load balancing algorithms in a many-server system with distributed queues. The system has $N$ servers, and each server maintains a local queue with buffer size $b-1,$ i.e. a server can…

Probability · Mathematics 2019-12-30 Xin Liu , Lei Ying

We consider a system of $N$ identical server pools and a single dispatcher where tasks arrive as a Poisson process of rate $\lambda(N)$. Arriving tasks cannot be queued, and must immediately be assigned to one of the server pools to start…

The Join-the-Shortest-Queue (JSQ) policy is among the most widely used load balancing algorithms and has been extensively studied. However, an exact characterization of the system behavior remains challenging. Most prior research has…

Probability · Mathematics 2026-02-12 Yuanzhe Ma , Siva Theja Maguluri
‹ Prev 1 2 3 10 Next ›