Related papers: Self-Learning Threshold-Based Load Balancing

Scalable Load Balancing Algorithms in Networked Systems

A fundamental challenge in large-scale networked systems viz., data centers and cloud networks is to distribute tasks to a pool of servers, using minimal instantaneous state information, while providing excellent delay performance. In this…

Probability · Mathematics 2018-09-07 Debankur Mukherjee

Learning and balancing unknown loads in large-scale systems

Consider a system of identical server pools where tasks with exponentially distributed service times arrive as a time-inhomogenenous Poisson process. An admission threshold is used in an inner control loop to assign incoming tasks to server…

Probability · Mathematics 2024-04-08 Diego Goldsztajn , Sem C. Borst , Johan S. H. van Leeuwaarden

Threshold-based rerouting and replication for resolving job-server affinity relations

We consider a system with several job types and two parallel server pools. Within the pools the servers are homogeneous, but across pools possibly not in the sense that the service speed of a job may depend on its type as well as the server…

Performance · Computer Science 2020-05-28 Youri Raaijmakers , Sem Borst , Onno Boxma

Optimal Hyper-Scalable Load Balancing with a Strict Queue Limit

Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such as cloud networks and data centers. A fundamental challenge in the design of load balancing algorithms is to achieve an optimal trade-off…

Performance · Computer Science 2020-12-16 Mark van der Boor , Sem Borst , Johan van Leeuwaarden

Scalable Load Balancing in the Presence of Heterogeneous Servers

Heterogeneity is becoming increasingly ubiquitous in modern large-scale computer systems. Developing good load balancing policies for systems whose resources have varying speeds is crucial in achieving low response times. Indeed, how best…

Performance · Computer Science 2020-06-26 Kristen Gardner , Jazeem Abdul Jaleel , Alexander Wickeham , Sherwin Doroudi

Stability and Heavy-traffic Delay Optimality of General Load Balancing Policies in Heterogeneous Service Systems

We consider a load balancing system consisting of $n$ single-server queues working in parallel, with heterogeneous service rates. Jobs arrive to a central dispatcher, which has to dispatch them to one of the queues immediately upon arrival.…

Performance · Computer Science 2025-10-17 Yishun Luo , Martin Zubeldia

Intelligent Load Balancing Systems using Reinforcement Learning System

Load Balancing is a fundamental technology for scaling cloud infrastructure. It enables systems to distribute incoming traffic across backend servers using predefined algorithms such as round robin, weighted round robin, least connections,…

Networking and Internet Architecture · Computer Science 2025-05-14 Raju Singh

Asymptotically Optimal Load Balancing in Large-scale Heterogeneous Systems with Multiple Dispatchers

We consider the load balancing problem in large-scale heterogeneous systems with multiple dispatchers. We introduce a general framework called Local-Estimation-Driven (LED). Under this framework, each dispatcher keeps local (possibly…

Performance · Computer Science 2020-02-21 Xingyu Zhou , Ness Shroff , Adam Wierman

We Are Impatient: Algorithms for Geographically Distributed Load Balancing with (Almost) Arbitrary Load Functions

In geographically-distributed systems, communication latencies are non-negligible. The perceived processing time of a request is thus composed of the time needed to route the request to the server and the true processing time. Once a…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-11 Piotr Skowron , Krzysztof Rzadca

Optimal Service Elasticity in Large-Scale Distributed Systems

A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent user-perceived performance in the presence of uncertain and…

Probability · Mathematics 2017-06-23 Debankur Mukherjee , Souvik Dhara , Sem Borst , Johan S. H. van Leeuwaarden

Heavy-traffic Delay Optimality in Pull-based Load Balancing Systems: Necessary and Sufficient Conditions

In this paper, we consider a load balancing system under a general pull-based policy. In particular, each arrival is randomly dispatched to one of the servers whose queue lengths are below a threshold, if there are any; otherwise, this…

Performance · Computer Science 2018-08-22 Xingyu Zhou , Jian Tan , Ness Shroff

Utility maximizing load balancing policies

Consider a service system where incoming tasks are instantaneously dispatched to one out of many heterogeneous server pools. Associated with each server pool is a concave utility function which depends on the class of the server pool and…

Probability · Mathematics 2024-02-13 Diego Goldsztajn , Sem C. Borst , Johan S. H. van Leeuwaarden

Optimal Load Balancing and Assessment of Existing Load Balancing Criteria

Parallel iterative applications often suffer from load imbalance, one of the most critical performance degradation factors. Hence, load balancing techniques are used to distribute the workload evenly to maximize performance. A key challenge…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-04-06 Anthony Boulmier , Nabil Abdennadher , Bastien Chopard

Learning Mean-Field Control for Delayed Information Load Balancing in Large Queuing Systems

Recent years have seen a great increase in the capacity and parallel processing power of data centers and cloud services. To fully utilize the said distributed systems, optimal load balancing for parallel queuing architectures must be…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-10 Anam Tahir , Kai Cui , Heinz Koeppl

Asymptotic Optimality of Power-of-$d$ Load Balancing in Large-Scale Systems

We consider a system of $N$ identical server pools and a single dispatcher where tasks arrive as a Poisson process of rate $\lambda(N)$. Arriving tasks cannot be queued, and must immediately be assigned to one of the server pools to start…

Probability · Mathematics 2022-08-12 Debankur Mukherjee , Sem C. Borst , Johan S. H. van Leeuwaarden , Philip A. Whiting

Threshold Load Balancing in Networks

We study probabilistic protocols for concurrent threshold-based load balancing in networks. There are n resources or machines represented by nodes in an undirected graph and m >> n users that try to find an acceptable resource by moving…

Data Structures and Algorithms · Computer Science 2013-06-07 Martin Hoefer , Thomas Sauerwald

Evaluating load balancing policies for performance and energy-efficiency

Nowadays, more and more increasingly hard computations are performed in challenging fields like weather forecasting, oil and gas exploration, and cryptanalysis. Many of such computations can be implemented using a computer cluster with a…

Performance · Computer Science 2016-10-27 Freek van den Berg , Björn F. Postema , Boudewijn R. Haverkort

Load Balancing via Random Local Search in Closed and Open systems

In this paper, we analyze the performance of random load resampling and migration strategies in parallel server systems. Clients initially attach to an arbitrary server, but may switch server independently at random instants of time in an…

Networking and Internet Architecture · Computer Science 2010-04-12 A. Ganesh , S. Lilienthal , D. Manjunath , A. Proutiere , F. Simatos

Optimal Service Mode Assignment in a Simple Computation Offloading System: Extended Version

We consider a simple computation offloading model where jobs can either be fully processed in the cloud or be partially processed at a local server before being sent to the cloud to complete processing. Our goal is to design a policy for…

Systems and Control · Electrical Eng. & Systems 2025-09-24 Darin Jeff , Eytan Modiano

Trevor: Automatic configuration and scaling of stream processing pipelines

Operating a distributed data stream processing workload efficiently at scale is hard. The operator of the workload must parallelize and lay out tasks of the workload with resources that match the requirement of target data rate. The…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-12-27 Manu Bansal , Eyal Cidon , Arjun Balasingam , Aditya Gudipati , Christos Kozyrakis , Sachin Katti