Related papers: Optimal Hyper-Scalable Load Balancing with a Stric…

200 papers

A fundamental challenge in large-scale networked systems, viz., data centers and cloud networks, is to distribute tasks to a pool of servers, using minimal instantaneous state information, while providing excellent delay performance. In this…

Probability · Mathematics 2018-09-07 Debankur Mukherjee

A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent user-perceived performance in the presence of uncertain and…

Probability · Mathematics 2017-06-23 Debankur Mukherjee, Souvik Dhara, Sem Borst, Johan S. H. van Leeuwaarden

In geographically-distributed systems, communication latencies are non-negligible. The perceived processing time of a request is thus composed of the time needed to route the request to the server and the true processing time. Once a…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-11 Piotr Skowron, Krzysztof Rzadca

We consider a discrete-time parallel service system consisting of $n$ heterogeneous single server queues with infinite capacity. Jobs arrive to the system as an i.i.d. process with rate proportional to $n$, and must be immediately…

Performance · Computer Science 2025-05-30 Yishun Luo, Martin Zubeldia

Load balancing across parallel servers is an important class of congestion control problems that arises in service systems. An effective load balancer relies heavily on accurate, real-time congestion information to make routing decisions.…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-24 Gal Mendelson, Xu Kuang

The basic load balancing scenario involves a single dispatcher where tasks arrive that must immediately be forwarded to one of $N$ single-server queues. We discuss recent advances on scalable load balancing schemes which provide favorable…

We consider a load balancing system consisting of $n$ single-server queues working in parallel, with heterogeneous service rates. Jobs arrive to a central dispatcher, which has to dispatch them to one of the queues immediately upon arrival.…

Performance · Computer Science 2025-10-17 Yishun Luo, Martin Zubeldia

This paper considers the steady-state performance of load balancing algorithms in a many-server system with distributed queues. The system has $N$ servers, and each server maintains a local queue with buffer size $b-1,$ i.e. a server can…

Probability · Mathematics 2019-12-30 Xin Liu, Lei Ying

Traffic load-balancing in datacenters alleviates hot spots and improves network utilization. In this paper, a stable in-network load-balancing algorithm is developed in the setting of software-defined networking. A control plane configures…

Networking and Internet Architecture · Computer Science 2016-12-07 Sucha Supittayapornpong, Michael J. Neely

We consider a large-scale service system where incoming tasks have to be instantaneously dispatched to one out of many parallel server pools. The user-perceived performance degrades with the number of concurrent tasks and the dispatcher…

Load balancing algorithms play a vital role in enhancing performance in data centers and cloud networks. Due to the massive size of these systems, scalability challenges, and especially the communication overhead associated with load…

Probability · Mathematics 2019-03-07 Mark van der Boor, Sem Borst, Johan van Leeuwaarden

Heterogeneity is becoming increasingly ubiquitous in modern large-scale computer systems. Developing good load balancing policies for systems whose resources have varying speeds is crucial in achieving low response times. Indeed, how best…

Performance · Computer Science 2020-06-26 Kristen Gardner, Jazeem Abdul Jaleel, Alexander Wickeham, Sherwin Doroudi

Scalable load balancing algorithms are of great interest in cloud networks and data centers, necessitating the use of tractable techniques to compute optimal load balancing policies for good performance. However, most existing scalable…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-25 Anam Tahir, Kai Cui, Heinz Koeppl

Recent years have seen a great increase in the capacity and parallel processing power of data centers and cloud services. To fully utilize the said distributed systems, optimal load balancing for parallel queuing architectures must be…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-10 Anam Tahir, Kai Cui, Heinz Koeppl

We consider the fundamental problem of managing a bounded size queue buffer where traffic consists of packets of varying size, where each packet requires several rounds of processing before it can be transmitted from the queue buffer. The…

Networking and Internet Architecture · Computer Science 2013-09-06 Kirill Kogan, Alejandro Lopez-Ortiz, Sergey I. Nikolenko, Gabriel Scalosub, Michael Segal

We present here a cost-effective framework for a robust, scalable, and distributed job processing system that adapts easily to dynamic computing needs, with efficient load balancing for heterogeneous systems. The design is such that each…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-06-07 Putti Srinivasrao, V. P. C. Rao, A. Govardhan, Ambika Prasad Mohanty

We present an overview of scalable load balancing algorithms which provide favorable delay performance in large-scale systems, and yet only require minimal implementation overhead. Aimed at a broad audience, the paper starts with an…

Performance · Computer Science 2017-12-25 Mark van der Boor, Sem C. Borst, Johan S. H. van Leeuwaarden, Debankur Mukherjee

In large-scale distributed systems, balancing the load in an efficient way is crucial in order to achieve low latency. Recently, some load balancing policies have been suggested which are able to achieve a bounded maximum queue length in…

Performance · Computer Science 2022-01-12 Tim Hellemans, Grzegorz Kielanski, Benny Van Houdt

Current-day data centers and high-volume cloud services employ a broad set of heterogeneous servers. In such settings, client requests typically arrive at multiple entry points, and dispatching them to servers is an urgent distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-27 Guy Goren, Shay Vargaftik, Yoram Moses

We study the scheduling policies for asymptotically optimal delay in queueing systems with switching overhead. Such systems consist of a single server that serves multiple queues, and some capacity is lost whenever the server switches to…

Performance · Computer Science 2017-01-17 Ping-Chun Hsieh, I-Hong Hou, Xi Liu