Related papers: Flexible Queueing Architectures

Transportation Polytope and its Applications in Parallel Server Systems

A parallel server system is a stochastic processing network with applications in manufacturing, supply chain, ride-hailing, call centers, etc. Heterogeneous customers arrive in the system, and only a subset of servers can serve any customer…

Networking and Internet Architecture · Computer Science 2023-01-10 Sushil Mahavir Varma , Siva Theja Maguluri

Stability of Decentralized Queueing Networks Beyond Complete Bipartite Cases

Gaitonde and Tardos recently studied a model of queueing networks where queues compete for servers and re-send returned packets in future rounds. They quantify the amount of additional processing power that guarantees a decentralized…

Computer Science and Game Theory · Computer Science 2022-10-18 Hu Fu , Qun Hu , Jia'nan Lin

On the Power of Centralization in Distributed Processing

In this thesis, we propose and analyze a multi-server model that captures a performance trade-off between centralized and distributed processing. In our model, a fraction $p$ of an available resource is deployed in a centralized manner…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-03-23 Kuang Xu

Fluid limits for interacting queues in sparse dynamic graphs

Consider a network of $n$ single-server queues where tasks arrive independently at each server at rate $\lambda_n$. The servers are connected by a graph that is resampled at rate $\mu_n$ in a way that is symmetric with respect to the…

Probability · Mathematics 2025-10-14 Diego Goldsztajn , Sem C. Borst , Johan S. H. van Leeuwaarden

Delay-minimizing capacity allocation in an infinite server queueing system

We consider a service system with an infinite number of exponential servers sharing a finite service capacity. The servers are ordered according to their speed, and arriving customers join the fastest idle server. A capacity allocation is…

Probability · Mathematics 2018-08-23 Refael Hassin , Liron Ravner

Dynamic scheduling in a partially fluid, partially lossy queueing system

We consider a single server queueing system with two classes of jobs: eager jobs with small sizes that require service to begin almost immediately upon arrival, and tolerant jobs with larger sizes that can wait for service. While blocking…

Performance · Computer Science 2021-12-24 Kiran Chaudhary , Veeraruna Kavitha , Jayakrishnan Nair

Exploiting Data Locality to Improve Performance of Heterogeneous Server Clusters

We consider load balancing in large-scale heterogeneous server systems in the presence of data locality that imposes constraints on which tasks can be assigned to which servers. The constraints are naturally captured by a bipartite graph…

Probability · Mathematics 2022-12-01 Zhisheng Zhao , Debankur Mukherjee , Ruoyu Wu

Cone Schedules for Processing Systems in Fluctuating Environments

We consider a generalized processing system having several queues, where the available service rate combinations are fluctuating over time due to reliability and availability variations. The objective is to allocate the available resources,…

Networking and Internet Architecture · Computer Science 2011-05-04 Kevin Ross , Nicholas Bambos , George Michailidis

A lower bound on the queueing delay in resource constrained load balancing

We consider the following distributed service model: jobs with unit mean, general distribution, and independent processing times arrive as a renewal process of rate $\lambda n$, with $0<\lambda<1$, and are immediately dispatched to one of…

Probability · Mathematics 2018-07-10 David Gamarnik , John N. Tsitsiklis , Martin Zubeldia

Queues on a dynamically evolving graph

This paper considers a population process on a dynamically evolving graph, which can be alternatively interpreted as a queueing network. The queues are of infinite-server type, entailing that at each node all customers present are served in…

Probability · Mathematics 2020-01-01 Michel Mandjes , Nicos Starreveld , René Bekker

Scheduling a multi class queue with many exponential servers: asymptotic optimality in heavy traffic

We consider the problem of scheduling a queueing system in which many statistically identical servers cater to several classes of impatient customers. Service times and impatience clocks are exponential while arrival processes are renewal.…

Probability · Mathematics 2007-05-23 Rami Atar , Avi Mandelbaum , Martin I. Reiman

Balanced Nonadaptive Redundancy Scheduling

Distributed computing systems implement redundancy to reduce the job completion time and variability. Despite a large body of work about computing redundancy, the analytical performance evaluation of redundancy techniques in queuing systems…

Information Theory · Computer Science 2022-01-05 Amir Behrouzi-Far , Emina Soljanin

Load Balancing Under Strict Compatibility Constraints

We study large-scale systems operating under the JSQ$(d)$ policy in the presence of stringent task-server compatibility constraints. Consider a system with $N$ identical single-server queues and $M(N)$ task types, where each server is able…

Probability · Mathematics 2020-08-25 Daan Rutten , Debankur Mukherjee

Stability and Capacity Regions or Discrete Time Queueing Networks

We consider stability and network capacity in discrete time queueing systems. Relationships between four common notions of stability are described. Specifically, we consider rate stability, mean rate stability, steady state stability, and…

Networking and Internet Architecture · Computer Science 2010-03-18 Michael J. Neely

Optimal Service Elasticity in Large-Scale Distributed Systems

A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent user-perceived performance in the presence of uncertain and…

Probability · Mathematics 2017-06-23 Debankur Mukherjee , Souvik Dhara , Sem Borst , Johan S. H. van Leeuwaarden

On the Benefit of Virtualization: Strategies for Flexible Server Allocation

Virtualization technology facilitates a dynamic, demand-driven allocation and migration of servers. This paper studies how the flexibility offered by network virtualization can be used to improve Quality-of-Service parameters such as…

Networking and Internet Architecture · Computer Science 2010-12-14 Dushyant Arora , Anja Feldmann , Gregor Schaffrath , Stefan Schmid

Flexibility in an asymmetric system with prolonged service time at non-dedicated servers

The prolonged service time at non-dedicated servers has been observed in [1]. Motivated by such real problems, we propose a stylized model which characterizes the feature of the prolonged service time at non-dedicated servers in an…

Performance · Computer Science 2021-03-02 Yanting Chen , Jingui Xie , Taozeng Zhu

Robust Scheduling for Flexible Processing Networks

Modern processing networks often consist of heterogeneous servers with widely varying capabilities, and process job flows with complex structure and requirements. A major challenge in designing efficient scheduling policies in these…

Probability · Mathematics 2016-10-13 Ramtin Pedarsani , Jean Walrand , Yuan Zhong

Geometric lower bounds for the steady-state occupancy of processing networks with limited connectivity

We consider processing networks where multiple dispatchers are connected to single-server queues by a bipartite compatibility graph, modeling constraints that are common in data centers and cloud networks due to geographic reasons or data…

Probability · Mathematics 2026-05-01 Diego Goldsztajn , Andres Ferragut

Matching Queues, Flexibility and Incentives

Problem definition: In many matching markets, some agents are fully flexible, while others only accept a subset of jobs. For example, ridesharing drivers can specify on the platform the destinations they are willing to accept. Conventional…

Computer Science and Game Theory · Computer Science 2026-01-30 Chiwei Yan , Francisco Castro , Peter Frazier , Hongyao Ma , Hamid Nazerzadeh