Related papers: Delay Optimization in a Simple Offloading System: …

Optimal Service Mode Assignment in a Simple Computation Offloading System: Extended Version

We consider a simple computation offloading model where jobs can either be fully processed in the cloud or be partially processed at a local server before being sent to the cloud to complete processing. Our goal is to design a policy for…

Systems and Control · Electrical Eng. & Systems 2025-09-24 Darin Jeff , Eytan Modiano

Delay-minimizing capacity allocation in an infinite server queueing system

We consider a service system with an infinite number of exponential servers sharing a finite service capacity. The servers are ordered according to their speed, and arriving customers join the fastest idle server. A capacity allocation is…

Probability · Mathematics 2018-08-23 Refael Hassin , Liron Ravner

On Delay-Optimal Scheduling in Queueing Systems with Replications

In modern computer systems, jobs are divided into short tasks and executed in parallel. Empirical observations in practical systems suggest that the task service times are highly random and the job service time is bottlenecked by the…

Performance · Computer Science 2017-02-08 Yin Sun , C. Emre Koksal , Ness B. Shroff

Opportunistic Scheduling for Optimal Spot Instance Savings in the Cloud

We study the problem of scheduling delay-sensitive jobs over spot and on-demand cloud instances to minimize average cost while meeting an average delay constraint. Jobs arrive as a general stochastic process, and incur different costs based…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-21 Neelkamal Bhuyan , Randeep Bhatia , Murali Kodialam , TV Lakshman

Optimal Hyper-Scalable Load Balancing with a Strict Queue Limit

Load balancing plays a critical role in efficiently dispatching jobs in parallel-server systems such as cloud networks and data centers. A fundamental challenge in the design of load balancing algorithms is to achieve an optimal trade-off…

Performance · Computer Science 2020-12-16 Mark van der Boor , Sem Borst , Johan van Leeuwaarden

Scheduling Policies for Stability and Optimal Server Running Cost in Cloud Computing Platforms

We propose throughput and cost optimal job scheduling algorithms in cloud computing platforms offering Infrastructure as a Service. We first consider online migration and propose job scheduling algorithms to minimize job migration and…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-07 Haritha K , Chandramani Singh

An Optimal-Transport-Based Reinforcement Learning Approach for Computation Offloading

With the mass deployment of computing-intensive applications and delay-sensitive applications on end devices, only adequate computing resources can meet differentiated services' delay requirements. By offloading tasks to cloud servers or…

Networking and Internet Architecture · Computer Science 2021-03-12 Zhuo Li , Xu Zhou , Taixin Li , Yang Liu

Delay constrained Energy Optimization for Edge Cloud Offloading

Resource limited user-devices may offload computation to a cloud server, in order to reduce power consumption and lower the execution time. However, to communicate to the cloud server over a wireless channel, additional energy is consumed…

Signal Processing · Electrical Eng. & Systems 2018-12-04 Shreya Tayade , Peter Rost , Andreas Maeder , Hans D. Schotten

Deadline-Aware Joint Task Scheduling and Offloading in Mobile Edge Computing Systems

The demand for stringent interactive quality-of-service has intensified in both mobile edge computing (MEC) and cloud systems, driven by the imperative to improve user experiences. As a result, the processing of computation-intensive tasks…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-28 Ngoc Hung Nguyen , Van-Dinh Nguyen , Anh Tuan Nguyen , Nguyen Van Thieu , Hoang Nam Nguyen , Symeon Chatzinotas

Delay-Optimal Scheduling for Queueing Systems with Switching Overhead

We study the scheduling polices for asymptotically optimal delay in queueing systems with switching overhead. Such systems consist of a single server that serves multiple queues, and some capacity is lost whenever the server switches to…

Performance · Computer Science 2017-01-17 Ping-Chun Hsieh , I-Hong Hou , Xi Liu

Delay-Optimal Service Chain Forwarding and Offloading in Collaborative Edge Computing

Collaborative edge computing (CEC) is an emerging paradigm for heterogeneous devices to collaborate on edge computation jobs. For congestible links and computing units, delay-optimal forwarding and offloading for service chain tasks (e.g.,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-12-12 Jinkun Zhang , Edmund Yeh

A Computation Offloading Model over Collaborative Cloud-Edge Networks with Optimal Transport Theory

As novel applications spring up in future network scenarios, the requirements on network service capabilities for differentiated services or burst services are diverse. Aiming at the research of collaborative computing and resource…

Networking and Internet Architecture · Computer Science 2021-02-25 Zhuo Li , Xu Zhou , Yang Liu , Congshan Fan , Wei Wang

Delay-Optimal Forwarding and Computation Offloading for Service Chain Tasks

Emerging edge computing paradigms enable heterogeneous devices to collaborate on complex computation applications. However, for congestible links and computing units, delay-optimal forwarding and offloading for service chain tasks (e.g.,…

Networking and Internet Architecture · Computer Science 2024-03-26 Jinkun Zhang , Yuezhou Liu , Edmund Yeh

Stability and Heavy-traffic Delay Optimality of General Load Balancing Policies in Heterogeneous Service Systems

We consider a load balancing system consisting of $n$ single-server queues working in parallel, with heterogeneous service rates. Jobs arrive to a central dispatcher, which has to dispatch them to one of the queues immediately upon arrival.…

Performance · Computer Science 2025-10-17 Yishun Luo , Martin Zubeldia

Scheduling Jobs with Random Resource Requirements in Computing Clusters

We consider a natural scheduling problem which arises in many distributed computing frameworks. Jobs with diverse resource requirements (e.g. memory requirements) arrive over time and must be served by a cluster of servers, each with a…

Networking and Internet Architecture · Computer Science 2019-01-21 Konstantinos Psychas , Javad Ghaderi

Near Delay-Optimal Scheduling of Batch Jobs in Multi-Server Systems

We study a class of scheduling problems, where each job is divided into a batch of unit-size tasks and these tasks can be executed in parallel on multiple servers with New-Better-than-Used (NBU) service time distributions. While many delay…

Networking and Internet Architecture · Computer Science 2023-10-02 Yin Sun , C. Emre Koksal , Ness B. Shroff

Offloading Optimization with Delay Distribution in the 3-tier Federated Cloud, Edge, and Fog Systems

Mobile edge computing and fog computing are promising techniques providing computation service closer to users to achieve lower latency. In this work, we study the optimal offloading strategy in the three-tier federated computation…

Networking and Internet Architecture · Computer Science 2021-07-13 Ren-Hung Hwang , Yuan-Cheng Lai , Ying-Dar Lin

Delay-Optimal Distributed Edge Computing in Wireless Edge Networks

By integrating edge computing with parallel computing, distributed edge computing (DEC) makes use of distributed devices in edge networks to perform computing in parallel, which can substantially reduce service delays. In this paper, we…

Networking and Internet Architecture · Computer Science 2020-02-10 Xiaowen Gong

Selection of network coding nodes for minimal playback delay in streaming overlays

Network coding permits to deploy distributed packet delivery algorithms that locally adapt to the network availability in media streaming applications. However, it may also increase delay and computational complexity if it is not…

Multimedia · Computer Science 2016-11-17 Nicolae Cleju , Nikolaos Thomos , Pascal Frossard

We Are Impatient: Algorithms for Geographically Distributed Load Balancing with (Almost) Arbitrary Load Functions

In geographically-distributed systems, communication latencies are non-negligible. The perceived processing time of a request is thus composed of the time needed to route the request to the server and the true processing time. Once a…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-11 Piotr Skowron , Krzysztof Rzadca