Related papers: Diagonal Scaling: A Multi-Dimensional Resource Mod…

Dynamic Resource Allocation Method for Load Balance Scheduling over Cloud Data Center Networks

The cloud datacenter has numerous hosts as well as application requests where resources are dynamic. The demands placed on the resource allocation are diverse. These factors could lead to load imbalances, which affect scheduling efficiency…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-07 Sakshi Chhabra, Ashutosh Kumar Singh

Multi-Resource Parallel Query Scheduling and Optimization

Scheduling query execution plans is a particularly complex problem in shared-nothing parallel systems, where each site consists of a collection of local time-shared (e.g., CPU(s) or disk(s)) and space-shared (e.g., memory) resources and…

Databases · Computer Science 2014-04-01 Minos Garofalakis, Yannis Ioannidis

Optimal Service Elasticity in Large-Scale Distributed Systems

A fundamental challenge in large-scale cloud networks and data centers is to achieve highly efficient server utilization and limit energy consumption, while providing excellent user-perceived performance in the presence of uncertain and…

Probability · Mathematics 2017-06-23 Debankur Mukherjee, Souvik Dhara, Sem Borst, Johan S. H. van Leeuwaarden

Prediction of Horizontal Data Partitioning Through Query Execution Cost Estimation

The excessively increased volume of data in modern data management systems demands an improved system performance, frequently provided by data distribution, system scalability and performance optimization techniques. Optimized horizontal…

Machine Learning · Computer Science 2019-11-27 Nino Arsov, Goran Velinov, Aleksandar S. Dimovski, Bojana Koteska, Dragan Sahpaski, Margina Kon-Popovska

Multi Query Optimization in GLADE

SQL-on-Hadoop systems, query optimization, data distribution over multiple nodes, and parallelization techniques are a few of the areas under intense research these days. Big names like Amazon, Google, Microsoft and many more are working on…

Databases · Computer Science 2016-08-17 Abdur Rafay

Optimizing simultaneous autoscaling for serverless cloud computing

This paper explores resource allocation in serverless cloud computing platforms and proposes an optimization approach for autoscaling systems. Serverless computing relieves users from resource management tasks, enabling focus on application…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-31 Harold Ship, Evgeny Shindin, Chen Wang, Diana Arroyo, Asser Tantawi

Towards Stochastically Optimizing Data Computing Flows

With rapid growth in the amount of unstructured data produced by memory-intensive applications, large scale data analytics has recently attracted increasing interest. Processing, managing and analyzing this huge amount of data poses several…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-08-29 Farshid Farhat, Diman Zad Tootaghaj, Mohammad Arjomand

Novel Dynamic Load Balancing Algorithm for Cloud-Based Big Data Analytics

Big data analytics in cloud environments introduces challenges such as real-time load balancing besides security, privacy, and energy efficiency. In this paper, we propose a novel load balancing algorithm in cloud environments that performs…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-02-03 Arman Aghdashi, Seyedeh Leili Mirtaheri

DEPAS: A Decentralized Probabilistic Algorithm for Auto-Scaling

The dynamic provisioning of virtualized resources offered by cloud computing infrastructures allows applications deployed in a cloud environment to automatically increase and decrease the amount of used resources. This capability is called…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-02-14 Nicolo M. Calcavecchia, Bogdan Alexandru Caprarescu, Elisabetta Di Nitto, Daniel J. Dubois, Dana Petcu

An Efficient Secure Dynamic Skyline Query Model

It is now cost-effective to outsource large datasets and perform queries over the cloud. However, in this scenario, there exist serious security and privacy issues, as sensitive information contained in the dataset can be leaked. The most…

Databases · Computer Science 2020-02-25 Weiguo Wang, Hui Li, Yanguo Peng, Sourav S Bhowmick, Peng Chen, Xiaofeng Chen, Jiangtao Cui

Dynamic Deferral of Workload for Capacity Provisioning in Data Centers

Recent increase in energy prices has led researchers to find better ways for capacity provisioning in data centers to reduce energy wastage due to the variation in workload. This paper explores the opportunity for cost saving utilizing the…

Networking and Internet Architecture · Computer Science 2015-03-19 Muhammad Abdullah Adnan, Ryo Sugihara, Yan Ma, Rajesh Gupta

Optimization Strategies for Parallel Computation of Skylines

Skyline queries are one of the most widely adopted tools for Multi-Criteria Analysis, with applications covering diverse domains, including, e.g., Database Systems, Data Mining, and Decision Making. Skylines indeed offer a useful overview…

Databases · Computer Science 2024-11-25 Paolo Ciaccia, Davide Martinenghi

ScaleDL: Towards Scalable and Efficient Runtime Prediction for Distributed Deep Learning Workloads

Deep neural networks (DNNs) form the cornerstone of modern AI services, supporting a wide range of applications, including autonomous driving, chatbots, and recommendation systems. As models increase in size and complexity, DNN workloads…

Machine Learning · Computer Science 2025-11-14 Xiaokai Wang, Shaoyuan Huang, Yuting Li, Xiaofei Wang

Distributed Caching for Complex Querying of Raw Arrays

As applications continue to generate multi-dimensional data at exponentially increasing rates, fast analytics to extract meaningful results is becoming extremely important. The database community has developed array databases that alleviate…

Databases · Computer Science 2018-03-19 Weijie Zhao, Florin Rusu, Bin Dong, Kesheng Wu, Anna Y. Q. Ho, Peter Nugent

Performance Cost Tradeoffs in Intelligent Load Balancing for Multi Data Center Cloud Systems: From Static Policies to Adaptive Resource Distribution

Cloud computing infrastructures increasingly rely on geographically distributed data centers to meet the growing demand for low latency, high availability, and cost-efficient service delivery. In this context, load balancing plays a…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-02-12 Saeid Aghasoleymani Najafabadi, Elaheh Nabavi Nia

Cluster Resource Management for Dynamic Workloads by Online Optimization

Over the past ten years, many different approaches have been proposed for different aspects of the problem of resources management for long running, dynamic and diverse workloads such as processing query streams or distributed deep…

Performance · Computer Science 2023-08-24 Nader Alfares, George Kesidis, Ata Fatahi Baarzi, Aman Jain

Dynamic Resource Allocation in the Cloud with Near-Optimal Efficiency

Cloud computing has motivated renewed interest in resource allocation problems with new consumption models. A common goal is to share a resource, such as CPU or I/O bandwidth, among distinct users with different demand patterns as well as…

Data Structures and Algorithms · Computer Science 2021-01-27 Sebastian Perez-Salazar, Ishai Menache, Mohit Singh, Alejandro Toriello

Towards Optimizing Storage Costs on the Cloud

We study the problem of optimizing data storage and access costs on the cloud while ensuring that the desired performance or latency is unaffected. We first propose an optimizer that optimizes the data placement tier (on the cloud) and the…

Databases · Computer Science 2023-07-07 Koyel Mukherjee, Raunak Shah, Shiv Kumar Saini, Karanpreet Singh, Khushi, Harsh Kesarwani, Kavya Barnwal, Ayush Chauhan

SCALE: Selective Resource Allocation for Overcoming Performance Bottlenecks in Mathematical Test-time Scaling

Test-time compute scaling has emerged as a powerful paradigm for enhancing mathematical reasoning in large language models (LLMs) by allocating additional computational resources during inference. However, current methods employ uniform…

Computation and Language · Computer Science 2025-12-02 Yang Xiao, Chunpu Xu, Ruifeng Yuan, Jiashuo Wang, Wenjie Li, Pengfei Liu

Distributed Rate Scaling in Large-Scale Service Systems

We consider a large-scale parallel-server system, where each server independently adjusts its processing speed in a decentralized manner. The objective is to minimize the overall cost, which comprises the average cost of maintaining the…

Optimization and Control · Mathematics 2023-06-06 Daan Rutten, Martin Zubeldia, Debankur Mukherjee