Related papers: Predicting Intermediate Storage Performance for Wo…

Accurate Performance Predictors for Edge Computing Applications

Accurate prediction of application performance is critical for enabling effective scheduling and resource management in resource-constrained dynamic edge environments. However, achieving predictable performance in such environments remains…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-24 Panagiotis Giannakopoulos, Bart van Knippenberg, Kishor Chandra Joshi, Nicola Calabretta, George Exarchakos

Predicting the Performance of Scientific Workflow Tasks for Cluster Resource Management: An Overview of the State of the Art

Scientific workflow management systems support large-scale data analysis on cluster infrastructures. For this, they interact with resource managers which schedule workflow tasks onto cluster nodes. In addition to workflow task descriptions,…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-30 Jonathan Bader, Kathleen West, Soeren Becker, Svetlana Kulagina, Fabian Lehmann, Lauritz Thamsen, Henning Meyerhenke, Odej Kao

Predicting the Performance-Cost Trade-off of Applications Across Multiple Systems

In modern computing environments, users may have multiple systems accessible to them such as local clusters, private clouds, or public clouds. This abundance of choices makes it difficult for users to select the system and configuration for…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-04-05 Amir Nassereldine, Safaa Diab, Mohammed Baydoun, Kenneth Leach, Maxim Alt, Dejan Milojicic, Izzat El Hajj

Towards Learned Predictability of Storage Systems

With the rapid development of cloud computing and big data technologies, storage systems have become a fundamental building block of datacenters, incorporating hardware innovations such as flash solid state drives and non-volatile memories,…

Databases · Computer Science 2023-08-01 Chenyuan Wu

End-to-End Predictions-Based Resource Management Framework for Supercomputer Jobs

Job submissions of parallel applications to production supercomputer systems will have to be carefully tuned in terms of the job submission parameters to obtain minimum response times. In this work, we have developed an end-to-end resource…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-20 Swetha Hariharan, Prakash Murali, Abhishek Pasari, Sathish Vadhiyar

Predictive Modeling of I/O Performance for Machine Learning Training Pipelines: A Data-Driven Approach to Storage Optimization

Modern machine learning training is increasingly bottlenecked by data I/O rather than compute. GPUs often sit idle at below 50% utilization waiting for data. This paper presents a machine learning approach to predict I/O performance and…

Performance · Computer Science 2025-12-22 Karthik Prabhakar, Durgamadhab Mishra

Cloud Workload Prediction based on Workflow Execution Time Discrepancies

Infrastructure as a service clouds hide the complexity of maintaining the physical infrastructure with a slight disadvantage: they also hide their internal working details. Should users need knowledge about these details e.g., to increase…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-03-20 Gabor Kecskemeti, Zsolt Nemeth, Attila Kertesz, Rajiv Ranjan

Optimizing Prediction Serving on Low-Latency Serverless Dataflow

Prediction serving systems are designed to provide large volumes of low-latency inferences from machine learning models. These systems mix data processing and computationally intensive model inference and benefit from multiple heterogeneous…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-07-14 Vikram Sreekanti, Harikaran Subbaraj, Chenggang Wu, Joseph E. Gonzalez, Joseph M. Hellerstein

CMS Workflow Execution using Intelligent Job Scheduling and Data Access Strategies

Complex scientific workflows can process large amounts of data using thousands of tasks. The turnaround times of these workflows are often affected by various latencies such as the resource discovery, scheduling and data access latencies…

Software Engineering · Computer Science 2015-06-04 Khawar Hasham, Antonio Delgado Peris, Ashiq Anjum, Dave Evans, Dirk Hufnagel, Eduardo Huedo, José M. Hernández, Richard McClatchey, Stephen Gowdy, Simon Metson

Modular Resource Centric Learning for Workflow Performance Prediction

Workflows provide an expressive programming model for fine-grained control of large-scale applications in distributed computing environments. Accurate estimates of complex workflow execution metrics on large-scale machines have several key…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-18 Alok Singh, Mai Nguyen, Shweta Purawat, Daniel Crawl, Ilkay Altintas

A Predictive Application Offloading Algorithm Using Small Datasets for Cloud Robotics

Many robotic applications that are critical for robot performance require immediate feedback, hence execution time is a critical concern. Furthermore, it is common that robots come with a fixed quantity of hardware resources; if an…

Robotics · Computer Science 2021-08-31 Manoj Penmetcha, Shyam Sundar Kannan, Byung-Cheol Min

Performance Analysis of Machine Learning Centered Workload Prediction Models for Cloud

The precise estimation of resource usage is a complex and challenging issue due to the high variability and dimensionality of heterogeneous service types and dynamic workloads. Over the last few years, the prediction of resource usage and…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-02-07 Deepika Saxena, Jitendra Kumar, Ashutosh Kumar Singh, Stefan Schmid

Automatic Storage Structure Selection for hybrid Workload

In the use of database systems, the design of the storage engine and data model directly affects the performance of the database when performing queries. Therefore, the users of the database need to select the storage engine and design data…

Databases · Computer Science 2020-08-18 Hongzhi Wang, Yan Wei, Hao Yan

A Survey on User-Space Storage and Its Implementations

The storage stack in the traditional operating system is primarily optimized towards improving the CPU utilization and hiding the long I/O latency imposed by the slow I/O devices such as hard disk drives (HDDs). However, the emerging…

Operating Systems · Computer Science 2023-06-21 Junzhe Li, Xiurui Pan, Shushu Yi, Jie Zhang

Finding Faster Configurations using FLASH

Finding good configurations for a software system is often challenging since the number of configuration options can be large. Software engineers often make poor choices about configuration or, even worse, they usually use a sub-optimal…

Software Engineering · Computer Science 2018-09-05 Vivek Nair, Zhe Yu, Tim Menzies, Norbert Siegmund, Sven Apel

Predicting Tactical Solutions to Operational Planning Problems under Imperfect Information

This paper offers a methodological contribution at the intersection of machine learning and operations research. Namely, we propose a methodology to quickly predict tactical solutions to a given operational problem. In this context, the…

Machine Learning · Computer Science 2022-06-10 Eric Larsen, Sébastien Lachapelle, Yoshua Bengio, Emma Frejinger, Simon Lacoste-Julien, Andrea Lodi

Optimal Configuration of API Resources in Cloud Native Computing

This paper presents how an existing framework for offline performance optimization can be applied to microservice applications during the Release phase of the DevOps life cycle. Optimization of resource allocation configuration parameters…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-30 Eddy Truyen, Wouter Joosen

A Data-Driven Approach to Dynamically Adjust Resource Allocation for Compute Clusters

Nowadays, data-centers are largely under-utilized because resource allocation is based on reservation mechanisms which ignore actual resource utilization. Indeed, it is common to reserve resources for peak demand, which may occur only for a…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-03 Francesco Pace, Dimitrios Milios, Damiano Carra, Daniele Venzano, Pietro Michiardi

A reliability- and latency-driven task allocation framework for workflow applications in the edge-hub-cloud continuum

A growing number of critical workflow applications leverage a streamlined edge-hub-cloud architecture, which diverges from the conventional edge computing paradigm. An edge device, in collaboration with a hub device and a cloud server,…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-02-23 Andreas Kouloumpris, Georgios L. Stavrinides, Maria K. Michael, Theocharis Theocharides

Energy-efficient workflow scheduling based on workflow structures under deadline and budget constraints in the cloud

The utilization of cloud environments to deploy scientific workflow applications is an emerging trend in the scientific community. In this area, the main issue is the scheduling of workflows, which is known as an NP-complete problem. Apart from…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-01-17 J. E. Ndamlabin Mboula, V. C. Kamla, M. H. Hilman, C. Tayou Djamegni