Related papers: Scalable, Fast Cloud Computing with Execution Temp…

Execution Templates: Caching Control Plane Decisions for Strong Scaling of Data Analytics

Control planes of cloud frameworks trade off between scheduling granularity and performance. Centralized systems schedule at task granularity, but only schedule a few thousand tasks per second. Distributed systems schedule hundreds of…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-05-05 Omid Mashayekhi , Hang Qu , Chinmayee Shah , Philip Levis

Reproducible and Portable Big Data Analytics in the Cloud

Cloud computing has become a major approach to help reproduce computational experiments. Yet there are still two main difficulties in reproducing batch based big data analytics (including descriptive and predictive analytics) in the cloud.…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-03-13 Xin Wang , Pei Guo , Xingyan Li , Aryya Gangopadhyay , Carl E. Busart , Jade Freeman , Jianwu Wang

Canary: A Scheduling Architecture for High Performance Cloud Computing

We present Canary, a scheduling architecture that allows high performance analytics workloads to scale out to run on thousands of cores. Canary is motivated by the observation that a central scheduler is a bottleneck for high performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-04-15 Hang Qu , Omid Mashayekhi , David Terei , Philip Levis

ElastiBench: Scalable Continuous Benchmarking on Cloud FaaS Platforms

Running microbenchmark suites often and early in the development process enables developers to identify performance issues in their application. Microbenchmark suites of complex applications can comprise hundreds of individual benchmarks…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-30 Trever Schirmer , Tobias Pfandzelter , David Bermbach

Scalable Runtime Architecture for Data-driven, Hybrid HPC and ML Workflow Applications

Hybrid workflows combining traditional HPC and novel ML methodologies are transforming scientific computing. This paper presents the architecture and implementation of a scalable runtime system that extends RADICAL-Pilot with service-based…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-18 Andre Merzky , Mikhail Titov , Matteo Turilli , Ozgur Kilic , Tianle Wang , Shantenu Jha

Towards an Adaptive Runtime System for Cloud-Native HPC

The ongoing convergence of HPC and cloud computing presents a fundamental challenge: HPC applications, designed for static and homogeneous supercomputers, are ill-suited for the dynamic, heterogeneous, and volatile nature of the cloud.…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-17 Aditya Bhosale , Advait Tahilyani , Laxmikant Kale , Sara Kokkila-Schumacher

A Comprehensive Experimentation Framework for Energy-Efficient Design of Cloud-Native Applications

Current approaches to designing energy-efficient applications typically rely on measuring individual components using readily available local metrics, like CPU utilization. However, these metrics fall short when applied to cloud-native…

Software Engineering · Computer Science 2025-03-12 Sebastian Werner , Maria C. Borges , Karl Wolf , Stefan Tai

Divide (CPU Load) and Conquer: Semi-Flexible Cloud Resource Allocation

Cloud resource management is often modeled by two-dimensional bin packing with a set of items that correspond to tasks having fixed CPU and memory requirements. However, applications running in clouds are much more flexible: modern…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-01 Bartłomiej Przybylski , Paweł Żuk , Krzysztof Rzadca

Performance Characterization of In-Memory Data Analytics on a Modern Cloud Server

In last decade, data analytics have rapidly progressed from traditional disk-based processing to modern in-memory processing. However, little effort has been devoted at enhancing performance at micro-architecture level. This paper…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Ahsan Javed Awan , Mats Brorsson , Vladimir Vlassov , Eduard Ayguade

Cloud-Assisted Nonlinear Model Predictive Control for Finite-Duration Tasks

Cloud computing creates new possibilities for control applications by offering powerful computation and storage capabilities. In this paper, we propose a novel cloud-assisted model predictive control (MPC) framework in which we…

Systems and Control · Electrical Eng. & Systems 2021-06-22 Nan Li , Kaixiang Zhang , Zhaojian Li , Vaibhav Srivastava , Xiang Yin

Enabling the Reflex Plane with the nanoPU

Many recent papers have demonstrated fast in-network computation using programmable switches, running many orders of magnitude faster than CPUs. The main limitation of writing software for switches is the constrained programming model and…

Networking and Internet Architecture · Computer Science 2022-12-14 Stephen Ibanez , Alex Mallery , Serhat Arslan , Theo Jepsen , Muhammad Shahbaz , Changhoon Kim , Nick McKeown

Revisiting Quantum Supremacy: Simulating Sycamore-Class Circuits Using Hybrid CPU/GPU HPC Workloads

We present a framework for effectively simulating the execution of quantum circuits originally designed to demonstrate quantum supremacy using accessible high-performance computing (HPC) infrastructure. Building on prior CPU-only…

Quantum Physics · Physics 2025-12-09 Bob Wold , Venkateswaran Kasirajan

A New Execution Model and Executor for Adaptively Optimizing the Performance of Parallel Algorithms Using HPX Runtime System

Developing parallel algorithms efficiently requires careful management of concurrency across diverse hardware architectures. C++ executors provide a standardized interface that simplifies the development process, allowing developers to…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-22 Karame Mohammadiporshokooh , Steven R. Brandt , Hartmut Kaiser

Cloud-Based Scheduling Mechanism for Scalable and Resource-Efficient Centralized Controllers

This paper proposes a novel approach to address the challenges of deploying complex robotic software in large-scale systems, i.e., Centralized Nonlinear Model Predictive Controllers (CNMPCs) for multi-agent systems. The proposed approach is…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-05-13 Achilleas Santi Seisa , Sumeet Gajanan Satpute , George Nikolakopoulos

Closing the Performance Gap with Modern C++

On the way to Exascale, programmers face the increasing challenge of having to support multiple hardware architectures from the same code base. At the same time, portability of code and performance are increasingly difficult to achieve as…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-14 Thomas Heller , Hartmut Kaiser , Patrick Diehl , Dietmar Fey , Marc Alexander Schweitzer

Hyper: Distributed Cloud Processing for Large-Scale Deep Learning Tasks

Training and deploying deep learning models in real-world applications require processing large amounts of data. This is a challenging task when the amount of data grows to a hundred terabytes, or even, petabyte-scale. We introduce a hybrid…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-17 Davit Buniatyan

A Self-adaptive Auto-scaling Method for Scientific Applications on HPC Environments and Clouds

High intensive computation applications can usually take days to months to finish an execution. During this time, it is common to have variations of the available resources when considering that such hardware is usually shared among a…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-01-27 Kiran Mantripragada , Alecio Binotto , Leonardo P. Tizzei

Ensemble Toolkit: Scalable and Flexible Execution of Ensembles of Tasks

There are many science applications that require scalable task-level parallelism and support for flexible execution and coupling of ensembles of simulations. Most high-performance system software and middleware, however, are designed to…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-06-29 Vivekanandan Balasubramanian , Antons Treikalis , Ole Weidner , Shantenu Jha

Cross-Layer Multi-Cloud Real-Time Application QoS Monitoring and Benchmarking As-a-Service Framework

Cloud computing provides on-demand access to affordable hardware (multi-core CPUs, GPUs, disks, and networking equipment) and software (databases, application servers and data processing frameworks) platforms with features such as…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Khalid Alhamazani , Rajiv Ranjan , Prem Prakash Jayaraman , Karan Mitra , Chang Liu , Fethi Rabhi , Dimitrios Georgakopoulos , Lizhe Wang

Towards Experiment Execution in Support of Community Benchmark Workflows for HPC

A key hurdle is demonstrating compute resource capability with limited benchmarks. We propose workflow templates as a solution, offering adaptable designs for specific scientific applications. Our paper identifies common usage patterns for…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-31 Gregor von Laszewski , Wesley Brewer , Sean R. Wilkinson , Andrew Shao , J. P. Fleischer , Harshad Pitkar , Christine R. Kirkpatrick , Geoffrey C. Fox