Related papers: Efficient Runtime Profiling for Black-box Machine …

Formal and Empirical Study of Metadata-Based Profiling for Resource Management in the Computing Continuum

We present and formalize a general approach for profiling workloads by leveraging only a priori available static metadata to supply appropriate resource needs. Understanding the requirements and characteristics of a workload's runtime is…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-30 Andrea Morichetta, Stefan Nastic, Victor Casamayor Pujol, Schahram Dustdar

Predictive Performance Modeling for Distributed Computing using Black-Box Monitoring and Machine Learning

In many domains, the previous decade was characterized by increasing data volumes and growing complexity of computational workloads, creating new demands for highly data-parallel computing in distributed systems. Effective operation of…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-25 Carl Witt, Marc Bux, Wladislaw Gusew, Ulf Leser

Scaling-up Distributed Processing of Data Streams for Machine Learning

Emerging applications of machine learning in numerous areas involve continuous gathering of and learning from streams of data. Real-time incorporation of streaming data into the learned models is essential for improved inference in these…

Machine Learning · Computer Science 2020-12-01 Matthew Nokleby, Haroon Raja, Waheed U. Bajwa

Run Time Approximation of Non-blocking Service Rates for Streaming Systems

Stream processing is a compute paradigm that promises safe and efficient parallelism. Modern big-data problems are often well suited for stream processing's throughput-oriented nature. Realization of efficient stream processing requires…

Performance · Computer Science 2015-04-14 Jonathan C. Beard, Roger D. Chamberlain

Automated Machine Learning Techniques for Data Streams

Automated machine learning techniques have benefited from tremendous research progress recently. These developments and the continuously growing demand for machine learning experts led to the development of numerous AutoML tools. However, these…

Machine Learning · Computer Science 2021-06-15 Alexandru-Ionut Imbrea

Generative Profiling for Soft Real-Time Systems and its Applications to Resource Allocation

Modern real-time systems require accurate characterization of task timing behavior to ensure predictable performance, particularly on complex hardware architectures. Existing methods, such as worst-case execution time analysis, often fail…

Systems and Control · Electrical Eng. & Systems 2026-04-03 Georgiy A. Bondar, Abigail Eisenklam, Yifan Cai, Robert Gifford, Tushar Sial, Linh Thi Xuan Phan, Abhishek Halder

Resource- and Message Size-Aware Scheduling of Stream Processing at the Edge with application to Realtime Microscopy

Whilst computational resources at the cloud edge can be leveraged to improve latency and reduce the costs of cloud services for a wide variety of mobile, web, and IoT applications, such resources are naturally constrained. For distributed…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-12-20 Ben Blamey, Ida-Maria Sintorn, Andreas Hellander, Salman Toor

Dynamic Adaptation in Data Storage: Real-Time Machine Learning for Enhanced Prefetching

The exponential growth of data storage demands has necessitated the evolution of hierarchical storage management strategies [1]. This study explores the application of streaming machine learning [3] to revolutionize data prefetching within…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-30 Chiyu Cheng, Chang Zhou, Yang Zhao, Jin Cao

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures: A Machine Learning Based Approach

This article presents an automatic approach to quickly derive a good solution for hardware resource partition and task granularity for task-based parallel applications on heterogeneous many-core architectures. Our approach employs a…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-10 Peng Zhang, Jianbin Fang, Canqun Yang, Chun Huang, Tao Tang, Zheng Wang

Collaborative Cluster Configuration for Distributed Data-Parallel Processing: A Research Overview

Many organizations routinely analyze large datasets using systems for distributed data-parallel processing and clusters of commodity resources. Yet, users need to configure adequate resources for their data processing jobs. This requires…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-02 Lauritz Thamsen, Dominik Scheinert, Jonathan Will, Jonathan Bader, Odej Kao

Simulation-Aided Policy Tuning for Black-Box Robot Learning

How can robots learn and adapt to new tasks and situations with little data? Systematic exploration and simulation are crucial tools for efficient robot learning. We present a novel black-box policy search algorithm focused on…

Robotics · Computer Science 2025-02-11 Shiming He, Alexander von Rohr, Dominik Baumann, Ji Xiang, Sebastian Trimpe

CloudProphet: A Machine Learning-Based Performance Prediction for Public Clouds

Computing servers have played a key role in developing and processing emerging compute-intensive applications in recent years. Consolidating multiple virtual machines (VMs) inside one server to run various applications introduces severe…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-09-29 Darong Huang, Luis Costero, Ali Pahlevan, Marina Zapater, David Atienza

Simulating Performance of ML Systems with Offline Profiling

We advocate that simulation based on offline profiling is a promising approach to better understand and improve complex ML systems. Our approach uses operation-level profiling and dataflow-based simulation to ensure it offers a unified…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-18 Hongming Huang, Peng Cheng, Hong Xu, Yongqiang Xiong

Runtime-optimized Multi-way Stream Join Operator for Large-scale Streaming data

Streaming computing enables the real-time processing of large volumes of data and offers significant advantages for various applications, including real-time recommendations, anomaly detection, and monitoring. The multi-way stream join…

Databases · Computer Science 2024-11-26 Jinlong Hu, Tingfeng Qiu

A Review of Resource Management in Fog Computing: Machine Learning Perspective

Fog computing has become a promising technology for processing users' requests close to the users themselves, reducing response time for latency-sensitive requests. Despite its advantages, properties such as resource heterogeneity and…

Networking and Internet Architecture · Computer Science 2022-09-08 Muhammad Fahimullah, Shohreh Ahvar, Maria Trocan

Machine Learning-based Orchestration of Containers: A Taxonomy and Future Directions

Containerization is a lightweight application virtualization technology, providing high environmental consistency, operating system distribution portability, and resource isolation. Existing mainstream cloud service providers have…

Machine Learning · Computer Science 2021-08-23 Zhiheng Zhong, Minxian Xu, Maria Alejandra Rodriguez, Chengzhong Xu, Rajkumar Buyya

Evaluation of Load Prediction Techniques for Distributed Stream Processing

Distributed Stream Processing (DSP) systems enable processing large streams of continuous data to produce results in near real time. They are an essential part of many data-intensive applications and analytics platforms. The rate at…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-08-11 Kordian Gontarska, Morgan Geldenhuys, Dominik Scheinert, Philipp Wiesner, Andreas Polze, Lauritz Thamsen

Optimization and Learning with Information Streams: Time-varying Algorithms and Applications

There is a growing cross-disciplinary effort in the broad domain of optimization and learning with streams of data, applied to settings where traditional batch optimization techniques cannot produce solutions at time scales that match the…

Optimization and Control · Mathematics 2021-11-29 Emiliano Dall'Anese, Andrea Simonetto, Stephen Becker, Liam Madden

Task Runtime Prediction in Scientific Workflows Using an Online Incremental Learning Approach

Many algorithms in workflow scheduling and resource provisioning rely on the performance estimation of tasks to produce a scheduling plan. A profiler that is capable of modeling the execution of tasks and predicting their runtime…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-01 Muhammad H. Hilman, Maria A. Rodriguez, Rajkumar Buyya

Time-Based Roofline for Deep Learning Performance Analysis

Deep learning applications are usually very compute-intensive and require a long run time for training and inference. This has been tackled by researchers from both hardware and software sides, and in this paper, we propose a Roofline-based…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-09-24 Yunsong Wang, Charlene Yang, Steven Farrell, Yan Zhang, Thorsten Kurth, Samuel Williams