English
Related papers

Related papers: KS+: Predicting Workflow Task Memory Usage Over Ti…

200 papers

With the increasing amount of data available to scientists in disciplines as diverse as bioinformatics, physics, and remote sensing, scientific workflow systems are becoming increasingly important for composing and executing scalable data…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-20 Jonathan Bader , Nils Diedrich , Lauritz Thamsen , Odej Kao

Scientific workflow management systems support large-scale data analysis on cluster infrastructures. For this, they interact with resource managers which schedule workflow tasks onto cluster nodes. In addition to workflow task descriptions,…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-30 Jonathan Bader , Kathleen West , Soeren Becker , Svetlana Kulagina , Fabian Lehmann , Lauritz Thamsen , Henning Meyerhenke , Odej Kao

As the amount of available data continues to grow in fields as diverse as bioinformatics, physics, and remote sensing, the importance of scientific workflows in the design and implementation of reproducible data analysis pipelines…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-11 Jonathan Bader , Fabian Skalski , Fabian Lehmann , Dominik Scheinert , Jonathan Will , Lauritz Thamsen , Odej Kao

Scientific workflows are used to analyze large amounts of data. These workflows comprise numerous tasks, many of which are executed repeatedly, running the same custom program on different inputs. Users specify resource allocations for each…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-11 Fabian Lehmann , Jonathan Bader , Ninon De Mecquenem , Xing Wang , Vasilis Bountris , Florian Friederici , Ulf Leser , Lauritz Thamsen

Scientific workflow management systems like Nextflow support large-scale data analysis by abstracting away the details of scientific workflows. In these systems, workflows consist of several abstract tasks, of which instances are run in…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-01-20 Jonathan Bader , Lauritz Thamsen , Svetlana Kulagina , Jonathan Will , Henning Meyerhenke , Odej Kao

Many algorithms in workflow scheduling and resource provisioning rely on the performance estimation of tasks to produce a scheduling plan. A profiler that is capable of modeling the execution of tasks and predicting their runtime…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-01 Muhammad H. Hilman , Maria A. Rodriguez , Rajkumar Buyya

Software-defined networks (SDNs) are a huge evolution in simplifying implementation and network operation which have reduced costs and made the network programmable. Although SDNs are a suitable option for solving some of the previous…

Networking and Internet Architecture · Computer Science 2019-10-03 Mahdi Sarbazi , Mehdi SadeghZadeh , seyyed Javad Mir Abedini

Although High Performance Computing (HPC) users understand basic resource requirements such as the number of CPUs and memory limits, internal infrastructural utilization data is exclusively leveraged by cluster operators, who use it to…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-19 Abel Souza , Kristiaan Pelckmans , Johan Tordsson

Many resource management techniques for task scheduling, energy and carbon efficiency, and cost optimization in workflows rely on a-priori task runtime knowledge. Building runtime prediction models on historical data is often not feasible…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-09-14 Jonathan Bader , Fabian Lehmann , Lauritz Thamsen , Ulf Leser , Odej Kao

Failed workloads that consumed significant computational resources in time and space affect the efficiency of data centers significantly and thus limit the amount of scientific work that can be achieved. While the computational power has…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-01-13 Jie Li , Rui Wang , Ghazanfar Ali , Tommy Dang , Alan Sill , Yong Chen

Scientific workflows are designed as directed acyclic graphs (DAGs) and consist of multiple dependent task definitions. They are executed over a large amount of data, often resulting in thousands of tasks with heterogeneous compute…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-19 Jonathan Bader , Nicolas Zunker , Soeren Becker , Odej Kao

Scientific workflows are pipelines of interdependent tasks. They are increasingly executed on shared Kubernetes clusters via workflow engines such as Nextflow. Their energy consumption matters for both cost and sustainability. It is…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-22 Philipp Thamm , Somayeh Mohammadi , Kathleen West , Knut Reinert , Lauritz Thamsen , Ulf Leser

Many scientific workflow scheduling algorithms need to be informed about task runtimes a-priori to conduct efficient scheduling. In heterogeneous cluster infrastructures, this problem becomes aggravated because these runtimes are required…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-05-24 Jonathan Bader , Fabian Lehmann , Lauritz Thamsen , Jonathan Will , Ulf Leser , Odej Kao

We consider a natural scheduling problem which arises in many distributed computing frameworks. Jobs with diverse resource requirements (e.g. memory requirements) arrive over time and must be served by a cluster of servers, each with a…

Networking and Internet Architecture · Computer Science 2019-01-21 Konstantinos Psychas , Javad Ghaderi

Configuring a storage system to better serve an application is a challenging task complicated by a multidimensional, discrete configuration space and the high cost of space exploration (e.g., by running the application with different…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-06-11 Lauro Beltrão Costa , Abmar Barros , Samer Al-Kiswany , Hao Yang , Emalayan Vairavanathan , Matei Ripeanu

While detailed resource usage monitoring is possible on the low-level using proper tools, associating such usage with higher-level abstractions in the application layer that actually cause the resource usage in the first place presents a…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-02 Joel Witzke , Ansgar Lößer , Vasilis Bountris , Florian Schintke , Björn Scheuermann

Recent trends of technology have explored a numerous applications of cloud services, which require a significant amount of energy. In the present scenario, most of the energy sources are limited and have a greenhouse effect on the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-15 Sohan Kumar Pande , Sanjaya Kumar Panda , Preeti Ranjan Sahu

Nowadays, data-centers are largely under-utilized because resource allocation is based on reservation mechanisms which ignore actual resource utilization. Indeed, it is common to reserve resources for peak demand, which may occur only for a…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-03 Francesco Pace , Dimitrios Milios , Damiano Carra , Daniele Venzano , Pietro Michiardi

The collaborative efforts of large communities in science experiments, often comprising thousands of global members, reflect a monumental commitment to exploration and discovery. Recently, advanced and complex data processing has gained…

Scientific research in many fields routinely requires the analysis of large datasets, and scientists often employ workflow systems to leverage clusters of computers for their data analysis. However, due to their size and scale, these…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-05 Lauritz Thamsen , Yehia Elkhatib , Paul Harvey , Syed Waqar Nabi , Jeremy Singer , Wim Vanderbauwhede
‹ Prev 1 2 3 10 Next ›