Related papers: Partitioning Uncertain Workflows

A Bayesian Approach to the Partitioning of Workflows

When partitioning workflows in realistic scenarios, the knowledge of the processing units is often vague or unknown. A naive approach to addressing this issue is to perform many controlled experiments for different workloads, each…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-11-03 Freddy C. Chua , Bernardo A. Huberman

Workflow Partitioning and Deployment on the Cloud using Orchestra

Orchestrating service-oriented workflows is typically based on a design model that routes both data and control through a single point - the centralised workflow engine. This causes scalability problems that include the unnecessary…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-15 Ward Jaradat , Alan Dearle , Adam Barker

A network partition method for solving large-scale complex nonlinear processes

A numerical framework based on network partition and operator splitting is developed to solve nonlinear differential equations of large-scale dynamic processes encountered in physics, chemistry and biology. Under the assumption that those…

Computational Physics · Physics 2018-01-22 Shucheng Pan , Jianhang Wang , Xiangyu Hu , Nikolaus A. Adams

Variable Partitioning for Distributed Optimization

This paper is about how to partition decision variables while decomposing a large-scale optimization problem for the best performance of distributed solution methods. Solving a large-scale optimization problem sequen- tially can be…

Optimization and Control · Mathematics 2017-10-26 Yuchen Zheng , Ilbin Lee , Nicoleta Serban

Stream Iterative Distributed Coded Computing for Learning Applications in Heterogeneous Systems

To improve the utility of learning applications and render machine learning solutions feasible for complex applications, a substantial amount of heavy computations is needed. Thus, it is essential to delegate the computations among several…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-04-29 Homa Esfahanizadeh , Alejandro Cohen , Muriel Medard

(Re)partitioning for stream-enabled computation

Partitioning an input graph over a set of workers is a complex operation. Objectives are twofold: split the work evenly, so that every worker gets an equal share, and minimize edge cut to achieve a good work locality (i.e. workers can work…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-11-28 Le Merrer Erwan , Liang Yizhong , Trédan Gilles

Strict Partitioning for Sporadic Rigid Gang Tasks

The rigid gang task model is based on the idea of executing multiple threads simultaneously on a fixed number of processors to increase efficiency and performance. Although there is extensive literature on global rigid gang scheduling,…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-09-04 Binqi Sun , Tomasz Kloda , Marco Caccamo

Stream Distributed Coded Computing

The emerging large-scale and data-hungry algorithms require the computations to be delegated from a central server to several worker nodes. One major challenge in the distributed computations is to tackle delays and failures caused by the…

Information Theory · Computer Science 2021-03-03 Alejandro Cohen , Guillaume Thiran , Homa Esfahanizadeh , Muriel Médard

The TensorFlow Partitioning and Scheduling Problem: It's the Critical Path!

State-of-the-art data flow systems such as TensorFlow impose iterative calculations on large graphs that need to be partitioned on heterogeneous devices such as CPUs, GPUs, and TPUs. However, partitioning can not be viewed in isolation.…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-07 Ruben Mayer , Christian Mayer , Larissa Laich

Workflow environments for advanced cyberinfrastructure platforms

Progress in science is deeply bound to the effective use of high-performance computing infrastructures and to the efficient extraction of knowledge from vast amounts of data. Such data comes from different sources that follow a cycle…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-15 Rosa M Badia , Jorge Ejarque , Francesc Lordan , Daniele Lezzi , Javier Conejero , Javier Álvarez Cid-Fuentes , Yolanda Becerra , Anna Queralt

Recursive Multi-Section on the Fly: Shared-Memory Streaming Algorithms for Hierarchical Graph Partitioning and Process Mapping

Partitioning a graph into balanced blocks such that few edges run between blocks is a key problem for large-scale distributed processing. A current trend for partitioning huge graphs are streaming algorithms, which use low computational…

Data Structures and Algorithms · Computer Science 2022-02-02 Marcelo Fonseca Faraj , Christian Schulz

WawPart: Workload-Aware Partitioning of Knowledge Graphs

Large-scale datasets in the form of knowledge graphs are often used in numerous domains, today. A knowledge graphs size often exceeds the capacity of a single computer system, especially if the graph must be stored in main memory. To…

Databases · Computer Science 2022-03-29 Amitabh Priyadarshi , Krzysztof J. Kochut

Workflows to driving high-performance interactive supercomputing for urgent decision making

Interactive urgent computing is a small but growing user of supercomputing resources. However there are numerous technical challenges that must be overcome to make supercomputers fully suited to the wide range of urgent workloads which…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-29 Nick Brown , Rupert Nash , Gordon Gibb , Evgenij Belikov , Artur Podobas , Wei Der Chien , Stefano Markidis , Markus Flatken , Andreas Gerndt

Efficient partitioning and reordering of conforming virtual element discretizations for large scale Discrete Fracture Network flow parallel solvers

Discrete Fracture Network models are largely used for very large scale geological flow simulations. For this reason numerical methods require an investigation of tools for efficient parallel solutions on High Performance Computing systems.…

Numerical Analysis · Mathematics 2021-06-21 Stefano Berrone , Alice Raeli

Flow-Bench: A Dataset for Computational Workflow Anomaly Detection

A computational workflow, also known as workflow, consists of tasks that must be executed in a specific order to attain a specific goal. Often, in fields such as biology, chemistry, physics, and data science, among others, these workflows…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-06-14 George Papadimitriou , Hongwei Jin , Cong Wang , Rajiv Mayani , Krishnan Raghavan , Anirban Mandal , Prasanna Balaprakash , Ewa Deelman

Methods for Partitioning Data to Improve Parallel Execution Time for Sorting on Heterogeneous Clusters

The aim of the paper is to introduce general techniques in order to optimize the parallel execution time of sorting on a distributed architectures with processors of various speeds. Such an application requires a partitioning step. For…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-08-16 Christophe Cérin , Jean-Christophe Dubacq , Jean-Louis Roch , the SafeScale Collaboration

Machine Learning-based Selection of Graph Partitioning Strategy Using the Characteristics of Graph Data and Algorithm

Analyzing large graph data is an essential part of many modern applications, such as social networks. Due to its large computational complexity, distributed processing is frequently employed. This requires graph data to be divided across…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-09-12 YoungJoon Park , DongKyu Lee , Tien-Cuong Bui

Optimal Divisible Load Scheduling for Resource-Sharing Network

Scheduling is an important task allowing parallel systems to perform efficiently and reliably. For modern computation systems, divisible load is a special type of data which can be divided into arbitrary sizes and independently processed in…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-07 Fei Wu , Yang Cao , Thomas Robertazzi

Resolvable Designs for Speeding up Distributed Computing

Distributed computing frameworks such as MapReduce are often used to process large computational jobs. They operate by partitioning each job into smaller tasks executed on different servers. The servers also need to exchange intermediate…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-04-20 Konstantinos Konstantinidis , Aditya Ramamoorthy

Deep Learning on Operational Facility Data Related to Large-Scale Distributed Area Scientific Workflows

Distributed computing platforms provide a robust mechanism to perform large-scale computations by splitting the task and data among multiple locations, possibly located thousands of miles apart geographically. Although such distribution of…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-24 Alok Singh , Eric Stephan , Malachi Schram , Ilkay Altintas