Related papers: Supervised Distributed Computing

Supervised Distributed Computing: Efficiency and Robustness under a Majority of Adversarial Workers

We consider a recently proposed \emph{supervised distributed computing} paradigm \cite{augustine2025supervised} that extends and refines the standard master-worker paradigm for parallel computations. In this paradigm, there is a supervisor,…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-15 John Augustine , Henning Hillebrandt , Manish Kumar , Christian Scheideler , Julian Werthmann

Stream Distributed Coded Computing

The emerging large-scale and data-hungry algorithms require the computations to be delegated from a central server to several worker nodes. One major challenge in the distributed computations is to tackle delays and failures caused by the…

Information Theory · Computer Science 2021-03-03 Alejandro Cohen , Guillaume Thiran , Homa Esfahanizadeh , Muriel Médard

A Multi-Layered Distributed Computing Framework for Enhanced Edge Computing

The rise of the Internet of Things and edge computing has shifted computing resources closer to end-users, benefiting numerous delay-sensitive, computation-intensive applications. To speed up computation, distributed computing is a…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-10 Ke Ma , Junfei Xie

Task Scheduling in Geo-Distributed Computing: A Survey

Geo-distributed computing, a paradigm that assigns computational tasks to globally distributed nodes, has emerged as a promising approach in cloud computing, edge computing, cloud-edge computing and supercomputer computing (HPC). It enables…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-28 Yujian Wu , Shanjiang Tang , Ce Yu , Bin Yang , Chao Sun , Jian Xiao , Hutong Wu

Learning to Schedule: A Supervised Learning Framework for Network-Aware Scheduling of Data-Intensive Workloads

Distributed cloud environments hosting data-intensive applications often experience slowdowns due to network congestion, asymmetric bandwidth, and inter-node data shuffling. These factors are typically not captured by traditional host-level…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Sankalpa Timilsina , Susmit Shannigrahi

Scheduling and data redistribution strategies on star platforms

In this work we are interested in the problem of scheduling and redistributing data on master-slave platforms. We consider the case were the workers possess initial loads, some of which having to be redistributed in order to balance their…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-08-16 Loris Marchal , Veronika Rehn , Yves Robert , Frédéric Vivien

Distributed Supervised Learning using Neural Networks

Distributed learning is the problem of inferring a function in the case where training data is distributed among multiple geographically separated sources. Particularly, the focus is on designing learning strategies with low computational…

Machine Learning · Statistics 2016-07-22 Simone Scardapane

Heterogeneous Coded Computation across Heterogeneous Workers

Coded distributed computing framework enables large-scale machine learning (ML) models to be trained efficiently in a distributed manner, while mitigating the straggler effect. In this work, we consider a multi-task assignment problem in a…

Information Theory · Computer Science 2019-05-21 Yuxuan Sun , Junlin Zhao , Sheng Zhou , Deniz Gündüz

Scheduling in distributed systems: A cloud computing perspective

Scheduling is essentially a decision-making process that enables resource sharing among a number of activities by determining their execution order on the set of available resources. The emergence of distributed systems brought new…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-11 Luiz F. Bittencourt , Alfredo Goldman , Edmundo R. M. Madeira , Nelson L. S. da Fonseca , Rizos Sakellariou

Considering Human Aspects on Strategies for Designing and Managing Distributed Human Computation

A human computation system can be viewed as a distributed system in which the processors are humans, called workers. Such systems harness the cognitive power of a group of workers connected to the Internet to execute relatively simple…

Human-Computer Interaction · Computer Science 2015-06-09 Lesandro Ponciano , Francisco Brasileiro , Nazareno Andrade , Lívia Sampaio

Distributed Linearly Separable Computation with Arbitrary Heterogeneous Data Assignment

Distributed linearly separable computation is a fundamental problem in large-scale distributed systems, requiring the computation of linearly separable functions over different datasets across distributed workers. This paper studies a…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-16 Ziting Zhang , Kai Wan , Minquan Cheng , Shuo Shao , Giuseppe Caire

How to Optimally Allocate Resources for Coded Distributed Computing?

Today's data centers have an abundance of computing resources, hosting server clusters consisting of as many as tens or hundreds of thousands of machines. To execute a complex computing task over a data center, it is natural to distribute…

Information Theory · Computer Science 2017-02-24 Qian Yu , Songze Li , Mohammad Ali Maddah-Ali , A. Salman Avestimehr

A Survey on Dynamic Job Scheduling in Grid Environment Based on Heuristic Algorithms

Computational Grids are a new trend in distributed computing systems. They allow the sharing of geographically distributed resources in an efficient way, extending the boundaries of what we perceive as distributed computing. Various…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-08-24 D. Thilagavathi , Antony Selvadoss Thanamani

Computation Scheduling for Distributed Machine Learning with Straggling Workers

We study scheduling of computation tasks across n workers in a large scale distributed learning problem with the help of a master. Computation and communication delays are assumed to be random, and redundant computations are assigned to…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-08 Mohammad Mohammadi Amiri , Deniz Gunduz

Vers: fully distributed Coded Computing System with Distributed Encoding

Coded computing has proved to be useful in distributed computing. We have observed that almost all coded computing systems studied so far consider a setup of one master and some workers. However, recently emerging technologies such as…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-04-13 Nastaran Abadi Khooshemehr , Mohammad Ali Maddah-Ali

Online Job Scheduling in Distributed Machine Learning Clusters

Nowadays large-scale distributed machine learning systems have been deployed to support various analytics and intelligence services in IT firms. To train a large dataset and derive the prediction/inference model, e.g., a deep neural…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-01-04 Yixin Bao , Yanghua Peng , Chuan Wu , Zongpeng Li

Hyper: Distributed Cloud Processing for Large-Scale Deep Learning Tasks

Training and deploying deep learning models in real-world applications require processing large amounts of data. This is a challenging task when the amount of data grows to a hundred terabytes, or even, petabyte-scale. We introduce a hybrid…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-17 Davit Buniatyan

Scheduling Stochastic Real-Time Jobs in Unreliable Workers

We consider a distributed computing network consisting of a master and multiple workers processing tasks of different types. The master is running multiple applications. Each application stochastically generates real-time jobs with a strict…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-31 Yu-Pin Hsu , Yu-Chih Huang , Shin-Lin Shieh

Distributed Multi-Task Relationship Learning

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu , Sinno Jialin Pan , Qirong Ho

Stream Iterative Distributed Coded Computing for Learning Applications in Heterogeneous Systems

To improve the utility of learning applications and render machine learning solutions feasible for complex applications, a substantial amount of heavy computations is needed. Thus, it is essential to delegate the computations among several…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-04-29 Homa Esfahanizadeh , Alejandro Cohen , Muriel Medard