Related papers: Adaptive Scheduling for Machine Learning Tasks ove…

Linear Regression over Networks with Communication Guarantees

A key functionality of emerging connected autonomous systems such as smart cities, smart transportation systems, and the industrial Internet-of-Things, is the ability to process and learn from data collected at different physical locations.…

Machine Learning · Computer Science 2021-03-09 Konstantinos Gatsis

Poster: Flexible Scheduling of Network and Computing Resources for Distributed AI Tasks

Many emerging Artificial Intelligence (AI) applications require on-demand provisioning of large-scale computing, which can only be enabled by leveraging distributed computing services interconnected through networking. To address such…

Networking and Internet Architecture · Computer Science 2024-07-09 Ruikun Wang , Jiawei Zhang , Qiaolun Zhang , Bojun Zhang , Zhiqun Gu , Aryanaz Attarpour , Yuefeng Ji , Massimo Tornatore

Learning to Schedule: A Supervised Learning Framework for Network-Aware Scheduling of Data-Intensive Workloads

Distributed cloud environments hosting data-intensive applications often experience slowdowns due to network congestion, asymmetric bandwidth, and inter-node data shuffling. These factors are typically not captured by traditional host-level…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Sankalpa Timilsina , Susmit Shannigrahi

Towards a Centralized Scheduling Framework for Communication Flows in Distributed Systems

The overall performance of a distributed system is highly dependent on the communication efficiency of the system. Although network resources (links, bandwidth) are becoming increasingly more available, the communication performance of data…

Data Structures and Algorithms · Computer Science 2009-06-02 Mugurel Ionut Andreica , Eliana-Dina Tirsa , Nicolae Tapus , Florin Pop , Ciprian Mihai Dobre

ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning

Asynchronous methods are fundamental for parallelizing computations in distributed machine learning. They aim to accelerate training by fully utilizing all available resources. However, their greedy approach can lead to inefficiencies using…

Machine Learning · Computer Science 2025-05-23 Artavazd Maranjyan , El Mehdi Saad , Peter Richtárik , Francesco Orabona

Adaptive Scheduling for Multi-Task Learning

To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we…

Machine Learning · Computer Science 2019-09-17 Sébastien Jean , Orhan Firat , Melvin Johnson

Real-Time Scheduling via Reinforcement Learning

Cyber-physical systems, such as mobile robots, must respond adaptively to dynamic operating conditions. Effective operation of these systems requires that sensing and actuation tasks are performed in a timely manner. Additionally, execution…

Machine Learning · Computer Science 2012-03-19 Robert Glaubius , Terry Tidwell , Christopher Gill , William D. Smart

Toward Efficient Online Scheduling for Distributed Machine Learning Systems

Recent years have witnessed a rapid growth of distributed machine learning (ML) frameworks, which exploit the massive parallelism of computing clusters to expedite ML training. However, the proliferation of distributed ML frameworks also…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-05-16 Menglu Yu , Jia Liu , Chuan Wu , Bo Ji , Elizabeth S. Bentley

Distributed Multi-Task Relationship Learning

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu , Sinno Jialin Pan , Qirong Ho

Online Optimization for Network Resource Allocation and Comparison with Reinforcement Learning Techniques

We tackle in this paper an online network resource allocation problem with job transfers. The network is composed of many servers connected by communication links. The system operates in discrete time; at each time slot, the administrator…

Machine Learning · Statistics 2023-11-17 Ahmed Sid-Ali , Ioannis Lambadaris , Yiqiang Q. Zhao , Gennady Shaikhet , Amirhossein Asgharnia

Adaptive Multi-task Learning for Probabilistic Load Forecasting

Simultaneous load forecasting across multiple entities (e.g., regions, buildings) is crucial for the efficient, reliable, and cost-effective operation of power systems. Accurate load forecasting is a challenging problem due to the inherent…

Machine Learning · Computer Science 2026-01-21 Onintze Zaballa , Verónica Álvarez , Santiago Mazuelas

Diffusion Adaptation over Networks

Adaptive networks are well-suited to perform decentralized information processing and optimization tasks and to model various types of self-organized and complex behavior encountered in nature. Adaptive networks consist of a collection of…

Multiagent Systems · Computer Science 2013-05-07 Ali H. Sayed

Adaptive Dispatching of Tasks in the Cloud

The increasingly wide application of Cloud Computing enables the consolidation of tens of thousands of applications in shared infrastructures. Thus, meeting the quality of service requirements of so many diverse applications in such shared…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-01-06 Lan Wang , Erol Gelenbe

Machine Learning and CPU (Central Processing Unit) Scheduling Co-Optimization over a Network of Computing Centers

In the rapidly evolving research on artificial intelligence (AI) the demand for fast, computationally efficient, and scalable solutions has increased in recent years. The problem of optimizing the computing resources for distributed machine…

Machine Learning · Computer Science 2025-10-30 Mohammadreza Doostmohammadian , Zulfiya R. Gabidullina , Hamid R. Rabiee

Edge Learning for Large-Scale Internet of Things With Task-Oriented Efficient Communication

In the Internet of Things (IoT) networks, edge learning for data-driven tasks provides intelligent applications and services. As the network size becomes large, different users may generate distinct datasets. Thus, to suit multiple edge…

Information Theory · Computer Science 2023-05-02 Haihui Xie , Minghua Xia , Peiran Wu , Shuai Wang , H. Vincent Poor

A Survey on Distributed Machine Learning

The demand for artificial intelligence has grown significantly over the last decade and this growth has been fueled by advances in machine learning techniques and the ability to leverage hardware acceleration. However, in order to increase…

Machine Learning · Computer Science 2022-11-28 Joost Verbraeken , Matthijs Wolting , Jonathan Katzy , Jeroen Kloppenburg , Tim Verbelen , Jan S. Rellermeyer

Competing Adaptive Networks

Adaptive networks have the capability to pursue solutions of global stochastic optimization problems by relying only on local interactions within neighborhoods. The diffusion of information through repeated interactions allows for globally…

Multiagent Systems · Computer Science 2021-03-30 Stefan Vlaski , Ali H. Sayed

Towards Secure and Efficient Data Scheduling for Vehicular Social Networks

Efficient data transmission scheduling within vehicular environments poses a significant challenge due to the high mobility of such networks. Contemporary research predominantly centers on crafting cooperative scheduling algorithms tailored…

Machine Learning · Computer Science 2024-07-02 Youhua Xia , Tiehua Zhang , Jiong Jin , Ying He , Fei Yu

Adaptive Task Allocation for Heterogeneous Multi-Robot Teams with Evolving and Unknown Robot Capabilities

For multi-robot teams with heterogeneous capabilities, typical task allocation methods assign tasks to robots based on the suitability of the robots to perform certain tasks as well as the requirements of the task itself. However, in…

Robotics · Computer Science 2020-03-09 Yousef Emam , Siddharth Mayya , Gennaro Notomista , Addison Bohannon , Magnus Egerstedt

Reinforcement Learning for Adaptive Resource Scheduling in Complex System Environments

This study presents a novel computer system performance optimization and adaptive workload management scheduling algorithm based on Q-learning. In modern computing environments, characterized by increasing data volumes, task complexity, and…

Machine Learning · Computer Science 2024-11-11 Pochun Li , Yuyang Xiao , Jinghua Yan , Xuan Li , Xiaoye Wang