Distributed, Parallel, and Cluster Computing · Computer Science
LIBRA: Enabling Workload-aware Multi-dimensional Network Topology Optimization for Distributed Training of Large AI Models
William Won, Saeed Rashidi, Sudarshan Srinivasan, Tushar Krishna
2025-04-15
Distributed, Parallel, and Cluster Computing · Computer Science
DFLOP: A Data-driven Framework for Multimodal LLM Training Pipeline Optimization
Hyeonjun An, Sihyun Kim, Chaerim Lim, Hyunjoon Kim +8
2026-05-20
Distributed, Parallel, and Cluster Computing · Computer Science
Boosting Distributed Machine Learning Training Through Loss-tolerant Transmission Protocol
Zixuan Chen, Lei Shi, Xuandong Liu, Xin Ai +2
2023-08-15
Information Theory · Computer Science
A Novel Coded Computing Approach for Distributed Multi-Task Learning
Minquan Cheng, Yongkang Wang, Lingyu Zhang, Youlong Wu
2025-07-25
Machine Learning · Computer Science
DBLP: Phase-Aware Bounded-Loss Transport for Burst-Resilient Distributed ML Training
Zechen Ma, Zixi Qu, Jinyan Yi, David Lin +1
2026-05-05
Distributed, Parallel, and Cluster Computing · Computer Science
Communication-Efficient Distributed Deep Learning: A Comprehensive Survey
Zhenheng Tang, Shaohuai Shi, Wei Wang, Bo Li +1
2023-09-04
Machine Learning · Computer Science
Understanding and Accelerating the Training of Masked Diffusion Language Models
Chunsan Hong, Sanghyun Lee, Chieh-Hsin Lai, Satoshi Hayakawa +4
2026-05-14
Machine Learning · Computer Science
Dependable Distributed Training of Compressed Machine Learning Models
Francesco Malandrino, Giuseppe Di Giacomo, Marco Levorato, Carla Fabiana Chiasserini
2024-02-23
Machine Learning · Computer Science
Delay-Aware Hierarchical Federated Learning
Frank Po-Chen Lin, Seyyedali Hosseinalipour, Nicolò Michelusi, Christopher Brinton
2023-09-29
Distributed, Parallel, and Cluster Computing · Computer Science
Efficient Distributed MLLM Training with Cornstarch
Insu Jang, Runyu Lu, Nikhil Bansal, Ang Chen +1
2026-05-26
Networking and Internet Architecture · Computer Science
Machine Learning for Networking: Workflow, Advances and Opportunities
Mowei Wang, Yong Cui, Xin Wang, Shihan Xiao +1
2017-11-17
Distributed, Parallel, and Cluster Computing · Computer Science
Scaling Distributed Machine Learning with In-Network Aggregation
Amedeo Sapio, Marco Canini, Chen-Yu Ho, Jacob Nelson +6
2020-10-01
Robotics · Computer Science
MLLM-Fabric: Multimodal Large Language Model-Driven Robotic Framework for Fabric Sorting and Selection
Liman Wang, Hanyang Zhong, Tianyuan Wang, Shan Luo +1
2025-10-14
Distributed, Parallel, and Cluster Computing · Computer Science
Distributed Learning over Unreliable Networks
Chen Yu, Hanlin Tang, Cedric Renggli, Simon Kassing +4
2019-05-17
Networking and Internet Architecture · Computer Science
Enabling Fast and Flexible Distributed Deep Learning with Programmable Switches
Heng Pan, Penglai Cui, Zhenyu li, Ru Jia +10
2022-08-11
Machine Learning · Computer Science
Distributed Machine Learning via Sufficient Factor Broadcasting
Pengtao Xie, Jin Kyu Kim, Yi Zhou, Qirong Ho +3
2015-11-30
Machine Learning · Computer Science
Distributed Machine Learning via Sufficient Factor Broadcasting
Pengtao Xie, Jin Kyu Kim, Yi Zhou, Qirong Ho +3
2015-09-08
Optimization and Control · Mathematics
Large problems are not necessarily hard: A case study on distributed NMPC paying off
Gösta Stomberg, Maurice Raetsch, Alexander Engelmann, Timm Faulwasser
2025-04-16
Machine Learning · Computer Science
Collective Online Learning of Gaussian Processes in Massive Multi-Agent Systems
Trong Nghia Hoang, Quang Minh Hoang, Kian Hsiang Low, Jonathan How
2018-11-14
Networking and Internet Architecture · Computer Science
NetLLM: Adapting Large Language Models for Networking
Duo Wu, Xianda Wang, Yaqi Qiao, Zhi Wang +3
2024-08-07
Information Theory · Computer Science
Model-Driven Deep Learning for Physical Layer Communications
Hengtao He, Shi Jin, Chao-Kai Wen, Feifei Gao +2
2019-02-26