Distributed, Parallel, and Cluster Computing · Computer Science
Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters
Shaohuai Shi, Xianhao Zhou, Shutao Song, Xingyao Wang +20
2020-10-21
Machine Learning · Computer Science
Chainer: A Deep Learning Framework for Accelerating the Research Cycle
Seiya Tokui, Ryosuke Okuta, Takuya Akiba, Yusuke Niitani +6
2019-08-02
Distributed, Parallel, and Cluster Computing · Computer Science
Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs
Shaohuai Shi, Qiang Wang, Xiaowen Chu
2018-08-21
Distributed, Parallel, and Cluster Computing · Computer Science
Characterizing and Understanding Distributed GNN Training on GPUs
Haiyang Lin, Mingyu Yan, Xiaocheng Yang, Mo Zou +3
2022-04-19
Machine Learning · Computer Science
Massively Distributed SGD: ImageNet/ResNet-50 Training in a Flash
Hiroaki Mikami, Hisahiro Suganuma, Pongsakorn U-chupala, Yoshiki Tanaka +1
2019-03-06
Distributed, Parallel, and Cluster Computing · Computer Science
Efficient Scaling of Dynamic Graph Neural Networks
Venkatesan T. Chakaravarthy, Shivmaran S. Pandian, Saurabh Raje, Yogish Sabharwal +2
2021-09-17
Distributed, Parallel, and Cluster Computing · Computer Science
Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes
Peng Sun, Wansen Feng, Ruobing Han, Shengen Yan +1
2019-10-23
Distributed, Parallel, and Cluster Computing · Computer Science
PowerAI DDL
Minsik Cho, Ulrich Finkler, Sameer Kumar, David Kung +2
2017-08-08
Machine Learning · Statistics
Distributed Training of Deep Neural Networks with Theoretical Analysis: Under SSP Setting
Abhimanu Kumar, Pengtao Xie, Junming Yin, Eric P. Xing
2016-10-04
Machine Learning · Computer Science
DistGNN: Scalable Distributed Training for Large-Scale Graph Neural Networks
Vasimuddin Md, Sanchit Misra, Guixiang Ma, Ramanarayan Mohanty +5
2021-04-19
Distributed, Parallel, and Cluster Computing · Computer Science
FusionLLM: A Decentralized LLM Training System on Geo-distributed GPUs with Adaptive Compression
Zhenheng Tang, Xueze Kang, Yiming Yin, Xinglin Pan +10
2024-10-17
Machine Learning · Computer Science
GSR-GNN: Training Acceleration and Memory-Saving Framework of Deep GNNs on Circuit Graph
Yuebo Luo, Shiyang Li, Yifei Feng, Vishal Kancharla +2
2026-03-31
Machine Learning · Computer Science
Distributing Deep Learning Hyperparameter Tuning for 3D Medical Image Segmentation
Josep Lluis Berral, Oriol Aranda, Juan Luis Dominguez, Jordi Torres
2021-11-01
Distributed, Parallel, and Cluster Computing · Computer Science
Parallelizing Training of Deep Generative Models on Massive Scientific Datasets
Sam Ade Jacobs, Brian Van Essen, David Hysom, Jae-Seung Yeom +10
2019-10-08
Machine Learning · Computer Science
Accurate, Efficient and Scalable Training of Graph Neural Networks
Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan +1
2020-10-08
Machine Learning · Computer Science
Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds
Masafumi Yamazaki, Akihiko Kasagi, Akihiro Tabuchi, Takumi Honda +5
2019-04-01
Machine Learning · Computer Science
Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
Xianyan Jia, Shutao Song, Wei He, Yangzihao Wang +10
2018-07-31