Machine Learning · Computer Science
Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds
Masafumi Yamazaki, Akihiko Kasagi, Akihiro Tabuchi, Takumi Honda +5
2019-04-01
Machine Learning · Computer Science
Massively Distributed SGD: ImageNet/ResNet-50 Training in a Flash
Hiroaki Mikami, Hisahiro Suganuma, Pongsakorn U-chupala, Yoshiki Tanaka +1
2019-03-06
Machine Learning · Computer Science
Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour
Arissa Wongpanich, Hieu Pham, James Demmel, Mingxing Tan +3
2020-11-06
Machine Learning · Computer Science
Highly Scalable Deep Learning Training System with Mixed-Precision: Training ImageNet in Four Minutes
Xianyan Jia, Shutao Song, Wei He, Yangzihao Wang +10
2018-07-31
Machine Learning · Statistics
Scale out for large minibatch SGD: Residual network training on ImageNet-1K with improved accuracy and reduced time to train
Valeriu Codreanu, Damian Podareanu, Vikram Saletore
2017-11-17
Computer Vision and Pattern Recognition · Computer Science
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Priya Goyal, Piotr Dollár, Ross Girshick, Pieter Noordhuis +5
2018-05-02
Distributed, Parallel, and Cluster Computing · Computer Science
Towards Scalable Distributed Training of Deep Learning on Public Cloud Clusters
Shaohuai Shi, Xianhao Zhou, Shutao Song, Xingyao Wang +20
2020-10-21
Computer Vision and Pattern Recognition · Computer Science
ImageNet Training in Minutes
Yang You, Zhao Zhang, Cho-Jui Hsieh, James Demmel +1
2018-02-01
Distributed, Parallel, and Cluster Computing · Computer Science
Extremely Large Minibatch SGD: Training ResNet-50 on ImageNet in 15 Minutes
Takuya Akiba, Shuji Suzuki, Keisuke Fukuda
2017-11-15
Computer Vision and Pattern Recognition · Computer Science
What can we learn from misclassified ImageNet images?
Shixian Wen, Amanda Sofie Rios, Kiran Lekkala, Laurent Itti
2022-01-21
Distributed, Parallel, and Cluster Computing · Computer Science
Optimizing Distributed Training Approaches for Scaling Neural Networks
Vishnu Vardhan Baligodugula, Fathi Amsaad
2025-04-01
Computer Vision and Pattern Recognition · Computer Science
High-Performance Large-Scale Image Recognition Without Normalization
Andrew Brock, Soham De, Samuel L. Smith, Karen Simonyan
2021-02-12
Distributed, Parallel, and Cluster Computing · Computer Science
Efficient Training of Convolutional Neural Nets on Large Distributed Systems
Sameer Kumar, Dheeraj Sreedhar, Vaibhav Saxena, Yogish Sabharwal +1
2017-11-03
Machine Learning · Computer Science
Deep Learning Models on CPUs: A Methodology for Efficient Training
Quchen Fu, Ramesh Chukka, Keith Achorn, Thomas Atta-fosu +4
2023-06-21
Computer Vision and Pattern Recognition · Computer Science
Data-Efficient Deep Learning Method for Image Classification Using Data Augmentation, Focal Cosine Loss, and Ensemble
Byeongjo Kim, Chanran Kim, Jaehoon Lee, Jein Song +1
2020-07-16
Computer Vision and Pattern Recognition · Computer Science
Training and Inference within 1 Second -- Tackle Cross-Sensor Degradation of Real-World Pansharpening with Efficient Residual Feature Tailoring
Tianyu Xin, Jin-Liang Xiao, Zeyu Xia, Shan Yin +1
2025-11-21
Machine Learning · Computer Science
The Limit of the Batch Size
Yang You, Yuhui Wang, Huan Zhang, Zhao Zhang +2
2020-06-16
Computer Vision and Pattern Recognition · Computer Science
Going deeper with Image Transformers
Hugo Touvron, Matthieu Cord, Alexandre Sablayrolles, Gabriel Synnaeve +1
2021-04-08
Distributed, Parallel, and Cluster Computing · Computer Science
PowerAI DDL
Minsik Cho, Ulrich Finkler, Sameer Kumar, David Kung +2
2017-08-08
Computer Vision and Pattern Recognition · Computer Science
Efficient Image Dataset Classification Difficulty Estimation for Predicting Deep-Learning Accuracy
Florian Scheidegger, Roxana Istrate, Giovanni Mariani, Luca Benini +2
2018-03-28
Distributed, Parallel, and Cluster Computing · Computer Science
Optimizing Network Performance for Distributed DNN Training on GPU Clusters: ImageNet/AlexNet Training in 1.5 Minutes
Peng Sun, Wansen Feng, Ruobing Han, Shengen Yan +1
2019-10-23