Machine Learning · Statistics
When Does Stochastic Gradient Algorithm Work Well?
Lam M. Nguyen, Nam H. Nguyen, Dzung T. Phan, Jayant R. Kalagnanam +1
2018-12-27
Machine Learning · Statistics
Statistical Inference for Model Parameters in Stochastic Gradient Descent
Xi Chen, Jason D. Lee, Xin T. Tong, Yichen Zhang
2023-11-02
Machine Learning · Computer Science
Staleness-aware Async-SGD for Distributed Deep Learning
Wei Zhang, Suyog Gupta, Xiangru Lian, Ji Liu
2016-04-06
Machine Learning · Computer Science
Stochastic Gradient Descent for Nonconvex Learning without Bounded Gradient Assumptions
Yunwen Lei, Ting Hu, Guiying Li, Ke Tang
2019-12-16
Machine Learning · Computer Science
A Simplified Analysis of SGD for Linear Regression with Weight Averaging
Alexandru Meterez, Depen Morwani, Costin-Andrei Oncescu, Jingfeng Wu +2
2025-06-19
Machine Learning · Computer Science
Always-Sparse Training by Growing Connections with Guided Stochastic Exploration
Mike Heddes, Narayan Srinivasa, Tony Givargis, Alexandru Nicolau
2025-05-01
Machine Learning · Computer Science
Asynchronous Stochastic Gradient Descent with Delay Compensation
Shuxin Zheng, Qi Meng, Taifeng Wang, Wei Chen +3
2020-02-19
Machine Learning · Computer Science
Asynchronous Stochastic Gradient Descent with Decoupled Backpropagation and Layer-Wise Updates
Cabrel Teguemne Fokam, Khaleelulla Khan Nazeer, Lukas König, David Kappel +1
2025-02-10
Machine Learning · Statistics
Differential Equations for Modeling Asynchronous Algorithms
Li He, Qi Meng, Wei Chen, Zhi-Ming Ma +1
2018-05-09
Distributed, Parallel, and Cluster Computing · Computer Science
Adaptive Elastic Training for Sparse Deep Learning on Heterogeneous Multi-GPU Servers
Yujing Ma, Florin Rusu, Kesheng Wu, Alexander Sim
2021-10-15
Machine Learning · Computer Science
Online Learning to Sample
Guillaume Bouchard, Théo Trouillon, Julien Perez, Adrien Gaidon
2016-03-16