Related papers: A Novel DNN Training Framework via Data Sampling a…

A Unified Framework for Training Neural Networks

The lack of mathematical tractability of Deep Neural Networks (DNNs) has hindered progress towards a unified convergence analysis of training algorithms in the general setting. We propose a unified optimization framework for…

Machine Learning · Computer Science 2018-05-24 Hadi Ghauch, Hossein Shokri-Ghadikolaei, Carlo Fischione, Mikael Skoglund

A General Multiple Data Augmentation Based Framework for Training Deep Neural Networks

Deep neural networks (DNNs) often rely on massive labelled data for training, which is inaccessible in many applications. Data augmentation (DA) tackles data scarcity by creating new labelled data from available ones. Different DA methods…

Neural and Evolutionary Computing · Computer Science 2022-05-31 Binyan Hu, Yu Sun, A. K. Qin

Simplifying Distributed Neural Network Training on Massive Graphs: Randomized Partitions Improve Model Aggregation

Distributed training of GNNs enables learning on massive graphs (e.g., social and e-commerce networks) that exceed the storage and computational capacity of a single machine. To reach performance comparable to centralized training,…

Machine Learning · Computer Science 2023-05-18 Jiong Zhu, Aishwarya Reganti, Edward Huang, Charles Dickens, Nikhil Rao, Karthik Subbian, Danai Koutra

Comprehensive Evaluation of GNN Training Systems: A Data Management Perspective

Many Graph Neural Network (GNN) training systems have emerged recently to support efficient GNN training. Since GNNs embody complex data dependencies between training samples, the training of GNNs should address distinct challenges…

Machine Learning · Computer Science 2024-03-21 Hao Yuan, Yajiong Liu, Yanfeng Zhang, Xin Ai, Qiange Wang, Chaoyi Chen, Yu Gu, Ge Yu

A New Perspective for Understanding Generalization Gap of Deep Neural Networks Trained with Large Batch Sizes

Deep neural networks (DNNs) are typically optimized using various forms of mini-batch gradient descent algorithm. A major motivation for mini-batch gradient descent is that with a suitably chosen batch size, available computing resources…

Machine Learning · Computer Science 2022-10-25 Oyebade K. Oyedotun, Konstantinos Papadopoulos, Djamila Aouada

Neural Network Training Techniques Regularize Optimization Trajectory: An Empirical Study

Modern deep neural network (DNN) training utilizes various techniques, e.g., nonlinear activation functions, batch normalization, skip-connections, etc. Despite their effectiveness, it is still mysterious how they help accelerate…

Machine Learning · Computer Science 2024-03-05 Cheng Chen, Junjie Yang, Yi Zhou

Optimizing Task Placement and Online Scheduling for Distributed GNN Training Acceleration

Training Graph Neural Networks (GNNs) on large graphs is resource-intensive and time-consuming, mainly because the large graph data cannot fit into the memory of a single machine and has to be fetched from distributed graph storage…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-23 Ziyue Luo, Yixin Bao, Chuan Wu

Normalization Techniques in Training DNNs: Methodology, Analysis and Application

Normalization techniques are essential for accelerating the training and improving the generalization of deep neural networks (DNNs), and have successfully been used in various applications. This paper reviews and comments on the past,…

Machine Learning · Computer Science 2020-09-29 Lei Huang, Jie Qin, Yi Zhou, Fan Zhu, Li Liu, Ling Shao

A Distributed Training Architecture For Combinatorial Optimization

In recent years, graph neural networks (GNNs) have been widely applied in tackling combinatorial optimization problems. However, existing methods still suffer from limited accuracy when addressing such problems on complex graphs and exhibit poor…

Machine Learning · Computer Science 2025-11-13 Yuyao Long

Relearning Forgotten Knowledge: on Forgetting, Overfit and Training-Free Ensembles of DNNs

The infrequent occurrence of overfit in deep neural networks is perplexing. On the one hand, theory predicts that as models get larger they should eventually become too specialized for a specific training set, with ensuing decrease in…

Machine Learning · Computer Science 2023-12-29 Uri Stern, Daphna Weinshall

Neuron Coverage-Guided Domain Generalization

This paper focuses on the domain generalization task where domain knowledge is unavailable and, even worse, only samples from a single domain can be utilized during training. Our motivation originates from the recent progress in deep…

Machine Learning · Computer Science 2022-03-08 Chris Xing Tian, Haoliang Li, Xiaofei Xie, Yang Liu, Shiqi Wang

DCNNs on a Diet: Sampling Strategies for Reducing the Training Set Size

Large-scale supervised classification algorithms, especially those based on deep convolutional neural networks (DCNNs), require vast amounts of training data to achieve state-of-the-art performance. Decreasing this data requirement would…

Computer Vision and Pattern Recognition · Computer Science 2016-06-15 Maya Kabkab, Azadeh Alavi, Rama Chellappa

Accelerating DNN Training in Wireless Federated Edge Learning Systems

The training task in classical machine learning models, such as deep neural networks, is generally implemented at a remote cloud center for centralized learning, which is typically time-consuming and resource-hungry. It also incurs serious…

Machine Learning · Computer Science 2020-10-27 Jinke Ren, Guanding Yu, Guangyao Ding

A Theoretical-Empirical Approach to Estimating Sample Complexity of DNNs

This paper focuses on understanding how the generalization error scales with the amount of the training data for deep neural networks (DNNs). Existing techniques in statistical learning require computation of capacity measures, such as VC…

Machine Learning · Computer Science 2021-05-06 Devansh Bisla, Apoorva Nandini Saridena, Anna Choromanska

SortedNet: A Scalable and Generalized Framework for Training Modular Deep Neural Networks

Deep neural networks (DNNs) must cater to a variety of users with different performance needs and budgets, leading to the costly practice of training, storing, and maintaining numerous user/task-specific models. There are solutions in the…

Machine Learning · Computer Science 2024-06-04 Mojtaba Valipour, Mehdi Rezagholizadeh, Hossein Rajabzadeh, Parsa Kavehzadeh, Marzieh Tahaei, Boxing Chen, Ali Ghodsi

Training With Data Dependent Dynamic Learning Rates

Recently, many first- and second-order variants of SGD have been proposed to facilitate training of Deep Neural Networks (DNNs). A common limitation of these works stems from the fact that they use the same learning rate across all instances…

Machine Learning · Computer Science 2021-05-31 Shreyas Saxena, Nidhi Vyas, Dennis DeCoste

A Comprehensive Survey on Distributed Training of Graph Neural Networks

Graph neural networks (GNNs) have been demonstrated to be a powerful algorithmic model in broad application fields for their effectiveness in learning over graphs. To scale GNN training up for large-scale and ever-growing graphs, the most…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-30 Haiyang Lin, Mingyu Yan, Xiaochun Ye, Dongrui Fan, Shirui Pan, Wenguang Chen, Yuan Xie

A Framework for Parallel and Distributed Training of Neural Networks

The aim of this paper is to develop a general framework for training neural networks (NNs) in a distributed environment, where training data is partitioned over a set of agents that communicate with each other through a sparse, possibly…

Machine Learning · Statistics 2017-04-21 Simone Scardapane, Paolo Di Lorenzo

Heterogeneous Resource Allocation with Multi-task Learning for Wireless Networks

The optimal solution to an optimization problem depends on the problem's objective function, constraints, and size. While deep neural networks (DNNs) have proven effective in solving optimization problems, changes in the problem's size,…

Machine Learning · Computer Science 2025-02-17 Nikos A. Mitsiou, Pavlos S. Bouzinis, Panagiotis G. Sarigiannidis, George K. Karagiannidis

Distributed Graph Neural Network Training: A Survey

Graph neural networks (GNNs) are a type of deep learning model trained on graphs and have been successfully applied in various domains. Despite the effectiveness of GNNs, it is still challenging for GNNs to efficiently scale to…

Machine Learning · Computer Science 2023-08-28 Yingxia Shao, Hongzheng Li, Xizhi Gu, Hongbo Yin, Yawen Li, Xupeng Miao, Wentao Zhang, Bin Cui, Lei Chen