English
Related papers

Related papers: Parameter Re-Initialization through Cyclical Batch…

200 papers

Weight initialization plays a crucial role in the optimization behavior and convergence efficiency of neural networks. Most existing initialization methods, such as Xavier and Kaiming initializations, rely on random sampling and do not…

Machine Learning · Computer Science 2026-02-09 Shaowen Wang , Tariq Alkhalifah

We propose a novel network initialization method using Perlin noise for training image classification networks with a limited amount of data. Our main idea is to initialize the network parameters by solving an artificial noise…

Computer Vision and Pattern Recognition · Computer Science 2021-01-20 Nakamasa Inoue , Eisuke Yamagata , Hirokatsu Kataoka

Weight initialization plays an important role in neural network training. Widely used initialization methods are proposed and evaluated for networks that are trained from scratch. However, the growing number of pretrained models now offers…

Machine Learning · Computer Science 2023-12-01 Zhiqiu Xu , Yanjie Chen , Kirill Vishniakov , Yida Yin , Zhiqiang Shen , Trevor Darrell , Lingjie Liu , Zhuang Liu

Training a neural network (NN) depends on multiple factors, including but not limited to the initial weights. In this paper, we focus on initializing deep NN parameters such that it performs better, comparing to random or zero…

Machine Learning · Computer Science 2020-11-10 Mohamad H. Danesh

Re-initializing a neural network during training has been observed to improve generalization in recent works. Yet it is neither widely adopted in deep learning practice nor is it often used in state-of-the-art training protocols. This…

Machine Learning · Computer Science 2023-04-04 Sheheryar Zaidi , Tudor Berariu , Hyunjik Kim , Jörg Bornschein , Claudia Clopath , Yee Whye Teh , Razvan Pascanu

Residual networks (ResNet) and weight normalization play an important role in various deep learning applications. However, parameter initialization strategies have not been studied previously for weight normalized networks and, in practice,…

Machine Learning · Statistics 2019-10-31 Devansh Arpit , Victor Campos , Yoshua Bengio

Weight initialization is important for faster convergence and stability of deep neural networks training. In this paper, a robust initialization method is developed to address the training instability in long short-term memory (LSTM)…

To adapt to real-world data streams, continual learning (CL) systems must rapidly learn new concepts while preserving and utilizing prior knowledge. When it comes to adding new information to continually-trained deep neural networks (DNNs),…

Machine Learning · Computer Science 2025-07-02 Md Yousuf Harun , Christopher Kanan

Batch normalization (BN) is comprised of a normalization component followed by an affine transformation and has become essential for training deep neural networks. Standard initialization of each BN in a network sets the affine…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Jim Davis , Logan Frank

Convergence rate of training algorithms for neural networks is heavily affected by initialization of weights. In this paper, an original algorithm for initialization of weights in backpropagation neural net is presented with application to…

Machine Learning · Computer Science 2020-04-07 Nadir Murru , Rosaria Rossini

The aim of this paper is to introduce two widely applicable regularization methods based on the direct modification of weight matrices. The first method, Weight Reinitialization, utilizes a simplified Bayesian assumption with partially…

Machine Learning · Computer Science 2022-06-07 Patrik Reizinger , Bálint Gyires-Tóth

Initializing the weights and the biases is a key part of the training process of a neural network. Unlike the subsequent optimization phase, however, the initialization phase has gained only limited attention in the literature. In this…

Machine Learning · Computer Science 2019-09-06 Ingo Steinwart

Deep learning relies on good initialization schemes and hyperparameter choices prior to training a neural network. Random weight initializations induce random network ensembles, which give rise to the trainability, training speed, and…

Machine Learning · Statistics 2019-10-25 Rebekka Burkholz , Alina Dubatovka

Appropriate weight initialization has been of key importance to successfully train neural networks. Recently, batch normalization has diminished the role of weight initialization by simply normalizing each layer based on batch statistics.…

Computer Vision and Pattern Recognition · Computer Science 2022-08-03 Pedro Hermosilla , Michael Schelling , Tobias Ritschel , Timo Ropinski

Despite recent algorithmic advances, we still lack principled ways to leverage the well-documented rescaling symmetries in ReLU neural network parameters. While two properly rescaled weights implement the same function, the training…

Machine Learning · Statistics 2026-02-24 Arthur Lebeurrier , Titouan Vayer , Rémi Gribonval

Neural network-based function approximation plays a pivotal role in the advancement of scientific computing and machine learning. Yet, training such models faces several challenges: (i) each target function often requires training a new…

Machine Learning · Computer Science 2025-10-13 Xinwen Hu , Yunqing Huang , Nianyu Yi , Peimeng Yin

Good weight initialization serves as an effective measure to reduce the training cost of a deep neural network (DNN) model. The choice of how to initialize parameters is challenging and may require manual tuning, which can be time-consuming…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Yifan Gong , Zheng Zhan , Yanyu Li , Yerlan Idelbayev , Andrey Zharkov , Kfir Aberman , Sergey Tulyakov , Yanzhi Wang , Jian Ren

A learning rate scheduler is a predefined set of instructions for varying search stepsizes during model training processes. This paper introduces a new logarithmic method using harsh restarting of step sizes through stochastic gradient…

Machine Learning · Computer Science 2024-06-12 Philip Naveen

A common method in training neural networks is to initialize all the weights to be independent Gaussian vectors. We observe that by instead initializing the weights into independent pairs, where each pair consists of two identical Gaussian…

Machine Learning · Computer Science 2022-06-28 Alexander Munteanu , Simon Omlor , Zhao Song , David P. Woodruff

Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance…

Machine Learning · Computer Science 2022-04-06 Daiki Chijiwa , Shin'ya Yamaguchi , Yasutoshi Ida , Kenji Umakoshi , Tomohiro Inoue
‹ Prev 1 2 3 10 Next ›