Related papers: Parameter Re-Initialization through Cyclical Batch…

Target noise: A pre-training based neural network initialization for efficient high resolution learning

Weight initialization plays a crucial role in the optimization behavior and convergence efficiency of neural networks. Most existing initialization methods, such as Xavier and Kaiming initializations, rely on random sampling and do not…

Machine Learning · Computer Science 2026-02-09 Shaowen Wang, Tariq Alkhalifah

Initialization Using Perlin Noise for Training Networks with a Limited Amount of Data

We propose a novel network initialization method using Perlin noise for training image classification networks with a limited amount of data. Our main idea is to initialize the network parameters by solving an artificial noise…

Computer Vision and Pattern Recognition · Computer Science 2021-01-20 Nakamasa Inoue, Eisuke Yamagata, Hirokatsu Kataoka

Initializing Models with Larger Ones

Weight initialization plays an important role in neural network training. Widely used initialization methods are proposed and evaluated for networks that are trained from scratch. However, the growing number of pretrained models now offers…

Machine Learning · Computer Science 2023-12-01 Zhiqiu Xu, Yanjie Chen, Kirill Vishniakov, Yida Yin, Zhiqiang Shen, Trevor Darrell, Lingjie Liu, Zhuang Liu

Reducing Neural Network Parameter Initialization Into an SMT Problem

Training a neural network (NN) depends on multiple factors, including but not limited to the initial weights. In this paper, we focus on initializing deep NN parameters such that it performs better, comparing to random or zero…

Machine Learning · Computer Science 2020-11-10 Mohamad H. Danesh

When Does Re-initialization Work?

Re-initializing a neural network during training has been observed to improve generalization in recent works. Yet it is neither widely adopted in deep learning practice nor is it often used in state-of-the-art training protocols. This…

Machine Learning · Computer Science 2023-04-04 Sheheryar Zaidi, Tudor Berariu, Hyunjik Kim, Jörg Bornschein, Claudia Clopath, Yee Whye Teh, Razvan Pascanu

How to Initialize your Network? Robust Initialization for WeightNorm & ResNets

Residual networks (ResNet) and weight normalization play an important role in various deep learning applications. However, parameter initialization strategies have not been studied previously for weight normalized networks and, in practice,…

Machine Learning · Statistics 2019-10-31 Devansh Arpit, Victor Campos, Yoshua Bengio

On the Initialization of Long Short-Term Memory Networks

Weight initialization is important for faster convergence and stability of deep neural networks training. In this paper, a robust initialization method is developed to address the training instability in long short-term memory (LSTM)…

Machine Learning · Computer Science 2019-12-24 Mostafa Mehdipour Ghazi, Mads Nielsen, Akshay Pai, Marc Modat, M. Jorge Cardoso, Sebastien Ourselin, Lauge Sorensen

A Good Start Matters: Enhancing Continual Learning with Data-Driven Weight Initialization

To adapt to real-world data streams, continual learning (CL) systems must rapidly learn new concepts while preserving and utilizing prior knowledge. When it comes to adding new information to continually-trained deep neural networks (DNNs),…

Machine Learning · Computer Science 2025-07-02 Md Yousuf Harun, Christopher Kanan

Revisiting Batch Norm Initialization

Batch normalization (BN) is comprised of a normalization component followed by an affine transformation and has become essential for training deep neural networks. Standard initialization of each BN in a network sets the affine…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Jim Davis, Logan Frank

A Bayesian approach for initialization of weights in backpropagation neural net with application to character recognition

Convergence rate of training algorithms for neural networks is heavily affected by initialization of weights. In this paper, an original algorithm for initialization of weights in backpropagation neural net is presented with application to…

Machine Learning · Computer Science 2020-04-07 Nadir Murru, Rosaria Rossini

Stochastic Weight Matrix-based Regularization Methods for Deep Neural Networks

The aim of this paper is to introduce two widely applicable regularization methods based on the direct modification of weight matrices. The first method, Weight Reinitialization, utilizes a simplified Bayesian assumption with partially…

Machine Learning · Computer Science 2022-06-07 Patrik Reizinger, Bálint Gyires-Tóth

A Sober Look at Neural Network Initializations

Initializing the weights and the biases is a key part of the training process of a neural network. Unlike the subsequent optimization phase, however, the initialization phase has gained only limited attention in the literature. In this…

Machine Learning · Computer Science 2019-09-06 Ingo Steinwart

Initialization of ReLUs for Dynamical Isometry

Deep learning relies on good initialization schemes and hyperparameter choices prior to training a neural network. Random weight initializations induce random network ensembles, which give rise to the trainability, training speed, and…

Machine Learning · Statistics 2019-10-25 Rebekka Burkholz, Alina Dubatovka

Variance-Aware Weight Initialization for Point Convolutional Neural Networks

Appropriate weight initialization has been of key importance to successfully train neural networks. Recently, batch normalization has diminished the role of weight initialization by simply normalizing each layer based on batch statistics.…

Computer Vision and Pattern Recognition · Computer Science 2022-08-03 Pedro Hermosilla, Michael Schelling, Tobias Ritschel, Timo Ropinski

Path-conditioned training: a principled way to rescale ReLU neural networks

Despite recent algorithmic advances, we still lack principled ways to leverage the well-documented rescaling symmetries in ReLU neural network parameters. While two properly rescaled weights implement the same function, the training…

Machine Learning · Statistics 2026-02-24 Arthur Lebeurrier, Titouan Vayer, Rémi Gribonval

Weights initialization of neural networks for function approximation

Neural network-based function approximation plays a pivotal role in the advancement of scientific computing and machine learning. Yet, training such models faces several challenges: (i) each target function often requires training a new…

Machine Learning · Computer Science 2025-10-13 Xinwen Hu, Yunqing Huang, Nianyu Yi, Peimeng Yin

Efficient Training with Denoised Neural Weights

Good weight initialization serves as an effective measure to reduce the training cost of a deep neural network (DNN) model. The choice of how to initialize parameters is challenging and may require manual tuning, which can be time-consuming…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Yifan Gong, Zheng Zhan, Yanyu Li, Yerlan Idelbayev, Andrey Zharkov, Kfir Aberman, Sergey Tulyakov, Yanzhi Wang, Jian Ren

Cyclical Log Annealing as a Learning Rate Scheduler

A learning rate scheduler is a predefined set of instructions for varying search stepsizes during model training processes. This paper introduces a new logarithmic method using harsh restarting of step sizes through stochastic gradient…

Machine Learning · Computer Science 2024-06-12 Philip Naveen

Bounding the Width of Neural Networks via Coupled Initialization -- A Worst Case Analysis

A common method in training neural networks is to initialize all the weights to be independent Gaussian vectors. We observe that by instead initializing the weights into independent pairs, where each pair consists of two identical Gaussian…

Machine Learning · Computer Science 2022-06-28 Alexander Munteanu, Simon Omlor, Zhao Song, David P. Woodruff

Pruning Randomly Initialized Neural Networks with Iterative Randomization

Pruning the weights of randomly initialized neural networks plays an important role in the context of lottery ticket hypothesis. Ramanujan et al. (2020) empirically showed that only pruning the weights can achieve remarkable performance…

Machine Learning · Computer Science 2022-04-06 Daiki Chijiwa, Shin'ya Yamaguchi, Yasutoshi Ida, Kenji Umakoshi, Tomohiro Inoue