Related papers: Sinusoidal Initialization, Time for a New Start

Revisiting Initialization of Neural Networks

The proper initialization of weights is crucial for the effective training and fast convergence of deep neural networks (DNNs). Prior work in this area has mostly focused on balancing the variance among weights per layer to maintain…

Machine Learning · Computer Science 2020-06-05 Maciej Skorski, Alessandro Temperoni, Martin Theobald

A new initialisation to Control Gradients in Sinusoidal Neural network

Proper initialisation strategy is of primary importance to mitigate gradient explosion or vanishing when training neural networks. Yet, the impact of initialisation parameters still lacks a precise theoretical understanding for several…

Machine Learning · Computer Science 2026-05-12 Andrea Combette, Antoine Venaille, Nelly Pustelnik

Provable Benefit of Orthogonal Initialization in Optimizing Deep Linear Networks

The selection of initial parameter values for gradient-based optimization of deep neural networks is one of the most impactful hyperparameter choices in deep learning systems, affecting both convergence times and model performance. Yet…

Machine Learning · Computer Science 2020-01-17 Wei Hu, Lechao Xiao, Jeffrey Pennington

Target noise: A pre-training based neural network initialization for efficient high resolution learning

Weight initialization plays a crucial role in the optimization behavior and convergence efficiency of neural networks. Most existing initialization methods, such as Xavier and Kaiming initializations, rely on random sampling and do not…

Machine Learning · Computer Science 2026-02-09 Shaowen Wang, Tariq Alkhalifah

On the Initialization of Long Short-Term Memory Networks

Weight initialization is important for faster convergence and stability of deep neural networks training. In this paper, a robust initialization method is developed to address the training instability in long short-term memory (LSTM)…

Machine Learning · Computer Science 2019-12-24 Mostafa Mehdipour Ghazi, Mads Nielsen, Akshay Pai, Marc Modat, M. Jorge Cardoso, Sebastien Ourselin, Lauge Sorensen

Fast Training of Sinusoidal Neural Fields via Scaling Initialization

Neural fields are an emerging paradigm that represent data as continuous functions parameterized by neural networks. Despite many advantages, neural fields often have a high training cost, which prevents a broader adoption. In this paper,…

Machine Learning · Computer Science 2026-02-04 Taesun Yeom, Sangyoon Lee, Jaeho Lee

Tuning the Frequencies: Robust Training for Sinusoidal Neural Networks

Sinusoidal neural networks have been shown effective as implicit neural representations (INRs) of low-dimensional signals, due to their smoothness and high representation capacity. However, initializing and training them remain empirical…

Machine Learning · Computer Science 2025-04-07 Tiago Novello, Diana Aldana, Andre Araujo, Luiz Velho

IDInit: A Universal and Stable Initialization Method for Neural Network Training

Deep neural networks have achieved remarkable accomplishments in practice. The success of these networks hinges on effective initialization methods, which are vital for ensuring stable and rapid convergence during training. Recently,…

Machine Learning · Computer Science 2025-03-11 Yu Pan, Chaozheng Wang, Zekai Wu, Qifan Wang, Min Zhang, Zenglin Xu

How to Initialize your Network? Robust Initialization for WeightNorm & ResNets

Residual networks (ResNet) and weight normalization play an important role in various deep learning applications. However, parameter initialization strategies have not been studied previously for weight normalized networks and, in practice,…

Machine Learning · Statistics 2019-10-31 Devansh Arpit, Victor Campos, Yoshua Bengio

Initialization of ReLUs for Dynamical Isometry

Deep learning relies on good initialization schemes and hyperparameter choices prior to training a neural network. Random weight initializations induce random network ensembles, which give rise to the trainability, training speed, and…

Machine Learning · Statistics 2019-10-25 Rebekka Burkholz, Alina Dubatovka

An Effective Weight Initialization Method for Deep Learning: Application to Satellite Image Classification

The growing interest in satellite imagery has triggered the need for efficient mechanisms to extract valuable information from these vast data sources, providing deeper insights. Even though deep learning has shown significant progress in…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Wadii Boulila, Eman Alshanqiti, Ayyub Alzahem, Anis Koubaa, Nabil Mlaiki

Starting Positions Matter: A Study on Better Weight Initialization for Neural Network Quantization

Deep neural network (DNN) quantization for fast, efficient inference has been an important tool in limiting the cost of machine learning (ML) model inference. Quantization-specific model development techniques such as regularization,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Stone Yun, Alexander Wong

Deep activity propagation via weight initialization in spiking neural networks

Spiking Neural Networks (SNNs) and neuromorphic computing offer bio-inspired advantages such as sparsity and ultra-low power consumption, providing a promising alternative to conventional networks. However, training deep SNNs from scratch…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Aurora Micheli, Olaf Booij, Jan van Gemert, Nergis Tömen

Depth-Aware Initialization for Stable and Efficient Neural Network Training

In the past few years, various initialization schemes have been proposed, including Glorot initialization, He initialization, initialization using an orthogonal matrix, and the random walk method for initialization. Some of these methods stress on…

Machine Learning · Computer Science 2025-09-08 Vijay Pandey

Neural network initialization with nonlinear characteristics and information on hierarchical features

Initialization of neural network parameters, such as weights and biases, has a crucial impact on learning performance; if chosen well, we can even avoid the need for additional training with backpropagation. For example, algorithms based on…

Machine Learning · Computer Science 2026-03-16 Hikaru Homma, Jun Ohkubo

AutoInit: Analytic Signal-Preserving Weight Initialization for Neural Networks

Neural networks require careful weight initialization to prevent signals from exploding or vanishing. Existing initialization schemes solve this problem in specific cases by assuming that the network has a certain activation function or…

Machine Learning · Computer Science 2022-12-01 Garrett Bingham, Risto Miikkulainen

A Good Start Matters: Enhancing Continual Learning with Data-Driven Weight Initialization

To adapt to real-world data streams, continual learning (CL) systems must rapidly learn new concepts while preserving and utilizing prior knowledge. When it comes to adding new information to continually-trained deep neural networks (DNNs),…

Machine Learning · Computer Science 2025-07-02 Md Yousuf Harun, Christopher Kanan

Weight Initialization Techniques for Deep Learning Algorithms in Remote Sensing: Recent Trends and Future Perspectives

During the last decade, several research works have focused on providing novel deep learning methods in many application fields. However, few of them have investigated the weight initialization process for deep learning, although its…

Machine Learning · Computer Science 2021-02-16 Wadii Boulila, Maha Driss, Mohamed Al-Sarem, Faisal Saeed, Moez Krichen

Simple initialization and parametrization of sinusoidal networks via their kernel bandwidth

Neural networks with sinusoidal activations have been proposed as an alternative to networks with traditional activation functions. Despite their promise, particularly for learning implicit models, their training behavior is not yet fully…

Machine Learning · Computer Science 2022-11-29 Filipe de Avila Belbute-Peres, J. Zico Kolter

Improving Deep Neural Network Random Initialization Through Neuronal Rewiring

The deep learning literature is continuously updated with new architectures and training techniques. However, weight initialization is overlooked by most recent research, despite some intriguing findings regarding random weights. On the…

Neural and Evolutionary Computing · Computer Science 2022-07-19 Leonardo Scabini, Bernard De Baets, Odemir M. Bruno