Related papers: Sinusoidal Initialization, Time for a New Start

200 papers

The proper initialization of weights is crucial for the effective training and fast convergence of deep neural networks (DNNs). Prior work in this area has mostly focused on balancing the variance among weights per layer to maintain…

Machine Learning · Computer Science 2020-06-05 Maciej Skorski, Alessandro Temperoni, Martin Theobald

Proper initialisation strategy is of primary importance to mitigate gradient explosion or vanishing when training neural networks. Yet, the impact of initialisation parameters still lacks a precise theoretical understanding for several…

Machine Learning · Computer Science 2026-05-12 Andrea Combette, Antoine Venaille, Nelly Pustelnik

The selection of initial parameter values for gradient-based optimization of deep neural networks is one of the most impactful hyperparameter choices in deep learning systems, affecting both convergence times and model performance. Yet…

Machine Learning · Computer Science 2020-01-17 Wei Hu, Lechao Xiao, Jeffrey Pennington

Weight initialization plays a crucial role in the optimization behavior and convergence efficiency of neural networks. Most existing initialization methods, such as Xavier and Kaiming initializations, rely on random sampling and do not…

Machine Learning · Computer Science 2026-02-09 Shaowen Wang, Tariq Alkhalifah

Weight initialization is important for faster convergence and stability of deep neural networks training. In this paper, a robust initialization method is developed to address the training instability in long short-term memory (LSTM)…

Neural fields are an emerging paradigm that represent data as continuous functions parameterized by neural networks. Despite many advantages, neural fields often have a high training cost, which prevents a broader adoption. In this paper,…

Machine Learning · Computer Science 2026-02-04 Taesun Yeom, Sangyoon Lee, Jaeho Lee

Sinusoidal neural networks have been shown effective as implicit neural representations (INRs) of low-dimensional signals, due to their smoothness and high representation capacity. However, initializing and training them remain empirical…

Machine Learning · Computer Science 2025-04-07 Tiago Novello, Diana Aldana, Andre Araujo, Luiz Velho

Deep neural networks have achieved remarkable accomplishments in practice. The success of these networks hinges on effective initialization methods, which are vital for ensuring stable and rapid convergence during training. Recently,…

Machine Learning · Computer Science 2025-03-11 Yu Pan, Chaozheng Wang, Zekai Wu, Qifan Wang, Min Zhang, Zenglin Xu

Residual networks (ResNet) and weight normalization play an important role in various deep learning applications. However, parameter initialization strategies have not been studied previously for weight normalized networks and, in practice,…

Machine Learning · Statistics 2019-10-31 Devansh Arpit, Victor Campos, Yoshua Bengio

Deep learning relies on good initialization schemes and hyperparameter choices prior to training a neural network. Random weight initializations induce random network ensembles, which give rise to the trainability, training speed, and…

Machine Learning · Statistics 2019-10-25 Rebekka Burkholz, Alina Dubatovka

The growing interest in satellite imagery has triggered the need for efficient mechanisms to extract valuable information from these vast data sources, providing deeper insights. Even though deep learning has shown significant progress in…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Wadii Boulila, Eman Alshanqiti, Ayyub Alzahem, Anis Koubaa, Nabil Mlaiki

Deep neural network (DNN) quantization for fast, efficient inference has been an important tool in limiting the cost of machine learning (ML) model inference. Quantization-specific model development techniques such as regularization,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Stone Yun, Alexander Wong

Spiking Neural Networks (SNNs) and neuromorphic computing offer bio-inspired advantages such as sparsity and ultra-low power consumption, providing a promising alternative to conventional networks. However, training deep SNNs from scratch…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Aurora Micheli, Olaf Booij, Jan van Gemert, Nergis Tömen

In the past few years, various initialization schemes have been proposed, including Glorot initialization, He initialization, orthogonal-matrix initialization, and random-walk initialization. Some of these methods stress on…

Machine Learning · Computer Science 2025-09-08 Vijay Pandey

Initialization of neural network parameters, such as weights and biases, has a crucial impact on learning performance; if chosen well, we can even avoid the need for additional training with backpropagation. For example, algorithms based on…

Machine Learning · Computer Science 2026-03-16 Hikaru Homma, Jun Ohkubo

Neural networks require careful weight initialization to prevent signals from exploding or vanishing. Existing initialization schemes solve this problem in specific cases by assuming that the network has a certain activation function or…

Machine Learning · Computer Science 2022-12-01 Garrett Bingham, Risto Miikkulainen

To adapt to real-world data streams, continual learning (CL) systems must rapidly learn new concepts while preserving and utilizing prior knowledge. When it comes to adding new information to continually-trained deep neural networks (DNNs),…

Machine Learning · Computer Science 2025-07-02 Md Yousuf Harun, Christopher Kanan

During the last decade, several research works have focused on providing novel deep learning methods in many application fields. However, few of them have investigated the weight initialization process for deep learning, although its…

Machine Learning · Computer Science 2021-02-16 Wadii Boulila, Maha Driss, Mohamed Al-Sarem, Faisal Saeed, Moez Krichen

Neural networks with sinusoidal activations have been proposed as an alternative to networks with traditional activation functions. Despite their promise, particularly for learning implicit models, their training behavior is not yet fully…

Machine Learning · Computer Science 2022-11-29 Filipe de Avila Belbute-Peres, J. Zico Kolter

The deep learning literature is continuously updated with new architectures and training techniques. However, weight initialization is overlooked by most recent research, despite some intriguing findings regarding random weights. On the…

Neural and Evolutionary Computing · Computer Science 2022-07-19 Leonardo Scabini, Bernard De Baets, Odemir M. Bruno