Related papers: Correlated Initialization for Correlated Data

Initializing ReLU networks in an expressive subspace of weights

Using a mean-field theory of signal propagation, we analyze the evolution of correlations between two signals propagating forward through a deep ReLU network with correlated weights. Signals become highly correlated in deep ReLU networks…

Machine Learning · Computer Science 2021-05-26 Dayal Singh , G J Sreejith

Learned Initializations for Optimizing Coordinate-Based Neural Representations

Coordinate-based neural representations have shown significant promise as an alternative to discrete, array-based representations for complex low dimensional signals. However, optimizing a coordinate-based network from randomly initialized…

Computer Vision and Pattern Recognition · Computer Science 2021-03-24 Matthew Tancik , Ben Mildenhall , Terrance Wang , Divi Schmidt , Pratul P. Srinivasan , Jonathan T. Barron , Ren Ng

Initialization Matters: Unraveling the Impact of Pre-Training on Federated Learning

Initializing with pre-trained models when learning on downstream tasks is becoming standard practice in machine learning. Several recent works explore the benefits of pre-trained initialization in a federated learning (FL) setting, where…

Machine Learning · Computer Science 2025-02-13 Divyansh Jhunjhunwala , Pranay Sharma , Zheng Xu , Gauri Joshi

A Theory of Initialisation's Impact on Specialisation

Prior work has demonstrated a consistent tendency in neural networks engaged in continual learning tasks, wherein intermediate task similarity results in the highest levels of catastrophic interference. This phenomenon is attributed to the…

Machine Learning · Computer Science 2025-03-05 Devon Jarvis , Sebastian Lee , Clémentine Carla Juliette Dominé , Andrew M Saxe , Stefano Sarao Mannelli

Neural network initialization with nonlinear characteristics and information on hierarchical features

Initialization of neural network parameters, such as weights and biases, has a crucial impact on learning performance; if chosen well, we can even avoid the need for additional training with backpropagation. For example, algorithms based on…

Machine Learning · Computer Science 2026-03-16 Hikaru Homma , Jun Ohkubo

Data-dependent Initializations of Convolutional Neural Networks

Convolutional Neural Networks spread through computer vision like a wildfire, impacting almost all visual tasks imaginable. Despite this, few researchers dare to train their models from scratch. Most work builds on one of a handful of…

Computer Vision and Pattern Recognition · Computer Science 2016-09-26 Philipp Krähenbühl , Carl Doersch , Jeff Donahue , Trevor Darrell

Implicit Bias of Linear RNNs

Contemporary wisdom based on empirical studies suggests that standard recurrent neural networks (RNNs) do not perform well on tasks requiring long-term memory. However, precise reasoning for this behavior is still unknown. This paper…

Machine Learning · Computer Science 2021-01-21 Melikasadat Emami , Mojtaba Sahraee-Ardakan , Parthe Pandit , Sundeep Rangan , Alyson K. Fletcher

Adjusting for Spatial Correlation in Machine and Deep Learning

Spatial data display correlation between observations collected at neighboring locations. Generally, machine and deep learning methods either do not account for this correlation or do so indirectly through correlated features and thereby…

Methodology · Statistics 2024-10-08 Matthew J. Heaton , Andrew Millane , Jake S. Rhodes

Initialization Strategies of Spatio-Temporal Convolutional Neural Networks

We propose a new way of incorporating temporal information present in videos into Spatial Convolutional Neural Networks (ConvNets) trained on images, that avoids training Spatio-Temporal ConvNets from scratch. We describe several…

Computer Vision and Pattern Recognition · Computer Science 2015-03-26 Elman Mansimov , Nitish Srivastava , Ruslan Salakhutdinov

Scaling and Resizing Symmetry in Feedforward Networks

Weights initialization in deep neural networks have a strong impact on the speed of converge of the learning map. Recent studies have shown that in the case of random initializations, a chaos/order phase transition occur in the space of…

Machine Learning · Computer Science 2023-06-28 Carlos Cardona

A Good Start Matters: Enhancing Continual Learning with Data-Driven Weight Initialization

To adapt to real-world data streams, continual learning (CL) systems must rapidly learn new concepts while preserving and utilizing prior knowledge. When it comes to adding new information to continually-trained deep neural networks (DNNs),…

Machine Learning · Computer Science 2025-07-02 Md Yousuf Harun , Christopher Kanan

Consequences of Slow Neural Dynamics for Incremental Learning

In the human brain, internal states are often correlated over time (due to local recurrence and other intrinsic circuit properties), punctuated by abrupt transitions. At first glance, temporal smoothness of internal states presents a…

Machine Learning · Computer Science 2023-05-24 Shima Rahimi Moghaddam , Fanjun Bu , Christopher J. Honey

Effects of Initialization Biases on Deep Neural Network Training Dynamics

Untrained large neural networks, just after random initialization, tend to favour a small subset of classes, assigning high predicted probabilities to these few classes and approximately zero probability to all others. This bias, termed…

Machine Learning · Computer Science 2025-11-27 Nicholas Pellegrino , David Szczecina , Paul W. Fieguth

The Impact of Reinitialization on Generalization in Convolutional Neural Networks

Recent results suggest that reinitializing a subset of the parameters of a neural network during training can improve generalization, particularly for small training sets. We study the impact of different reinitialization methods in several…

Machine Learning · Computer Science 2021-09-02 Ibrahim Alabdulmohsin , Hartmut Maennel , Daniel Keysers

On the (Non-)Robustness of Two-Layer Neural Networks in Different Learning Regimes

Neural networks are known to be highly sensitive to adversarial examples. These may arise due to different factors, such as random initialization, or spurious correlations in the learning problem. To better understand these factors, we…

Machine Learning · Statistics 2022-07-05 Elvis Dohmatob , Alberto Bietti

Deep learning of spatial densities in inhomogeneous correlated quantum systems

Machine learning has made important headway in helping to improve the treatment of quantum many-body systems. A domain of particular relevance are correlated inhomogeneous systems. What has been missing so far is a general, scalable…

Quantum Physics · Physics 2026-02-10 Alex Blania , Sandro Herbig , Fabian Dechent , Evert van Nieuwenburg , Florian Marquardt

Detecting Bias in the Presence of Spatial Autocorrelation

In spite of considerable practical importance, current algorithmic fairness literature lacks technical methods to account for underlying geographic dependency while evaluating or mitigating bias issues for spatial data. We initiate the…

Applications · Statistics 2022-01-31 Subhabrata Majumdar , Cheryl Flynn , Ritwik Mitra

Where Should We Begin? A Low-Level Exploration of Weight Initialization Impact on Quantized Behaviour of Deep Neural Networks

With the proliferation of deep convolutional neural network (CNN) algorithms for mobile processing, limited precision quantization has become an essential tool for CNN efficiency. Consequently, various works have sought to design fixed…

Machine Learning · Computer Science 2020-12-01 Stone Yun , Alexander Wong

Dynamic Neural Diversification: Path to Computationally Sustainable Neural Networks

Small neural networks with a constrained number of trainable parameters, can be suitable resource-efficient candidates for many simple tasks, where now excessively large models are used. However, such models face several problems during the…

Machine Learning · Computer Science 2021-09-21 Alexander Kovalenko , Pavel Kordík , Magda Friedjungová

How does unlabeled data improve generalization in self-training? A one-hidden-layer theoretical analysis

Self-training, a semi-supervised learning algorithm, leverages a large amount of unlabeled data to improve learning when the labeled data are limited. Despite empirical successes, its theoretical characterization remains elusive. To the…

Machine Learning · Computer Science 2022-02-15 Shuai Zhang , Meng Wang , Sijia Liu , Pin-Yu Chen , Jinjun Xiong