Related papers: Understanding Learning Dynamics Through Structured…

Cross-Model Semantics in Representation Learning

The internal representations learned by deep networks are often sensitive to architecture-specific choices, raising questions about the stability, alignment, and transferability of learned structure across models. In this paper, we…

Machine Learning · Computer Science 2025-08-06 Saleh Nikooroo , Thomas Engel

When Representations Align: Universality in Representation Learning Dynamics

Deep neural networks come in many sizes and architectures. The choice of architecture, in conjunction with the dataset and learning algorithm, is commonly understood to affect the learned neural representations. Yet, recent results have…

Machine Learning · Computer Science 2024-07-08 Loek van Rossem , Andrew M. Saxe

When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry

To preserve previously learned representations, continual learning systems must strike a balance between plasticity, the ability to acquire new knowledge, and stability. This stability-plasticity dilemma affects how representations can be…

Machine Learning · Computer Science 2026-05-01 Kathrin Korte , Joachim Winter Pedersen , Eleni Nisioti , Sebastian Risi

The Neural Race Reduction: Dynamics of Abstraction in Gated Networks

Our theoretical understanding of deep learning has not kept pace with its empirical success. While network architecture is known to be critical, we do not yet understand its effect on learned representations and network behavior, or how…

Machine Learning · Computer Science 2022-07-22 Andrew M. Saxe , Shagun Sodhani , Sam Lewallen

On architectural choices in deep learning: From network structure to gradient convergence and parameter estimation

We study mechanisms to characterize how the asymptotic convergence of backpropagation in deep architectures, in general, is related to the network structure, and how it may be influenced by other design choices including activation type,…

Machine Learning · Computer Science 2017-03-02 Vamsi K Ithapu , Sathya N Ravi , Vikas Singh

Structured Transformations for Stable and Interpretable Neural Computation

Despite their impressive performance, contemporary neural networks often lack structural safeguards that promote stable learning and interpretable behavior. In this work, we introduce a reformulation of layer-level transformations that…

Machine Learning · Computer Science 2025-08-04 Saleh Nikooroo , Thomas Engel

Discrete Latent Structure in Neural Networks

Many types of data from fields including natural language processing, computer vision, and bioinformatics, are well represented by discrete, compositional structures such as trees, sequences, or matchings. Latent structure models are a…

Machine Learning · Computer Science 2026-02-04 Vlad Niculae , Caio F. Corro , Nikita Nangia , Tsvetomila Mihaylova , André F. T. Martins

Collective variables of neural networks: empirical time evolution and scaling laws

This work presents a novel means for understanding learning dynamics and scaling relations in neural networks. We show that certain measures on the spectrum of the empirical neural tangent kernel, specifically entropy and trace, yield…

Machine Learning · Computer Science 2024-10-11 Samuel Tovey , Sven Krippendorf , Michael Spannowsky , Konstantin Nikolaou , Christian Holm

Sensitive Dependence of Optimal Network Dynamics on Network Structure

The relation between network structure and dynamics is determinant for the behavior of complex systems in numerous domains. An important long-standing problem concerns the properties of the networks that optimize the dynamics with respect…

Adaptation and Self-Organizing Systems · Physics 2017-12-07 Takashi Nishikawa , Jie Sun , Adilson E. Motter

Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation

Despite the remarkable success of large large-scale neural networks, we still lack unified notation for thinking about and describing their representational spaces. We lack methods to reliably describe how their representations are…

Machine Learning · Computer Science 2025-06-02 Henry Conklin

On the interplay of network structure and gradient convergence in deep learning

The regularization and output consistency behavior of dropout and layer-wise pretraining for learning deep networks have been fairly well studied. However, our understanding of how the asymptotic convergence of backpropagation in deep…

Machine Learning · Computer Science 2017-02-23 Vamsi K Ithapu , Sathya N Ravi , Vikas Singh

Network Dynamics-Based Framework for Understanding Deep Neural Networks

Advancements in artificial intelligence call for a deeper understanding of the fundamental mechanisms underlying deep learning. In this work, we propose a theoretical framework to analyze learning dynamics through the lens of dynamical…

Machine Learning · Computer Science 2025-10-13 Yuchen Lin , Yong Zhang , Sihan Feng , Hong Zhao

Structure Learning in Motor Control:A Deep Reinforcement Learning Model

Motor adaptation displays a structure-learning effect: adaptation to a new perturbation occurs more quickly when the subject has prior exposure to perturbations with related structure. Although this `learning-to-learn' effect is well…

Artificial Intelligence · Computer Science 2017-07-14 Ari Weinstein , Matthew M. Botvinick

A simple theory for training response of deep neural networks

Deep neural networks give us a powerful method to model the training dataset's relationship between input and output. We can regard that as a complex adaptive system consisting of many artificial neurons that work as an adaptive memory as a…

Disordered Systems and Neural Networks · Physics 2024-05-08 Kenichi Nakazato

Low-Rank Learning by Design: the Role of Network Architecture and Activation Linearity in Gradient Rank Collapse

Our understanding of learning dynamics of deep neural networks (DNNs) remains incomplete. Recent research has begun to uncover the mathematical principles underlying these networks, including the phenomenon of "Neural Collapse", where…

Machine Learning · Computer Science 2024-02-13 Bradley T. Baker , Barak A. Pearlmutter , Robyn Miller , Vince D. Calhoun , Sergey M. Plis

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

H-Model: Dynamic Neural Architectures for Adaptive Processing

This article explores the design and experimentation of a neural network architecture capable of dynamically adjusting its internal structure based on the input data. The proposed model introduces a routing mechanism that allows each layer…

Machine Learning · Computer Science 2025-11-18 Dmytro Hospodarchuk

Dynamical stability and chaos in artificial neural network trajectories along training

The process of training an artificial neural network involves iteratively adapting its parameters so as to minimize the error of the network's prediction, when confronted with a learning task. This iterative change can be naturally…

Machine Learning · Computer Science 2024-04-10 Kaloyan Danovski , Miguel C. Soriano , Lucas Lacasa

The Dynamics of Learning: A Random Matrix Approach

Understanding the learning dynamics of neural networks is one of the key issues for the improvement of optimization algorithms as well as for the theoretical comprehension of why deep neural nets work so well today. In this paper, we…

Machine Learning · Statistics 2021-03-18 Zhenyu Liao , Romain Couillet

Dynamics-Driven Evolution to Structural Heterogeneity in Complex Networks

The mutual influence of dynamics and structure is a central issue in complex systems. In this paper we study by simulation slow evolution of network under the feedback of a local-majority-rule opinion process. If performance-enhancing local…

Physics and Society · Physics 2009-11-13 Zhen Shao , Haijun Zhou