English
Related papers

Related papers: Understanding Learning Dynamics Through Structured…

200 papers

The internal representations learned by deep networks are often sensitive to architecture-specific choices, raising questions about the stability, alignment, and transferability of learned structure across models. In this paper, we…

Machine Learning · Computer Science 2025-08-06 Saleh Nikooroo , Thomas Engel

Deep neural networks come in many sizes and architectures. The choice of architecture, in conjunction with the dataset and learning algorithm, is commonly understood to affect the learned neural representations. Yet, recent results have…

Machine Learning · Computer Science 2024-07-08 Loek van Rossem , Andrew M. Saxe

To preserve previously learned representations, continual learning systems must strike a balance between plasticity, the ability to acquire new knowledge, and stability. This stability-plasticity dilemma affects how representations can be…

Machine Learning · Computer Science 2026-05-01 Kathrin Korte , Joachim Winter Pedersen , Eleni Nisioti , Sebastian Risi

Our theoretical understanding of deep learning has not kept pace with its empirical success. While network architecture is known to be critical, we do not yet understand its effect on learned representations and network behavior, or how…

Machine Learning · Computer Science 2022-07-22 Andrew M. Saxe , Shagun Sodhani , Sam Lewallen

We study mechanisms to characterize how the asymptotic convergence of backpropagation in deep architectures, in general, is related to the network structure, and how it may be influenced by other design choices including activation type,…

Machine Learning · Computer Science 2017-03-02 Vamsi K Ithapu , Sathya N Ravi , Vikas Singh

Despite their impressive performance, contemporary neural networks often lack structural safeguards that promote stable learning and interpretable behavior. In this work, we introduce a reformulation of layer-level transformations that…

Machine Learning · Computer Science 2025-08-04 Saleh Nikooroo , Thomas Engel

Many types of data from fields including natural language processing, computer vision, and bioinformatics, are well represented by discrete, compositional structures such as trees, sequences, or matchings. Latent structure models are a…

Machine Learning · Computer Science 2026-02-04 Vlad Niculae , Caio F. Corro , Nikita Nangia , Tsvetomila Mihaylova , André F. T. Martins

This work presents a novel means for understanding learning dynamics and scaling relations in neural networks. We show that certain measures on the spectrum of the empirical neural tangent kernel, specifically entropy and trace, yield…

Machine Learning · Computer Science 2024-10-11 Samuel Tovey , Sven Krippendorf , Michael Spannowsky , Konstantin Nikolaou , Christian Holm

The relation between network structure and dynamics is determinant for the behavior of complex systems in numerous domains. An important long-standing problem concerns the properties of the networks that optimize the dynamics with respect…

Adaptation and Self-Organizing Systems · Physics 2017-12-07 Takashi Nishikawa , Jie Sun , Adilson E. Motter

Despite the remarkable success of large large-scale neural networks, we still lack unified notation for thinking about and describing their representational spaces. We lack methods to reliably describe how their representations are…

Machine Learning · Computer Science 2025-06-02 Henry Conklin

The regularization and output consistency behavior of dropout and layer-wise pretraining for learning deep networks have been fairly well studied. However, our understanding of how the asymptotic convergence of backpropagation in deep…

Machine Learning · Computer Science 2017-02-23 Vamsi K Ithapu , Sathya N Ravi , Vikas Singh

Advancements in artificial intelligence call for a deeper understanding of the fundamental mechanisms underlying deep learning. In this work, we propose a theoretical framework to analyze learning dynamics through the lens of dynamical…

Machine Learning · Computer Science 2025-10-13 Yuchen Lin , Yong Zhang , Sihan Feng , Hong Zhao

Motor adaptation displays a structure-learning effect: adaptation to a new perturbation occurs more quickly when the subject has prior exposure to perturbations with related structure. Although this `learning-to-learn' effect is well…

Artificial Intelligence · Computer Science 2017-07-14 Ari Weinstein , Matthew M. Botvinick

Deep neural networks give us a powerful method to model the training dataset's relationship between input and output. We can regard that as a complex adaptive system consisting of many artificial neurons that work as an adaptive memory as a…

Disordered Systems and Neural Networks · Physics 2024-05-08 Kenichi Nakazato

Our understanding of learning dynamics of deep neural networks (DNNs) remains incomplete. Recent research has begun to uncover the mathematical principles underlying these networks, including the phenomenon of "Neural Collapse", where…

Machine Learning · Computer Science 2024-02-13 Bradley T. Baker , Barak A. Pearlmutter , Robyn Miller , Vince D. Calhoun , Sergey M. Plis

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

This article explores the design and experimentation of a neural network architecture capable of dynamically adjusting its internal structure based on the input data. The proposed model introduces a routing mechanism that allows each layer…

Machine Learning · Computer Science 2025-11-18 Dmytro Hospodarchuk

The process of training an artificial neural network involves iteratively adapting its parameters so as to minimize the error of the network's prediction, when confronted with a learning task. This iterative change can be naturally…

Machine Learning · Computer Science 2024-04-10 Kaloyan Danovski , Miguel C. Soriano , Lucas Lacasa

Understanding the learning dynamics of neural networks is one of the key issues for the improvement of optimization algorithms as well as for the theoretical comprehension of why deep neural nets work so well today. In this paper, we…

Machine Learning · Statistics 2021-03-18 Zhenyu Liao , Romain Couillet

The mutual influence of dynamics and structure is a central issue in complex systems. In this paper we study by simulation slow evolution of network under the feedback of a local-majority-rule opinion process. If performance-enhancing local…

Physics and Society · Physics 2009-11-13 Zhen Shao , Haijun Zhou
‹ Prev 1 2 3 10 Next ›