Related papers: When Representations Align: Universality in Repres…

Towards Understanding Learning Representations: To What Extent Do Different Neural Networks Learn the Same Representation

It is widely believed that learning good representations is one of the main reasons for the success of deep neural networks. Although highly intuitive, there is a lack of theory and systematic approach quantitatively characterizing what…

Machine Learning · Computer Science 2018-11-30 Liwei Wang , Lunjia Hu , Jiayuan Gu , Yue Wu , Zhiqiang Hu , Kun He , John Hopcroft

Understanding Learning Dynamics Through Structured Representations

While modern deep networks have demonstrated remarkable versatility, their training dynamics remain poorly understood--often driven more by empirical tweaks than architectural insight. This paper investigates how internal structural choices…

Machine Learning · Computer Science 2025-08-26 Saleh Nikooroo , Thomas Engel

Do Wide and Deep Networks Learn the Same Things? Uncovering How Neural Network Representations Vary with Width and Depth

A key factor in the success of deep neural networks is the ability to scale models to improve performance by varying the architecture depth and width. This simple property of neural network design has resulted in highly effective…

Machine Learning · Computer Science 2021-04-13 Thao Nguyen , Maithra Raghu , Simon Kornblith

Complexity of Representations in Deep Learning

Deep neural networks use multiple layers of functions to map an object represented by an input vector progressively to different representations, and with sufficient training, eventually to a single score for each class that is the output…

Machine Learning · Computer Science 2022-09-02 Tin Kam Ho

Understanding Dynamics of Nonlinear Representation Learning and Its Application

Representations of the world environment play a crucial role in artificial intelligence. It is often inefficient to conduct reasoning and inference directly in the space of raw sensory representations, such as pixel values of images.…

Machine Learning · Computer Science 2022-04-12 Kenji Kawaguchi , Linjun Zhang , Zhun Deng

From Lazy to Rich: Exact Learning Dynamics in Deep Linear Networks

Biological and artificial neural networks develop internal representations that enable them to perform complex tasks. In artificial networks, the effectiveness of these models relies on their ability to build task specific representation, a…

Machine Learning · Computer Science 2025-03-05 Clémentine C. J. Dominé , Nicolas Anguita , Alexandra M. Proca , Lukas Braun , Daniel Kunin , Pedro A. M. Mediano , Andrew M. Saxe

The Neural Race Reduction: Dynamics of Abstraction in Gated Networks

Our theoretical understanding of deep learning has not kept pace with its empirical success. While network architecture is known to be critical, we do not yet understand its effect on learned representations and network behavior, or how…

Machine Learning · Computer Science 2022-07-22 Andrew M. Saxe , Shagun Sodhani , Sam Lewallen

Cross-Model Semantics in Representation Learning

The internal representations learned by deep networks are often sensitive to architecture-specific choices, raising questions about the stability, alignment, and transferability of learned structure across models. In this paper, we…

Machine Learning · Computer Science 2025-08-06 Saleh Nikooroo , Thomas Engel

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

How Do the Architecture and Optimizer Affect Representation Learning? On the Training Dynamics of Representations in Deep Neural Networks

In this paper, we elucidate how representations in deep neural networks (DNNs) evolve during training. Our focus is on overparameterized learning settings where the training continues much after the trained DNN starts to perfectly fit its…

Machine Learning · Computer Science 2025-02-04 Yuval Sharon , Yehuda Dar

How Deep Neural Networks Learn Compositional Data: The Random Hierarchy Model

Deep learning algorithms demonstrate a surprising ability to learn high-dimensional tasks from limited examples. This is commonly attributed to the depth of neural networks, enabling them to build a hierarchy of abstract, low-dimensional…

Machine Learning · Computer Science 2024-07-04 Francesco Cagnetta , Leonardo Petrini , Umberto M. Tomasini , Alessandro Favero , Matthieu Wyart

Convergent Learning: Do different neural networks learn the same representations?

Recent success in training deep neural networks have prompted active investigation into the features learned on their intermediate layers. Such research is difficult because it requires making sense of non-linear computations performed by…

Machine Learning · Computer Science 2016-03-01 Yixuan Li , Jason Yosinski , Jeff Clune , Hod Lipson , John Hopcroft

When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry

To preserve previously learned representations, continual learning systems must strike a balance between plasticity, the ability to acquire new knowledge, and stability. This stability-plasticity dilemma affects how representations can be…

Machine Learning · Computer Science 2026-05-01 Kathrin Korte , Joachim Winter Pedersen , Eleni Nisioti , Sebastian Risi

Representation Learning: A Review and New Perspectives

The success of machine learning algorithms generally depends on data representation, and we hypothesize that this is because different representations can entangle and hide more or less the different explanatory factors of variation behind…

Machine Learning · Computer Science 2014-04-24 Yoshua Bengio , Aaron Courville , Pascal Vincent

Information Structure in Mappings: An Approach to Learning, Representation, and Generalisation

Despite the remarkable success of large large-scale neural networks, we still lack unified notation for thinking about and describing their representational spaces. We lack methods to reliably describe how their representations are…

Machine Learning · Computer Science 2025-06-02 Henry Conklin

Deep representation learning: Fundamentals, Perspectives, Applications, and Open Challenges

Machine Learning algorithms have had a profound impact on the field of computer science over the past few decades. These algorithms performance is greatly influenced by the representations that are derived from the data in the learning…

Machine Learning · Computer Science 2022-11-29 Kourosh T. Baghaei , Amirreza Payandeh , Pooya Fayyazsanavi , Shahram Rahimi , Zhiqian Chen , Somayeh Bakhtiari Ramezani

Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry

Integrating task-relevant information into neural representations is a fundamental ability of both biological and artificial intelligence systems. Recent theories have categorized learning into two regimes: the rich regime, where neural…

Machine Learning · Computer Science 2025-07-14 Chi-Ning Chou , Hang Le , Yichen Wang , SueYeon Chung

Connecting Neural Models Latent Geometries with Relative Geodesic Representations

Neural models learn representations of high-dimensional data on low-dimensional manifolds. Multiple factors, including stochasticities in the training process, model architectures, and additional inductive biases, may induce different…

Machine Learning · Computer Science 2025-12-02 Hanlin Yu , Berfin Inal , Georgios Arvanitidis , Soren Hauberg , Francesco Locatello , Marco Fumero

Learned feature representations are biased by complexity, learning order, position, and more

Representation learning, and interpreting learned representations, are key areas of focus in machine learning and neuroscience. Both fields generally use representations as a means to understand or improve a system's computations. In this…

Machine Learning · Computer Science 2024-09-24 Andrew Kyle Lampinen , Stephanie C. Y. Chan , Katherine Hermann

Reasoning-Modulated Representations

Neural networks leverage robust internal representations in order to generalise. Learning them is difficult, and often requires a large training set that covers the data distribution densely. We study a common setting where our task is not…

Machine Learning · Computer Science 2022-12-06 Petar Veličković , Matko Bošnjak , Thomas Kipf , Alexander Lerchner , Raia Hadsell , Razvan Pascanu , Charles Blundell