Related papers: Feature Collapse

Generalized Neural Collapse for a Large Number of Classes

Neural collapse provides an elegant mathematical characterization of learned last layer representations (a.k.a. features) and classifier weights in deep classification models. Such results not only provide insights but also motivate new…

Machine Learning · Computer Science 2023-10-30 Jiachen Jiang , Jinxin Zhou , Peng Wang , Qing Qu , Dustin Mixon , Chong You , Zhihui Zhu

Limitations of Neural Collapse for Understanding Generalization in Deep Learning

The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying…

Machine Learning · Computer Science 2022-02-18 Like Hui , Mikhail Belkin , Preetum Nakkiran

Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold

When training overparameterized deep networks for classification tasks, it has been widely observed that the learned features exhibit a so-called "neural collapse" phenomenon. More specifically, for the output features of the penultimate…

Machine Learning · Computer Science 2023-03-09 Can Yaras , Peng Wang , Zhihui Zhu , Laura Balzano , Qing Qu

On the Role of Neural Collapse in Transfer Learning

We study the ability of foundation models to learn representations for classification that are transferable to new, unseen classes. Recent results in the literature show that representations learned by a single classifier over many classes…

Machine Learning · Computer Science 2022-01-05 Tomer Galanti , András György , Marcus Hutter

Inducing Neural Collapse in Deep Long-tailed Learning

Although deep neural networks achieve tremendous success on various classification tasks, the generalization ability drops sheer when training datasets exhibit long-tailed distributions. One of the reasons is that the learned…

Machine Learning · Computer Science 2023-02-27 Xuantong Liu , Jianfeng Zhang , Tianyang Hu , He Cao , Lujia Pan , Yuan Yao

Rethinking Generalization of Neural Models: A Named Entity Recognition Case Study

While neural network-based models have achieved impressive performance on a large body of NLP tasks, the generalization behavior of different models remains poorly understood: Does this excellent performance imply a perfect generalization…

Computation and Language · Computer Science 2020-01-14 Jinlan Fu , Pengfei Liu , Qi Zhang , Xuanjing Huang

Linguistic Collapse: Neural Collapse in (Large) Language Models

Neural collapse ($\mathcal{NC}$) is a phenomenon observed in classification tasks where top-layer representations collapse into their class means, which become equinorm, equiangular and aligned with the classifiers. These behaviours --…

Machine Learning · Computer Science 2024-11-27 Robert Wu , Vardan Papyan

Going Beyond Neural Network Feature Similarity: The Network Feature Complexity and Its Interpretation Using Category Theory

The behavior of neural networks still remains opaque, and a recently widely noted phenomenon is that networks often achieve similar performance when initialized with different random parameters. This phenomenon has attracted significant…

Machine Learning · Computer Science 2023-11-28 Yiting Chen , Zhanpeng Zhou , Junchi Yan

Memorization-Dilation: Modeling Neural Collapse Under Label Noise

The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embedding of…

Machine Learning · Computer Science 2023-04-05 Duc Anh Nguyen , Ron Levie , Julian Lienen , Gitta Kutyniok , Eyke Hüllermeier

Flatness is Necessary, Neural Collapse is Not: Rethinking Generalization via Grokking

Neural collapse, i.e., the emergence of highly symmetric, class-wise clustered representations, is frequently observed in deep networks and is often assumed to reflect or enable generalization. In parallel, flatness of the loss landscape…

Machine Learning · Computer Science 2026-02-05 Ting Han , Linara Adilova , Henning Petzka , Jens Kleesiek , Michael Kamp

Feature learning is decoupled from generalization in high capacity neural networks

Neural networks outperform kernel methods, sometimes by orders of magnitude, e.g. on staircase functions. This advantage stems from the ability of neural networks to learn features, adapting their hidden representations to better capture…

Machine Learning · Computer Science 2025-07-29 Niclas Alexander Göring , Charles London , Abdurrahman Hadi Erturk , Chris Mingard , Yoonsoo Nam , Ard A. Louis

Learning to Generalize Compositionally by Transferring Across Semantic Parsing Tasks

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu , Peter Shaw , Tal Linzen , Fei Sha

Deep Neural Regression Collapse

Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent work has extended the definition of neural collapse to regression problems, albeit only measuring the phenomenon at the last…

Machine Learning · Computer Science 2026-03-26 Akshay Rangamani , Altay Unal

Perturbation Analysis of Neural Collapse

Training deep neural networks for classification often includes minimizing the training loss beyond the zero training error point. In this phase of training, a "neural collapse" behavior has been observed: the variability of features…

Machine Learning · Computer Science 2023-05-30 Tom Tirer , Haoxiang Huang , Jonathan Niles-Weed

On the Role of Neural Collapse in Meta Learning Models for Few-shot Learning

Meta-learning frameworks for few-shot learning aims to learn models that can learn new skills or adapt to new environments rapidly with a few training examples. This has led to the generalizability of the developed model towards new classes…

Machine Learning · Computer Science 2023-10-10 Saaketh Medepalli , Naren Doraiswamy

Analyzing and Interpreting Convolutional Neural Networks in NLP

Convolutional neural networks have been successfully applied to various NLP tasks. However, it is not obvious whether they model different linguistic patterns such as negation, intensification, and clause compositionality to help the…

Computation and Language · Computer Science 2018-10-23 Mahnaz Koupaee , William Yang Wang

Towards Demystifying the Generalization Behaviors When Neural Collapse Emerges

Neural Collapse (NC) is a well-known phenomenon of deep neural networks in the terminal phase of training (TPT). It is characterized by the collapse of features and classifier into a symmetrical structure, known as simplex equiangular tight…

Machine Learning · Computer Science 2023-10-13 Peifeng Gao , Qianqian Xu , Yibo Yang , Peisong Wen , Huiyang Shao , Zhiyong Yang , Bernard Ghanem , Qingming Huang

Probability Distribution Collapse: A Critical Bottleneck to Compact Unsupervised Neural Grammar Induction

Unsupervised neural grammar induction aims to learn interpretable hierarchical structures from language data. However, existing models face an expressiveness bottleneck, often resulting in unnecessarily large yet underperforming grammars.…

Computation and Language · Computer Science 2025-09-26 Jinwook Park , Kangil Kim

A Neural Collapse Perspective on Feature Evolution in Graph Neural Networks

Graph neural networks (GNNs) have become increasingly popular for classification tasks on graph-structured data. Yet, the interplay between graph topology and feature evolution in GNNs is not well understood. In this paper, we focus on…

Machine Learning · Computer Science 2023-10-27 Vignesh Kothapalli , Tom Tirer , Joan Bruna

Can We Understand Plasticity Through Neural Collapse?

This paper explores the connection between two recently identified phenomena in deep learning: plasticity loss and neural collapse. We analyze their correlation in different scenarios, revealing a significant association during the initial…

Machine Learning · Computer Science 2024-04-04 Guglielmo Bonifazi , Iason Chalas , Gian Hess , Jakub Łucki