Related papers: Feature Collapse

200 papers

Neural collapse provides an elegant mathematical characterization of learned last layer representations (a.k.a. features) and classifier weights in deep classification models. Such results not only provide insights but also motivate new…

Machine Learning · Computer Science 2023-10-30 Jiachen Jiang, Jinxin Zhou, Peng Wang, Qing Qu, Dustin Mixon, Chong You, Zhihui Zhu

The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying…

Machine Learning · Computer Science 2022-02-18 Like Hui, Mikhail Belkin, Preetum Nakkiran

When training overparameterized deep networks for classification tasks, it has been widely observed that the learned features exhibit a so-called "neural collapse" phenomenon. More specifically, for the output features of the penultimate…

Machine Learning · Computer Science 2023-03-09 Can Yaras, Peng Wang, Zhihui Zhu, Laura Balzano, Qing Qu

We study the ability of foundation models to learn representations for classification that are transferable to new, unseen classes. Recent results in the literature show that representations learned by a single classifier over many classes…

Machine Learning · Computer Science 2022-01-05 Tomer Galanti, András György, Marcus Hutter

Although deep neural networks achieve tremendous success on various classification tasks, the generalization ability drops sharply when training datasets exhibit long-tailed distributions. One of the reasons is that the learned…

Machine Learning · Computer Science 2023-02-27 Xuantong Liu, Jianfeng Zhang, Tianyang Hu, He Cao, Lujia Pan, Yuan Yao

While neural network-based models have achieved impressive performance on a large body of NLP tasks, the generalization behavior of different models remains poorly understood: Does this excellent performance imply a perfect generalization…

Computation and Language · Computer Science 2020-01-14 Jinlan Fu, Pengfei Liu, Qi Zhang, Xuanjing Huang

Neural collapse ($\mathcal{NC}$) is a phenomenon observed in classification tasks where top-layer representations collapse into their class means, which become equinorm, equiangular and aligned with the classifiers. These behaviours --…

Machine Learning · Computer Science 2024-11-27 Robert Wu, Vardan Papyan

The behavior of neural networks still remains opaque, and a recently widely noted phenomenon is that networks often achieve similar performance when initialized with different random parameters. This phenomenon has attracted significant…

Machine Learning · Computer Science 2023-11-28 Yiting Chen, Zhanpeng Zhou, Junchi Yan

The notion of neural collapse refers to several emergent phenomena that have been empirically observed across various canonical classification problems. During the terminal phase of training a deep neural network, the feature embedding of…

Machine Learning · Computer Science 2023-04-05 Duc Anh Nguyen, Ron Levie, Julian Lienen, Gitta Kutyniok, Eyke Hüllermeier

Neural collapse, i.e., the emergence of highly symmetric, class-wise clustered representations, is frequently observed in deep networks and is often assumed to reflect or enable generalization. In parallel, flatness of the loss landscape…

Machine Learning · Computer Science 2026-02-05 Ting Han, Linara Adilova, Henning Petzka, Jens Kleesiek, Michael Kamp

Neural networks outperform kernel methods, sometimes by orders of magnitude, e.g. on staircase functions. This advantage stems from the ability of neural networks to learn features, adapting their hidden representations to better capture…

Machine Learning · Computer Science 2025-07-29 Niclas Alexander Göring, Charles London, Abdurrahman Hadi Erturk, Chris Mingard, Yoonsoo Nam, Ard A. Louis

Neural network models often generalize poorly to mismatched domains or distributions. In NLP, this issue arises in particular when models are expected to generalize compositionally, that is, to novel combinations of familiar words and…

Computation and Language · Computer Science 2021-11-10 Wang Zhu, Peter Shaw, Tal Linzen, Fei Sha

Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent work has extended the definition of neural collapse to regression problems, albeit only measuring the phenomenon at the last…

Machine Learning · Computer Science 2026-03-26 Akshay Rangamani, Altay Unal

Training deep neural networks for classification often includes minimizing the training loss beyond the zero training error point. In this phase of training, a "neural collapse" behavior has been observed: the variability of features…

Machine Learning · Computer Science 2023-05-30 Tom Tirer, Haoxiang Huang, Jonathan Niles-Weed

Meta-learning frameworks for few-shot learning aim to learn models that can learn new skills or adapt to new environments rapidly with a few training examples. This has led to the generalizability of the developed model towards new classes…

Machine Learning · Computer Science 2023-10-10 Saaketh Medepalli, Naren Doraiswamy

Convolutional neural networks have been successfully applied to various NLP tasks. However, it is not obvious whether they model different linguistic patterns such as negation, intensification, and clause compositionality to help the…

Computation and Language · Computer Science 2018-10-23 Mahnaz Koupaee, William Yang Wang

Neural Collapse (NC) is a well-known phenomenon of deep neural networks in the terminal phase of training (TPT). It is characterized by the collapse of features and classifier into a symmetrical structure, known as simplex equiangular tight…

Machine Learning · Computer Science 2023-10-13 Peifeng Gao, Qianqian Xu, Yibo Yang, Peisong Wen, Huiyang Shao, Zhiyong Yang, Bernard Ghanem, Qingming Huang

Unsupervised neural grammar induction aims to learn interpretable hierarchical structures from language data. However, existing models face an expressiveness bottleneck, often resulting in unnecessarily large yet underperforming grammars.…

Computation and Language · Computer Science 2025-09-26 Jinwook Park, Kangil Kim

Graph neural networks (GNNs) have become increasingly popular for classification tasks on graph-structured data. Yet, the interplay between graph topology and feature evolution in GNNs is not well understood. In this paper, we focus on…

Machine Learning · Computer Science 2023-10-27 Vignesh Kothapalli, Tom Tirer, Joan Bruna

This paper explores the connection between two recently identified phenomena in deep learning: plasticity loss and neural collapse. We analyze their correlation in different scenarios, revealing a significant association during the initial…

Machine Learning · Computer Science 2024-04-04 Guglielmo Bonifazi, Iason Chalas, Gian Hess, Jakub Łucki