Related papers: Nearest Class-Center Simplification through Interm…

Neural Collapse in the Intermediate Hidden Layers of Classification Neural Networks

Neural Collapse (NC) gives a precise description of the representations of classes in the final hidden layer of classification neural networks. This description provides insights into how these networks learn features and generalize well…

Machine Learning · Computer Science 2023-08-08 Liam Parker , Emre Onal , Anton Stengel , Jake Intrater

Neural Collapse in Deep Linear Networks: From Balanced to Imbalanced Data

Modern deep neural networks have achieved impressive performance on tasks from image classification to natural language processing. Surprisingly, these complex systems with massive amounts of parameters exhibit the same structural…

Machine Learning · Computer Science 2023-06-21 Hien Dang , Tho Tran , Stanley Osher , Hung Tran-The , Nhat Ho , Tan Nguyen

Neural Collapse: A Review on Modelling Principles and Generalization

Deep classifier neural networks enter the terminal phase of training (TPT) when training error reaches zero and tend to exhibit intriguing Neural Collapse (NC) properties. Neural collapse essentially represents a state at which the…

Machine Learning · Computer Science 2023-04-12 Vignesh Kothapalli

Perturbation Analysis of Neural Collapse

Training deep neural networks for classification often includes minimizing the training loss beyond the zero training error point. In this phase of training, a "neural collapse" behavior has been observed: the variability of features…

Machine Learning · Computer Science 2023-05-30 Tom Tirer , Haoxiang Huang , Jonathan Niles-Weed

Prevalence of Neural Collapse during the terminal phase of deep learning training

Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where training error first vanishes; During TPT, the training error stays effectively zero while training loss is…

Machine Learning · Computer Science 2020-09-23 Vardan Papyan , X. Y. Han , David L. Donoho

On the Implicit Bias Towards Minimal Depth of Deep Neural Networks

Recent results in the literature suggest that the penultimate (second-to-last) layer representations of neural networks that are trained for classification exhibit a clustering property called neural collapse (NC). We study the implicit…

Machine Learning · Computer Science 2022-09-29 Tomer Galanti , Liane Galanti , Ido Ben-Shaul

On the Optimization Landscape of Neural Collapse under MSE Loss: Global Optimality with Unconstrained Features

When training deep neural networks for classification tasks, an intriguing empirical phenomenon has been widely observed in the last-layer classifiers and features, where (i) the class means and the last-layer classifiers all collapse to…

Machine Learning · Computer Science 2022-03-15 Jinxin Zhou , Xiao Li , Tianyu Ding , Chong You , Qing Qu , Zhihui Zhu

Deep Neural Regression Collapse

Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent work has extended the definition of neural collapse to regression problems, albeit only measuring the phenomenon at the last…

Machine Learning · Computer Science 2026-03-26 Akshay Rangamani , Altay Unal

Neural Collapse Under MSE Loss: Proximity to and Dynamics on the Central Path

The recently discovered Neural Collapse (NC) phenomenon occurs pervasively in today's deep net training paradigm of driving cross-entropy (CE) loss towards zero. During NC, last-layer features collapse to their class-means, both classifiers…

Machine Learning · Computer Science 2022-05-11 X. Y. Han , Vardan Papyan , David L. Donoho

Understanding Imbalanced Semantic Segmentation Through Neural Collapse

A recent study has shown a phenomenon called neural collapse in that the within-class means of features and the classifier weight vectors converge to the vertices of a simplex equiangular tight frame at the terminal phase of training for…

Computer Vision and Pattern Recognition · Computer Science 2023-01-04 Zhisheng Zhong , Jiequan Cui , Yibo Yang , Xiaoyang Wu , Xiaojuan Qi , Xiangyu Zhang , Jiaya Jia

Neural Collapse is Globally Optimal in Deep Regularized ResNets and Transformers

The empirical emergence of neural collapse -- a surprising symmetry in the feature representations of the training data in the penultimate layer of deep neural networks -- has spurred a line of theoretical research aimed at its…

Machine Learning · Computer Science 2025-05-22 Peter Súkeník , Christoph H. Lampert , Marco Mondelli

Neural Collapse with Normalized Features: A Geometric Analysis over the Riemannian Manifold

When training overparameterized deep networks for classification tasks, it has been widely observed that the learned features exhibit a so-called "neural collapse" phenomenon. More specifically, for the output features of the penultimate…

Machine Learning · Computer Science 2023-03-09 Can Yaras , Peng Wang , Zhihui Zhu , Laura Balzano , Qing Qu

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Feature Model

The current paradigm of training deep neural networks for classification tasks includes minimizing the empirical risk that pushes the training loss value towards zero, even after the training error has been vanished. In this terminal phase…

Machine Learning · Computer Science 2024-06-07 Hien Dang , Tho Tran , Tan Nguyen , Nhat Ho

Limitations of Neural Collapse for Understanding Generalization in Deep Learning

The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying…

Machine Learning · Computer Science 2022-02-18 Like Hui , Mikhail Belkin , Preetum Nakkiran

Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?

Modern deep neural networks for classification usually jointly learn a backbone for representation and a linear classifier to output the logit of each class. A recent study has shown a phenomenon called neural collapse that the within-class…

Machine Learning · Computer Science 2022-10-13 Yibo Yang , Shixiang Chen , Xiangtai Li , Liang Xie , Zhouchen Lin , Dacheng Tao

A Geometric Analysis of Neural Collapse with Unconstrained Features

We provide the first global optimization landscape analysis of $Neural\;Collapse$ -- an intriguing empirical phenomenon that arises in the last-layer classifiers and features of neural networks during the terminal phase of training. As…

Machine Learning · Computer Science 2021-05-07 Zhihui Zhu , Tianyu Ding , Jinxin Zhou , Xiao Li , Chong You , Jeremias Sulam , Qing Qu

Navigate Beyond Shortcuts: Debiased Learning through the Lens of Neural Collapse

Recent studies have noted an intriguing phenomenon termed Neural Collapse, that is, when the neural networks establish the right correlation between feature spaces and the training targets, their last-layer features, together with the…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Yining Wang , Junjie Sun , Chenyue Wang , Mi Zhang , Min Yang

Leveraging Intermediate Neural Collapse with Simplex ETFs for Efficient Deep Neural Networks

Neural collapse is a phenomenon observed during the terminal phase of neural network training, characterized by the convergence of network activations, class means, and linear classifier weights to a simplex equiangular tight frame (ETF), a…

Machine Learning · Computer Science 2024-12-03 Emily Liu

The Impact of Geometric Complexity on Neural Collapse in Transfer Learning

Many of the recent remarkable advances in computer vision and language models can be attributed to the success of transfer learning via the pre-training of large foundation models. However, a theoretical framework which explains this…

Machine Learning · Computer Science 2024-12-19 Michael Munn , Benoit Dherin , Javier Gonzalvo

Wide Neural Networks Trained with Weight Decay Provably Exhibit Neural Collapse

Deep neural networks (DNNs) at convergence consistently represent the training data in the last layer via a highly symmetric geometric structure referred to as neural collapse. This empirical evidence has spurred a line of theoretical…

Machine Learning · Computer Science 2024-10-08 Arthur Jacot , Peter Súkeník , Zihan Wang , Marco Mondelli