
Related papers: Nearest Class-Center Simplification through Interm…

200 papers

Neural Collapse (NC) gives a precise description of the representations of classes in the final hidden layer of classification neural networks. This description provides insights into how these networks learn features and generalize well…

Machine Learning · Computer Science 2023-08-08 Liam Parker, Emre Onal, Anton Stengel, Jake Intrater

Modern deep neural networks have achieved impressive performance on tasks from image classification to natural language processing. Surprisingly, these complex systems with massive amounts of parameters exhibit the same structural…

Machine Learning · Computer Science 2023-06-21 Hien Dang, Tho Tran, Stanley Osher, Hung Tran-The, Nhat Ho, Tan Nguyen

Deep classifier neural networks enter the terminal phase of training (TPT) when training error reaches zero and tend to exhibit intriguing Neural Collapse (NC) properties. Neural collapse essentially represents a state at which the…

Machine Learning · Computer Science 2023-04-12 Vignesh Kothapalli

Training deep neural networks for classification often includes minimizing the training loss beyond the zero training error point. In this phase of training, a "neural collapse" behavior has been observed: the variability of features…

Machine Learning · Computer Science 2023-05-30 Tom Tirer, Haoxiang Huang, Jonathan Niles-Weed

Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where training error first vanishes; During TPT, the training error stays effectively zero while training loss is…

Machine Learning · Computer Science 2020-09-23 Vardan Papyan, X. Y. Han, David L. Donoho

Recent results in the literature suggest that the penultimate (second-to-last) layer representations of neural networks that are trained for classification exhibit a clustering property called neural collapse (NC). We study the implicit…

Machine Learning · Computer Science 2022-09-29 Tomer Galanti, Liane Galanti, Ido Ben-Shaul

When training deep neural networks for classification tasks, an intriguing empirical phenomenon has been widely observed in the last-layer classifiers and features, where (i) the class means and the last-layer classifiers all collapse to…

Machine Learning · Computer Science 2022-03-15 Jinxin Zhou, Xiao Li, Tianyu Ding, Chong You, Qing Qu, Zhihui Zhu

Neural Collapse is a phenomenon that helps identify sparse and low rank structures in deep classifiers. Recent work has extended the definition of neural collapse to regression problems, albeit only measuring the phenomenon at the last…

Machine Learning · Computer Science 2026-03-26 Akshay Rangamani, Altay Unal

The recently discovered Neural Collapse (NC) phenomenon occurs pervasively in today's deep net training paradigm of driving cross-entropy (CE) loss towards zero. During NC, last-layer features collapse to their class-means, both classifiers…

Machine Learning · Computer Science 2022-05-11 X. Y. Han, Vardan Papyan, David L. Donoho

A recent study has shown a phenomenon called neural collapse in that the within-class means of features and the classifier weight vectors converge to the vertices of a simplex equiangular tight frame at the terminal phase of training for…

Computer Vision and Pattern Recognition · Computer Science 2023-01-04 Zhisheng Zhong, Jiequan Cui, Yibo Yang, Xiaoyang Wu, Xiaojuan Qi, Xiangyu Zhang, Jiaya Jia

The empirical emergence of neural collapse -- a surprising symmetry in the feature representations of the training data in the penultimate layer of deep neural networks -- has spurred a line of theoretical research aimed at its…

Machine Learning · Computer Science 2025-05-22 Peter Súkeník, Christoph H. Lampert, Marco Mondelli

When training overparameterized deep networks for classification tasks, it has been widely observed that the learned features exhibit a so-called "neural collapse" phenomenon. More specifically, for the output features of the penultimate…

Machine Learning · Computer Science 2023-03-09 Can Yaras, Peng Wang, Zhihui Zhu, Laura Balzano, Qing Qu

The current paradigm of training deep neural networks for classification tasks includes minimizing the empirical risk that pushes the training loss value towards zero, even after the training error has vanished. In this terminal phase…

Machine Learning · Computer Science 2024-06-07 Hien Dang, Tho Tran, Tan Nguyen, Nhat Ho

The recent work of Papyan, Han, & Donoho (2020) presented an intriguing "Neural Collapse" phenomenon, showing a structural property of interpolating classifiers in the late stage of training. This opened a rich area of exploration studying…

Machine Learning · Computer Science 2022-02-18 Like Hui, Mikhail Belkin, Preetum Nakkiran

Modern deep neural networks for classification usually jointly learn a backbone for representation and a linear classifier to output the logit of each class. A recent study has shown a phenomenon called neural collapse that the within-class…

Machine Learning · Computer Science 2022-10-13 Yibo Yang, Shixiang Chen, Xiangtai Li, Liang Xie, Zhouchen Lin, Dacheng Tao

We provide the first global optimization landscape analysis of Neural Collapse -- an intriguing empirical phenomenon that arises in the last-layer classifiers and features of neural networks during the terminal phase of training. As…

Machine Learning · Computer Science 2021-05-07 Zhihui Zhu, Tianyu Ding, Jinxin Zhou, Xiao Li, Chong You, Jeremias Sulam, Qing Qu

Recent studies have noted an intriguing phenomenon termed Neural Collapse, that is, when the neural networks establish the right correlation between feature spaces and the training targets, their last-layer features, together with the…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Yining Wang, Junjie Sun, Chenyue Wang, Mi Zhang, Min Yang

Neural collapse is a phenomenon observed during the terminal phase of neural network training, characterized by the convergence of network activations, class means, and linear classifier weights to a simplex equiangular tight frame (ETF), a…

Machine Learning · Computer Science 2024-12-03 Emily Liu

Many of the recent remarkable advances in computer vision and language models can be attributed to the success of transfer learning via the pre-training of large foundation models. However, a theoretical framework which explains this…

Machine Learning · Computer Science 2024-12-19 Michael Munn, Benoit Dherin, Javier Gonzalvo

Deep neural networks (DNNs) at convergence consistently represent the training data in the last layer via a highly symmetric geometric structure referred to as neural collapse. This empirical evidence has spurred a line of theoretical…

Machine Learning · Computer Science 2024-10-08 Arthur Jacot, Peter Súkeník, Zihan Wang, Marco Mondelli