Related papers: Learning Equi-angular Representations for Online C…

Leveraging Intermediate Neural Collapse with Simplex ETFs for Efficient Deep Neural Networks

Neural collapse is a phenomenon observed during the terminal phase of neural network training, characterized by the convergence of network activations, class means, and linear classifier weights to a simplex equiangular tight frame (ETF), a…

Machine Learning · Computer Science 2024-12-03 Emily Liu

Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network?

Modern deep neural networks for classification usually jointly learn a backbone for representation and a linear classifier to output the logit of each class. A recent study has shown a phenomenon called neural collapse that the within-class…

Machine Learning · Computer Science 2022-10-13 Yibo Yang, Shixiang Chen, Xiangtai Li, Liang Xie, Zhouchen Lin, Dacheng Tao

No Fear of Classifier Biases: Neural Collapse Inspired Federated Learning with Synthetic and Fixed Classifier

Data heterogeneity is an inherent challenge that hinders the performance of federated learning (FL). Recent studies have identified the biased classifiers of local models as the key bottleneck. Previous attempts have used classifier…

Machine Learning · Computer Science 2023-08-29 Zexi Li, Xinyi Shang, Rui He, Tao Lin, Chao Wu

Rethinking Continual Learning with Progressive Neural Collapse

Continual Learning (CL) seeks to build an agent that can continuously learn a sequence of tasks, where a key challenge, namely Catastrophic Forgetting, persists due to the potential knowledge interference among different tasks. On the other…

Machine Learning · Computer Science 2026-03-10 Zheng Wang, Wanhao Yu, Li Yang, Sen Lin

Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame

Neural Collapse (NC) is a recently observed phenomenon in neural networks that characterises the solution space of the final classifier layer when trained until zero training loss. Specifically, NC suggests that the final classifier layer…

Machine Learning · Computer Science 2024-11-05 Evan Markou, Thalaiyasingam Ajanthan, Stephen Gould

Neural Collapse Inspired Federated Learning with Non-iid Data

One of the challenges in federated learning is the non-independent and identically distributed (non-iid) characteristics between heterogeneous devices, which cause significant differences in local updates and affect the performance of the…

Machine Learning · Computer Science 2023-04-03 Chenxi Huang, Liang Xie, Yibo Yang, Wenxiao Wang, Binbin Lin, Deng Cai

Space Alignment Matters: The Missing Piece for Inducing Neural Collapse in Long-Tailed Learning

Recent studies on Neural Collapse (NC) reveal that, under class-balanced conditions, the class feature means and classifier weights spontaneously align into a simplex equiangular tight frame (ETF). In long-tailed regimes, however, severe…

Machine Learning · Computer Science 2025-12-10 Jinping Wang, Zhiqiang Gao, Zhiwu Xie

A Comprehensive Empirical Evaluation on Online Continual Learning

Online continual learning aims to get closer to a live learning experience by learning directly on a stream of data with temporally shifting distribution and by storing a minimum amount of data from that stream. In this empirical…

Machine Learning · Computer Science 2023-09-26 Albin Soutif-Cormerais, Antonio Carta, Andrea Cossu, Julio Hurtado, Hamed Hemati, Vincenzo Lomonaco, Joost Van de Weijer

Neural Collapse for Cross-entropy Class-Imbalanced Learning with Unconstrained ReLU Feature Model

The current paradigm of training deep neural networks for classification tasks includes minimizing the empirical risk that pushes the training loss value towards zero, even after the training error has vanished. In this terminal phase…

Machine Learning · Computer Science 2024-06-07 Hien Dang, Tho Tran, Tan Nguyen, Nhat Ho

POUR: A Provably Optimal Method for Unlearning Representations via Neural Collapse

In computer vision, machine unlearning aims to remove the influence of specific visual concepts or training images without retraining from scratch. Studies show that existing approaches often modify the classifier while leaving internal…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Anjie Le, Can Peng, Yuyuan Liu, J. Alison Noble

Deep Neural Network Models Trained With A Fixed Random Classifier Transfer Better Across Domains

The recently discovered Neural Collapse (NC) phenomenon states that the last-layer weights of Deep Neural Networks (DNNs) converge to the so-called Equiangular Tight Frame (ETF) simplex at the terminal phase of their training. This ETF…

Machine Learning · Computer Science 2024-03-01 Hafiz Tiomoko Ali, Umberto Michieli, Ji Joong Moon, Daehyun Kim, Mete Ozay

Inducing Neural Collapse to a Fixed Hierarchy-Aware Frame for Reducing Mistake Severity

There is a recently discovered and intriguing phenomenon called Neural Collapse: at the terminal phase of training a deep neural network for classification, the within-class penultimate feature means and the associated classifier vectors of…

Computer Vision and Pattern Recognition · Computer Science 2023-08-10 Tong Liang, Jim Davis

rETF-semiSL: Semi-Supervised Learning for Neural Collapse in Temporal Data

Deep neural networks for time series must capture complex temporal patterns, to effectively represent dynamic data. Self- and semi-supervised learning methods show promising results in pre-training large models, which -- when finetuned for…

Machine Learning · Computer Science 2025-08-15 Yuhan Xie, William Cappelletti, Mahsa Shoaran, Pascal Frossard

On the Role of Neural Collapse in Meta Learning Models for Few-shot Learning

Meta-learning frameworks for few-shot learning aim to learn models that can learn new skills or adapt to new environments rapidly with a few training examples. This has led to the generalizability of the developed model towards new classes…

Machine Learning · Computer Science 2023-10-10 Saaketh Medepalli, Naren Doraiswamy

Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

Deep neural networks are susceptible to catastrophic forgetting when trained on sequential tasks. Various continual learning (CL) methods often rely on exemplar buffers and/or network expansion for balancing model stability and plasticity,…

Machine Learning · Computer Science 2024-01-18 Depeng Li, Tianqi Wang, Junwei Chen, Qining Ren, Kenji Kawaguchi, Zhigang Zeng

Improving Online Continual Learning Performance and Stability with Temporal Ensembles

Neural networks are very effective when trained on large datasets for a large number of iterations. However, when they are trained on non-stationary streams of data and in an online fashion, their performance is reduced (1) by the online…

Machine Learning · Computer Science 2023-07-04 Albin Soutif-Cormerais, Antonio Carta, Joost Van de Weijer

Scalable Class-Incremental Learning Based on Parametric Neural Collapse

Incremental learning often encounters challenges such as overfitting to new data and catastrophic forgetting of old data. Existing methods can effectively extend the model for new tasks while freezing the parameters of the old model, but…

Computer Vision and Pattern Recognition · Computer Science 2025-12-29 Chuangxin Zhang, Guangfeng Lin, Enhui Zhao, Kaiyang Liao, Yajun Chen

Fresh-CL: Feature Realignment through Experts on Hypersphere in Continual Learning

Continual Learning enables models to learn and adapt to new tasks while retaining prior knowledge. Introducing new tasks, however, can naturally lead to feature entanglement across tasks, limiting the model's capability to distinguish…

Machine Learning · Computer Science 2025-01-14 Zhongyi Zhou, Yaxin Peng, Pin Yi, Minjie Zhu, Chaomin Shen

New Insights on Reducing Abrupt Representation Change in Online Continual Learning

In the online continual learning paradigm, agents must learn from a changing distribution while respecting memory and compute constraints. Experience Replay (ER), where a small subset of past data is stored and replayed alongside new data,…

Machine Learning · Computer Science 2022-05-03 Lucas Caccia, Rahaf Aljundi, Nader Asadi, Tinne Tuytelaars, Joelle Pineau, Eugene Belilovsky

Prevalence of Neural Collapse during the terminal phase of deep learning training

Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where training error first vanishes. During TPT, the training error stays effectively zero while training loss is…

Machine Learning · Computer Science 2020-09-23 Vardan Papyan, X. Y. Han, David L. Donoho