English
Related papers

Related papers: Learning Equi-angular Representations for Online C…

200 papers

Neural collapse is a phenomenon observed during the terminal phase of neural network training, characterized by the convergence of network activations, class means, and linear classifier weights to a simplex equiangular tight frame (ETF), a…

Machine Learning · Computer Science 2024-12-03 Emily Liu

Modern deep neural networks for classification usually jointly learn a backbone for representation and a linear classifier to output the logit of each class. A recent study has shown a phenomenon called neural collapse that the within-class…

Machine Learning · Computer Science 2022-10-13 Yibo Yang , Shixiang Chen , Xiangtai Li , Liang Xie , Zhouchen Lin , Dacheng Tao

Data heterogeneity is an inherent challenge that hinders the performance of federated learning (FL). Recent studies have identified the biased classifiers of local models as the key bottleneck. Previous attempts have used classifier…

Machine Learning · Computer Science 2023-08-29 Zexi Li , Xinyi Shang , Rui He , Tao Lin , Chao Wu

Continual Learning (CL) seeks to build an agent that can continuously learn a sequence of tasks, where a key challenge, namely Catastrophic Forgetting, persists due to the potential knowledge interference among different tasks. On the other…

Machine Learning · Computer Science 2026-03-10 Zheng Wang , Wanhao Yu , Li Yang , Sen Lin

Neural Collapse (NC) is a recently observed phenomenon in neural networks that characterises the solution space of the final classifier layer when trained until zero training loss. Specifically, NC suggests that the final classifier layer…

Machine Learning · Computer Science 2024-11-05 Evan Markou , Thalaiyasingam Ajanthan , Stephen Gould

One of the challenges in federated learning is the non-independent and identically distributed (non-iid) characteristics between heterogeneous devices, which cause significant differences in local updates and affect the performance of the…

Machine Learning · Computer Science 2023-04-03 Chenxi Huang , Liang Xie , Yibo Yang , Wenxiao Wang , Binbin Lin , Deng Cai

Recent studies on Neural Collapse (NC) reveal that, under class-balanced conditions, the class feature means and classifier weights spontaneously align into a simplex equiangular tight frame (ETF). In long-tailed regimes, however, severe…

Machine Learning · Computer Science 2025-12-10 Jinping Wang , Zhiqiang Gao , Zhiwu Xie

Online continual learning aims to get closer to a live learning experience by learning directly on a stream of data with temporally shifting distribution and by storing a minimum amount of data from that stream. In this empirical…

The current paradigm of training deep neural networks for classification tasks includes minimizing the empirical risk that pushes the training loss value towards zero, even after the training error has been vanished. In this terminal phase…

Machine Learning · Computer Science 2024-06-07 Hien Dang , Tho Tran , Tan Nguyen , Nhat Ho

In computer vision, machine unlearning aims to remove the influence of specific visual concepts or training images without retraining from scratch. Studies show that existing approaches often modify the classifier while leaving internal…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Anjie Le , Can Peng , Yuyuan Liu , J. Alison Noble

The recently discovered Neural collapse (NC) phenomenon states that the last-layer weights of Deep Neural Networks (DNN), converge to the so-called Equiangular Tight Frame (ETF) simplex, at the terminal phase of their training. This ETF…

Machine Learning · Computer Science 2024-03-01 Hafiz Tiomoko Ali , Umberto Michieli , Ji Joong Moon , Daehyun Kim , Mete Ozay

There is a recently discovered and intriguing phenomenon called Neural Collapse: at the terminal phase of training a deep neural network for classification, the within-class penultimate feature means and the associated classifier vectors of…

Computer Vision and Pattern Recognition · Computer Science 2023-08-10 Tong Liang , Jim Davis

Deep neural networks for time series must capture complex temporal patterns, to effectively represent dynamic data. Self- and semi-supervised learning methods show promising results in pre-training large models, which -- when finetuned for…

Machine Learning · Computer Science 2025-08-15 Yuhan Xie , William Cappelletti , Mahsa Shoaran , Pascal Frossard

Meta-learning frameworks for few-shot learning aims to learn models that can learn new skills or adapt to new environments rapidly with a few training examples. This has led to the generalizability of the developed model towards new classes…

Machine Learning · Computer Science 2023-10-10 Saaketh Medepalli , Naren Doraiswamy

Deep neural networks are susceptible to catastrophic forgetting when trained on sequential tasks. Various continual learning (CL) methods often rely on exemplar buffers or/and network expansion for balancing model stability and plasticity,…

Machine Learning · Computer Science 2024-01-18 Depeng Li , Tianqi Wang , Junwei Chen , Qining Ren , Kenji Kawaguchi , Zhigang Zeng

Neural networks are very effective when trained on large datasets for a large number of iterations. However, when they are trained on non-stationary streams of data and in an online fashion, their performance is reduced (1) by the online…

Machine Learning · Computer Science 2023-07-04 Albin Soutif--Cormerais , Antonio Carta , Joost Van de Weijer

Incremental learning often encounter challenges such as overfitting to new data and catastrophic forgetting of old data. Existing methods can effectively extend the model for new tasks while freezing the parameters of the old model, but…

Computer Vision and Pattern Recognition · Computer Science 2025-12-29 Chuangxin Zhang , Guangfeng Lin , Enhui Zhao , Kaiyang Liao , Yajun Chen

Continual Learning enables models to learn and adapt to new tasks while retaining prior knowledge. Introducing new tasks, however, can naturally lead to feature entanglement across tasks, limiting the model's capability to distinguish…

Machine Learning · Computer Science 2025-01-14 Zhongyi Zhou , Yaxin Peng , Pin Yi , Minjie Zhu , Chaomin Shen

In the online continual learning paradigm, agents must learn from a changing distribution while respecting memory and compute constraints. Experience Replay (ER), where a small subset of past data is stored and replayed alongside new data,…

Machine Learning · Computer Science 2022-05-03 Lucas Caccia , Rahaf Aljundi , Nader Asadi , Tinne Tuytelaars , Joelle Pineau , Eugene Belilovsky

Modern practice for training classification deepnets involves a Terminal Phase of Training (TPT), which begins at the epoch where training error first vanishes; During TPT, the training error stays effectively zero while training loss is…

Machine Learning · Computer Science 2020-09-23 Vardan Papyan , X. Y. Han , David L. Donoho
‹ Prev 1 2 3 10 Next ›