English
Related papers

Related papers: Compression-aware Continual Learning using Singula…

200 papers

Singular value decomposition (SVD) is one of the most popular compression methods that approximate a target matrix with smaller matrices. However, standard SVD treats the parameters within the matrix with equal importance, which is a simple…

Computation and Language · Computer Science 2022-12-19 Ting Hua , Yen-Chang Hsu , Felicity Wang , Qian Lou , Yilin Shen , Hongxia Jin

Factorizing a large matrix into small matrices is a popular strategy for model compression. Singular value decomposition (SVD) plays a vital role in this compression strategy, approximating a learned matrix with fewer parameters. However,…

Machine Learning · Computer Science 2022-07-04 Yen-Chang Hsu , Ting Hua , Sungen Chang , Qian Lou , Yilin Shen , Hongxia Jin

Continual learning in large language models (LLMs) is prone to catastrophic forgetting, where adapting to new tasks significantly degrades performance on previously learned ones. Existing methods typically rely on low-rank,…

Modern deep neural networks (DNNs) often require high memory consumption and large computational loads. In order to deploy DNN algorithms efficiently on edge or mobile devices, a series of DNN compression algorithms have been explored,…

Machine Learning · Computer Science 2020-04-21 Huanrui Yang , Minxue Tang , Wei Wen , Feng Yan , Daniel Hu , Ang Li , Hai Li , Yiran Chen

While Convolutional Neural Networks (CNNs) excel at learning complex latent-space representations, their over-parameterization can lead to overfitting and reduced performance, particularly with limited data. This, alongside their high…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Manish Sharma , Jamison Heard , Eli Saber , Panos P. Markopoulos

Low-rank approximation methods such as singular value decomposition (SVD) and its variants (e.g., Fisher-weighted SVD, Activation SVD) have recently emerged as effective tools for neural network compression. In this setting, decomposition…

Machine Learning · Computer Science 2025-12-02 Haoran Qin , Shansita Sharma , Ali Abbasi , Chayne Thrash , Soheil Kolouri

It is well known that Convolutional Neural Networks (CNNs) have significant redundancy in their filter weights. Various methods have been proposed in the literature to compress trained CNNs. These include techniques like pruning weights,…

Machine Learning · Computer Science 2019-06-12 Muhammad Tayyab , Abhijit Mahalanobis

In this paper, we introduce a new post-training compression paradigm for Large Language Models (LLMs) to facilitate their wider adoption. We delve into LLM weight low-rank decomposition, and find that the challenges of this task stem from…

Computation and Language · Computer Science 2025-08-29 Zhihang Yuan , Yuzhang Shang , Yue Song , Dawei Yang , Qiang Wu , Yan Yan , Guangyu Sun

Deep Neural Networks (DNNs) have encountered an emerging deployment challenge due to large and expensive memory and computation requirements. In this paper, we present a new Adaptive-Rank Singular Value Decomposition (ARSVD) method that…

Machine Learning · Computer Science 2025-05-13 Kalyan Cherukuri , Aarav Lala

While recent continual learning methods largely alleviate the catastrophic problem on toy-sized datasets, some issues remain to be tackled to apply them to real-world problem domains. First, a continual learning model should effectively…

Machine Learning · Computer Science 2020-02-18 Jaehong Yoon , Saehoon Kim , Eunho Yang , Sung Ju Hwang

Singular Value Decomposition (SVD) has become an important technique for reducing the computational burden of Vision Language Models (VLMs), which play a central role in tasks such as image captioning and visual question answering. Although…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Haiyu Wang , Yutong Wang , Jack Jiang , Sai Qian Zhang

Continual lifelong learning is essential to many applications. In this paper, we propose a simple but effective approach to continual deep learning. Our approach leverages the principles of deep model compression, critical weights…

Machine Learning · Computer Science 2019-10-31 Steven C. Y. Hung , Cheng-Hao Tu , Cheng-En Wu , Chien-Hung Chen , Yi-Ming Chan , Chu-Song Chen

Continual learning algorithms which keep the parameters of new tasks close to that of previous tasks, are popular in preventing catastrophic forgetting in sequential task learning settings. However, 1) the performance for the new continual…

Machine Learning · Computer Science 2023-07-21 Wei Cong , Yang Cong , Gan Sun , Yuyang Liu , Jiahua Dong

Large language models (LLMs) have rapidly grown in scale, creating substantial memory and computational costs that hinder efficient deployment. Singular value decomposition (SVD) has emerged as an effective post-training compression…

Machine Learning · Computer Science 2026-05-12 Hengyi Zhu , Zhendong Mi , Grace Li Zhang , Shaoyi Huang

Neural networks are widely used for image-related tasks but typically demand considerable computing power. Once a network has been trained, however, its memory- and compute-footprint can be reduced by compression. In this work, we focus on…

Machine Learning · Computer Science 2025-11-13 Alper Kalle , Theo Rudkiewicz , Mohamed-Oumar Ouerfelli , Mohamed Tamaazousti

While transfer learning is an effective strategy, it often overlooks the opportunity to leverage knowledge from numerous available models online. Addressing this multi-source transfer learning problem is a promising path to boost…

Machine Learning · Computer Science 2026-04-24 Marcin Osial , Bartosz Wójcik , Bartosz Zieliński , Sebastian Cygert

Continual Learning (CL) in Automatic Speech Recognition (ASR) suffers from catastrophic forgetting when adapting to new tasks, domains, or speakers. A common strategy to mitigate this is to store a subset of past data in memory for…

Audio and Speech Processing · Electrical Eng. & Systems 2026-02-06 Steven Vander Eeckt , Hugo Van hamme

Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource constrained devices such as mobile phones and embedded systems. Compression algorithms have been…

Machine Learning · Computer Science 2025-09-23 Ali Aghababaei-Harandi , Massih-Reza Amini

Deep neural networks typically impose significant computational loads and memory consumption. Moreover, the large parameters pose constraints on deploying the model on edge devices such as embedded systems. Tensor decomposition offers a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Yaping He , Linhao Jiang , Di Wu

In the field of large language model (LLM) compression, singular value decomposition (SVD) is a widely studied and adopted low-rank decomposition technique. Since SVD operates exclusively on linear modules, and these modules in LLMs are…

Machine Learning · Computer Science 2025-10-23 Lin Xv , Jingsheng Gao , Xian Gao , Ting Liu , Yuzhuo Fu
‹ Prev 1 2 3 10 Next ›