Related papers: Compression-aware Continual Learning using Singula…

Numerical Optimizations for Weighted Low-rank Estimation on Language Model

Singular value decomposition (SVD) is one of the most popular compression methods that approximate a target matrix with smaller matrices. However, standard SVD treats the parameters within the matrix with equal importance, which is a simple…

Computation and Language · Computer Science 2022-12-19 Ting Hua, Yen-Chang Hsu, Felicity Wang, Qian Lou, Yilin Shen, Hongxia Jin

Language model compression with weighted low-rank factorization

Factorizing a large matrix into small matrices is a popular strategy for model compression. Singular value decomposition (SVD) plays a vital role in this compression strategy, approximating a learned matrix with fewer parameters. However,…

Machine Learning · Computer Science 2022-07-04 Yen-Chang Hsu, Ting Hua, Sungen Chang, Qian Lou, Yilin Shen, Hongxia Jin

Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning

Continual learning in large language models (LLMs) is prone to catastrophic forgetting, where adapting to new tasks significantly degrades performance on previously learned ones. Existing methods typically rely on low-rank,…

Machine Learning · Computer Science 2025-04-10 Nikhil Shivakumar Nayak, Krishnateja Killamsetty, Ligong Han, Abhishek Bhandwaldar, Prateek Chanda, Kai Xu, Hao Wang, Aldo Pareja, Oleg Silkin, Mustafa Eyceoz, Akash Srivastava

Learning Low-rank Deep Neural Networks via Singular Vector Orthogonality Regularization and Singular Value Sparsification

Modern deep neural networks (DNNs) often require high memory consumption and large computational loads. In order to deploy DNN algorithms efficiently on edge or mobile devices, a series of DNN compression algorithms have been explored,…

Machine Learning · Computer Science 2020-04-21 Huanrui Yang, Minxue Tang, Wei Wen, Feng Yan, Daniel Hu, Ang Li, Hai Li, Yiran Chen

Convolutional Neural Network Compression via Dynamic Parameter Rank Pruning

While Convolutional Neural Networks (CNNs) excel at learning complex latent-space representations, their over-parameterization can lead to overfitting and reduced performance, particularly with limited data. This, alongside their high…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Manish Sharma, Jamison Heard, Eli Saber, Panos P. Markopoulos

Low-Rank Prehab: Preparing Neural Networks for SVD Compression

Low-rank approximation methods such as singular value decomposition (SVD) and its variants (e.g., Fisher-weighted SVD, Activation SVD) have recently emerged as effective tools for neural network compression. In this setting, decomposition…

Machine Learning · Computer Science 2025-12-02 Haoran Qin, Shansita Sharma, Ali Abbasi, Chayne Thrash, Soheil Kolouri

BasisConv: A method for compressed representation and learning in CNNs

It is well known that Convolutional Neural Networks (CNNs) have significant redundancy in their filter weights. Various methods have been proposed in the literature to compress trained CNNs. These include techniques like pruning weights,…

Machine Learning · Computer Science 2019-06-12 Muhammad Tayyab, Abhijit Mahalanobis

ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models

In this paper, we introduce a new post-training compression paradigm for Large Language Models (LLMs) to facilitate their wider adoption. We delve into LLM weight low-rank decomposition, and find that the challenges of this task stem from…

Computation and Language · Computer Science 2025-08-29 Zhihang Yuan, Yuzhang Shang, Yue Song, Dawei Yang, Qiang Wu, Yan Yan, Guangyu Sun

Low-Rank Matrix Approximation for Neural Network Compression

Deep Neural Networks (DNNs) have encountered an emerging deployment challenge due to large and expensive memory and computation requirements. In this paper, we present a new Adaptive-Rank Singular Value Decomposition (ARSVD) method that…

Machine Learning · Computer Science 2025-05-13 Kalyan Cherukuri, Aarav Lala

Scalable and Order-robust Continual Learning with Additive Parameter Decomposition

While recent continual learning methods largely alleviate the catastrophic problem on toy-sized datasets, some issues remain to be tackled to apply them to real-world problem domains. First, a continual learning model should effectively…

Machine Learning · Computer Science 2020-02-18 Jaehong Yoon, Saehoon Kim, Eunho Yang, Sung Ju Hwang

WSVD: Weighted Low-Rank Approximation for Fast and Efficient Execution of Low-Precision Vision-Language Models

Singular Value Decomposition (SVD) has become an important technique for reducing the computational burden of Vision Language Models (VLMs), which play a central role in tasks such as image captioning and visual question answering. Although…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Haiyu Wang, Yutong Wang, Jack Jiang, Sai Qian Zhang

Compacting, Picking and Growing for Unforgetting Continual Learning

Continual lifelong learning is essential to many applications. In this paper, we propose a simple but effective approach to continual deep learning. Our approach leverages the principles of deep model compression, critical weights…

Machine Learning · Computer Science 2019-10-31 Steven C. Y. Hung, Cheng-Hao Tu, Cheng-En Wu, Chien-Hung Chen, Yi-Ming Chan, Chu-Song Chen

Self-paced Weight Consolidation for Continual Learning

Continual learning algorithms which keep the parameters of new tasks close to that of previous tasks, are popular in preventing catastrophic forgetting in sequential task learning settings. However, 1) the performance for the new continual…

Machine Learning · Computer Science 2023-07-21 Wei Cong, Yang Cong, Gan Sun, Yuyang Liu, Jiahua Dong

Different Prompts, Different Ranks: Prompt-aware Dynamic Rank Selection for SVD-based LLM Compression

Large language models (LLMs) have rapidly grown in scale, creating substantial memory and computational costs that hinder efficient deployment. Singular value decomposition (SVD) has emerged as an effective post-training compression…

Machine Learning · Computer Science 2026-05-12 Hengyi Zhu, Zhendong Mi, Grace Li Zhang, Shaoyi Huang

Distribution-Aware Tensor Decomposition for Compression of Convolutional Neural Networks

Neural networks are widely used for image-related tasks but typically demand considerable computing power. Once a network has been trained, however, its memory- and compute-footprint can be reduced by compression. In this work, we focus on…

Machine Learning · Computer Science 2025-11-13 Alper Kalle, Theo Rudkiewicz, Mohamed-Oumar Ouerfelli, Mohamed Tamaazousti

Efficient Multi-Source Knowledge Transfer by Model Merging

While transfer learning is an effective strategy, it often overlooks the opportunity to leverage knowledge from numerous available models online. Addressing this multi-source transfer learning problem is a promising path to boost…

Machine Learning · Computer Science 2026-04-24 Marcin Osial, Bartosz Wójcik, Bartosz Zieliński, Sebastian Cygert

Efficient Rehearsal for Continual Learning in ASR via Singular Value Tuning

Continual Learning (CL) in Automatic Speech Recognition (ASR) suffers from catastrophic forgetting when adapting to new tasks, domains, or speakers. A common strategy to mitigate this is to store a subset of past data in memory for…

Audio and Speech Processing · Electrical Eng. & Systems 2026-02-06 Steven Vander Eeckt, Hugo Van hamme

Unified Framework for Pre-trained Neural Network Compression via Decomposition and Optimized Rank Selection

Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource constrained devices such as mobile phones and embedded systems. Compression algorithms have been…

Machine Learning · Computer Science 2025-09-23 Ali Aghababaei-Harandi, Massih-Reza Amini

Convolutional Neural Network Compression Based on Low-Rank Decomposition

Deep neural networks typically impose significant computational loads and memory consumption. Moreover, the large parameters pose constraints on deploying the model on edge devices such as embedded systems. Tensor decomposition offers a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Yaping He, Linhao Jiang, Di Wu

ARA: Adaptive Rank Allocation for Efficient Large Language Model SVD Compression

In the field of large language model (LLM) compression, singular value decomposition (SVD) is a widely studied and adopted low-rank decomposition technique. Since SVD operates exclusively on linear modules, and these modules in LLMs are…

Machine Learning · Computer Science 2025-10-23 Lin Xv, Jingsheng Gao, Xian Gao, Ting Liu, Yuzhuo Fu