English
Related papers

Related papers: Optimizing Singular Spectrum for Large Language Mo…

200 papers

Continual learning in large language models (LLMs) is prone to catastrophic forgetting, where adapting to new tasks significantly degrades performance on previously learned ones. Existing methods typically rely on low-rank,…

Singular value decomposition (SVD) is one of the most popular compression methods that approximate a target matrix with smaller matrices. However, standard SVD treats the parameters within the matrix with equal importance, which is a simple…

Computation and Language · Computer Science 2022-12-19 Ting Hua , Yen-Chang Hsu , Felicity Wang , Qian Lou , Yilin Shen , Hongxia Jin

The advancements in Large Language Models (LLMs) have been hindered by their substantial sizes, which necessitates LLM compression methods for practical deployment. Singular Value Decomposition (SVD) offers a promising solution for LLM…

Computation and Language · Computer Science 2025-03-18 Xin Wang , Yu Zheng , Zhongwei Wan , Mi Zhang

Large Language Models (LLMs) have achieved remarkable breakthroughs. However, the huge number of parameters in LLMs require significant amount of memory storage in inference, which prevents their practical deployment in many applications.…

Computation and Language · Computer Science 2024-10-08 Jingcun Wang , Yu-Guang Chen , Ing-Chao Lin , Bing Li , Grace Li Zhang

Despite significant advancements, the practical deployment of Large Language Models (LLMs) is often hampered by their immense sizes, highlighting the need for effective compression techniques. Singular Value Decomposition (SVD) is a…

Computation and Language · Computer Science 2025-03-18 Xin Wang , Samiul Alam , Zhongwei Wan , Hui Shen , Mi Zhang

Large language models (LLMs) have achieved remarkable success in natural language processing (NLP) tasks, yet their substantial memory requirements present significant challenges for deployment on resource-constrained devices. Singular…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Zhiteng Li , Mingyuan Xia , Jingyuan Zhang , Zheng Hui , Haotong Qin , Linghe Kong , Yulun Zhang , Xiaokang Yang

In the field of large language model (LLM) compression, singular value decomposition (SVD) is a widely studied and adopted low-rank decomposition technique. Since SVD operates exclusively on linear modules, and these modules in LLMs are…

Machine Learning · Computer Science 2025-10-23 Lin Xv , Jingsheng Gao , Xian Gao , Ting Liu , Yuzhuo Fu

Large language models (LLMs) have demonstrated impressive capabilities in a wide range of downstream natural language processing tasks. Nevertheless, their considerable sizes and memory demands hinder practical deployment, underscoring the…

Computation and Language · Computer Science 2026-03-17 Haolei Bai , Siyong Jian , Tuo Liang , Yu Yin , Huan Wang

Large Language Models (LLMs) present significant deployment challenges due to their immense size and computational requirements. Model compression techniques are essential for making these models practical for resource-constrained…

Post-training fundamentally alters the behavior of large language models (LLMs), yet its impact on the internal parameter space remains poorly understood. In this work, we conduct a systematic singular value decomposition (SVD) analysis of…

Machine Learning · Computer Science 2026-01-30 Xinyu He , Xianghui Cao

In this paper, we propose a new sampling strategy for hyperspectral signals that is based on dictionary learning and singular value decomposition (SVD). Specifically, we first learn a sparsifying dictionary from training spectral data using…

Computer Vision and Pattern Recognition · Computer Science 2015-12-04 Mingrui Yang , Frank de Hoog , Yuqi Fan , Wen Hu

Large language models (LLMs) excel in general tasks but struggle with domain-specific ones, requiring fine-tuning with specific data. With many open-source LLMs available, selecting the best model for fine-tuning downstream tasks is…

Computation and Language · Computer Science 2025-09-05 Wei Huang , Huang Wei , Yinggui Wang

Approaches for compressing large-language models using low-rank decomposition have made strides, particularly with the introduction of activation and loss-aware SVD, which improves the trade-off between decomposition rank and downstream…

Machine Learning · Computer Science 2025-12-17 Sidhant Sundrani , Francesco Tudisco , Pasquale Minervini

Factorizing a large matrix into small matrices is a popular strategy for model compression. Singular value decomposition (SVD) plays a vital role in this compression strategy, approximating a learned matrix with fewer parameters. However,…

Machine Learning · Computer Science 2022-07-04 Yen-Chang Hsu , Ting Hua , Sungen Chang , Qian Lou , Yilin Shen , Hongxia Jin

Low-rank decomposition, particularly Singular Value Decomposition (SVD), is a pivotal technique for mitigating the storage and computational demands of Large Language Models (LLMs). However, prevalent SVD-based approaches overlook the…

Machine Learning · Computer Science 2026-01-15 Lin Xv , Xian Gao , Ting Li , Yuzhuo Fu

Advances in large language models have driven strong performance across many tasks, but their memory and compute costs still hinder deployment. SVD-based compression reduces storage and can speed up inference via low-rank factors, yet…

Machine Learning · Computer Science 2026-02-04 Ali Abbasi , Chayne Thrash , Haoran Qin , Shansita Sharma , Sepehr Seifi , Soheil Kolouri

Low-rank gradient-based optimization methods have significantly improved memory efficiency during the training of large language models (LLMs), enabling operations within constrained hardware without sacrificing performance. However, these…

Machine Learning · Computer Science 2025-10-28 Yehonathan Refael , Guy Smorodinsky , Tom Tirer , Ofir Lindenbaum

Large pre-trained models (LPMs) have demonstrated exceptional performance in diverse natural language processing and computer vision tasks. However, fully fine-tuning these models poses substantial memory challenges, particularly in…

Machine Learning · Computer Science 2024-09-12 Chengwei Sun , Jiwei Wei , Yujia Wu , Yiming Shi , Shiyuan He , Zeyu Ma , Ning Xie , Yang Yang

The singular value decomposition (SVD) is not only a classical theory in matrix computation and analysis, but also is a powerful tool in machine learning and modern data analysis. In this tutorial we first study the basic notion of SVD and…

Machine Learning · Computer Science 2015-10-30 Zhihua Zhang

We propose a compression based continual task learning method that can dynamically grow a neural network. Inspired from the recent model compression techniques, we employ compression-aware training and perform low-rank weight approximations…

Computer Vision and Pattern Recognition · Computer Science 2020-09-16 Varigonda Pavan Teja , Priyadarshini Panda
‹ Prev 1 2 3 10 Next ›