English
Related papers

Related papers: Learning Fine-grained Parameter Sharing via Sparse…

200 papers

Although scaling laws and many empirical results suggest that increasing the size of Vision Transformers often improves performance, model accuracy and training behavior are not always monotonically increasing with scale. Focusing on…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Anantha Padmanaban Krishna Kumar

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, most existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2024-12-03 Yongqi Zhai , Yi Ma , Luyang Tang , Wei Jiang , Ronggang Wang

Multilayer perceptrons (MLP), or fully connected artificial neural networks, are known for performing vector-matrix multiplications using learnable weight matrices; however, their practical application in many machine learning tasks,…

Machine Learning · Computer Science 2025-04-22 Mehmet Yamaç , Muhammad Numan Yousaf , Serkan Kiranyaz , Moncef Gabbouj

Federated Learning (FL) has gained popularity for fine-tuning large language models (LLMs) across multiple nodes, each with its own private data. While LoRA has been widely adopted for parameter efficient federated fine-tuning, recent…

Machine Learning · Computer Science 2025-03-11 Navyansh Mahla , Sunny Gupta , Amit Sethi

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, the existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2022-01-05 Yi Ma , Yongqi Zhai , Ronggang Wang

Parameter Recombination (PR) methods aim to efficiently compose the weights of a neural network for applications like Parameter-Efficient FineTuning (PEFT) and Model Compression (MC), among others. Most methods typically focus on one…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Nazia Tasnim , Shrimai Prabhumoye , Bryan A. Plummer

Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-trained Vision Transformer (ViT) models to downstream applications by updating only a small subset of parameters. While current PEFT methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Ting Liu , Xuyang Liu , Liangtao Shi , Zunnan Xu , Yue Hu , Siteng Huang , Yi Xin , Bineng Zhong , Donglin Wang

Parameter sharing has proven to be a parameter-efficient approach. Previous work on Transformers has focused on sharing parameters in different layers, which can improve the performance of models with limited parameters by increasing model…

Machine Learning · Computer Science 2023-06-19 Ye Lin , Mingxuan Wang , Zhexi Zhang , Xiaohui Wang , Tong Xiao , Jingbo Zhu

The performance of Transformer models has been enhanced by increasing the number of parameters and the length of the processed text. Consequently, fine-tuning the entire model becomes a memory-intensive process. High-performance methods for…

Computation and Language · Computer Science 2024-10-11 Viktoriia Chekalina , Anna Rudenko , Gleb Mezentsev , Alexander Mikhalev , Alexander Panchenko , Ivan Oseledets

Large Language Models (LLMs) have achieved remarkable breakthroughs. However, the huge number of parameters in LLMs require significant amount of memory storage in inference, which prevents their practical deployment in many applications.…

Computation and Language · Computer Science 2024-10-08 Jingcun Wang , Yu-Guang Chen , Ing-Chao Lin , Bing Li , Grace Li Zhang

Foundation models and their checkpoints have significantly advanced deep learning, boosting performance across various applications. However, fine-tuned models often struggle outside their specific domains and exhibit considerable…

Parameter-Efficient Fine-tuning (PEFT) facilitates the fine-tuning of Large Language Models (LLMs) under limited resources. However, the fine-tuning performance with PEFT on complex, knowledge-intensive tasks is limited due to the…

Computation and Language · Computer Science 2024-06-10 Jitai Hao , WeiWei Sun , Xin Xin , Qi Meng , Zhumin Chen , Pengjie Ren , Zhaochun Ren

Multilayer perceptrons (MLPs) are an integral part of large language models, yet their dense representations render them difficult to understand, edit, and steer. Recent methods learn interpretable approximations via neuron-level sparsity,…

Machine Learning · Computer Science 2026-01-15 James Oldfield , Shawn Im , Sharon Li , Mihalis A. Nicolaou , Ioannis Patras , Grigorios G Chrysos

Despite the success of deep learning in domains such as image, voice, and graphs, there has been little progress in deep representation learning for domains without a known structure between features. For instance, a tabular dataset of…

Machine Learning · Computer Science 2020-11-26 Mohammad Kachuee , Sajad Darabi , Shayan Fazeli , Majid Sarrafzadeh

With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques is growing in order to deploy these models widely. The sheer parameter count of these models makes it difficult to fit them into the…

Computation and Language · Computer Science 2025-04-01 Neha Verma , Kenton Murray , Kevin Duh

Foundation models, with a vast number of parameters and pretraining on massive datasets, achieve state-of-the-art performance across various applications. However, efficiently adapting them to downstream tasks with minimal computational…

Machine Learning · Computer Science 2025-04-07 Van-Anh Nguyen , Thanh-Toan Do , Mehrtash Harandi , Dinh Phung , Trung Le

Parameter-efficient fine-tuning (PEFT) allows model builders to capture the task-specific parameters into adapters, which are a fraction of the size of the original base model. Popularity of PEFT technique for fine-tuning has led to the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-24 Saransh Gupta , Umesh Deshpande , Travis Janssen , Swami Sundararaman

In this work, we propose using a unified representation, termed Factorized Features, for low-level vision tasks, where we test on Single Image Super-Resolution (SISR) and \textbf{Image Compression}. Motivated by the shared principles…

Image and Video Processing · Electrical Eng. & Systems 2025-11-04 Yang-Che Sun , Cheng Yu Yeo , Ernie Chu , Jun-Cheng Chen , Yu-Lun Liu

Parameter-efficient transfer learning (PETL) is proposed as a cost-effective way to transfer pre-trained models to downstream tasks, avoiding the high cost of updating entire large-scale pre-trained models (LPMs). In this work, we present…

Computer Vision and Pattern Recognition · Computer Science 2024-07-03 Yijin Huang , Pujin Cheng , Roger Tam , Xiaoying Tang

Parameter fine tuning is a transfer learning approach whereby learned parameters from pre-trained source network are transferred to the target network followed by fine-tuning. Prior research has shown that this approach is capable of…

Computer Vision and Pattern Recognition · Computer Science 2019-09-20 Tasfia Shermin , Shyh Wei Teng , Manzur Murshed , Guojun Lu , Ferdous Sohel , Manoranjan Paul
‹ Prev 1 2 3 10 Next ›