Related papers: Learning Fine-grained Parameter Sharing via Sparse…

Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction

Although scaling laws and many empirical results suggest that increasing the size of Vision Transformers often improves performance, model accuracy and training behavior are not always monotonically increasing with scale. Focusing on…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Anantha Padmanaban Krishna Kumar

DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, most existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2024-12-03 Yongqi Zhai, Yi Ma, Luyang Tang, Wei Jiang, Ronggang Wang

Multiscale Tensor Summation Factorization as a New Neural Network Layer (MTS Layer) for Multidimensional Data Processing

Multilayer perceptrons (MLP), or fully connected artificial neural networks, are known for performing vector-matrix multiplications using learnable weight matrices; however, their practical application in many machine learning tasks,…

Machine Learning · Computer Science 2025-04-22 Mehmet Yamaç, Muhammad Numan Yousaf, Serkan Kiranyaz, Moncef Gabbouj

Sequential Compression Layers for Efficient Federated Learning in Foundational Models

Federated Learning (FL) has gained popularity for fine-tuning large language models (LLMs) across multiple nodes, each with its own private data. While LoRA has been widely adopted for parameter efficient federated fine-tuning, recent…

Machine Learning · Computer Science 2025-03-11 Navyansh Mahla, Sunny Gupta, Amit Sethi

DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression

Scalable coding, which can adapt to channel bandwidth variation, performs well in today's complex network environment. However, the existing scalable compression methods face two challenges: reduced compression performance and insufficient…

Image and Video Processing · Electrical Eng. & Systems 2022-01-05 Yi Ma, Yongqi Zhai, Ronggang Wang

Decompose, Mix, Adapt: A Unified Framework for Parameter-Efficient Neural Network Recombination and Compression

Parameter Recombination (PR) methods aim to efficiently compose the weights of a neural network for applications like Parameter-Efficient FineTuning (PEFT) and Model Compression (MC), among others. Most methods typically focus on one…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Nazia Tasnim, Shrimai Prabhumoye, Bryan A. Plummer

Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference

Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-trained Vision Transformer (ViT) models to downstream applications by updating only a small subset of parameters. While current PEFT methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-12-19 Ting Liu, Xuyang Liu, Liangtao Shi, Zunnan Xu, Yue Hu, Siteng Huang, Yi Xin, Bineng Zhong, Donglin Wang

Understanding Parameter Sharing in Transformers

Parameter sharing has proven to be a parameter-efficient approach. Previous work on Transformers has focused on sharing parameters in different layers, which can improve the performance of models with limited parameters by increasing model…

Machine Learning · Computer Science 2023-06-19 Ye Lin, Mingxuan Wang, Zhexi Zhang, Xiaohui Wang, Tong Xiao, Jingbo Zhu

SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers

The performance of Transformer models has been enhanced by increasing the number of parameters and the length of the processed text. Consequently, fine-tuning the entire model becomes a memory-intensive process. High-performance methods for…

Computation and Language · Computer Science 2024-10-11 Viktoriia Chekalina, Anna Rudenko, Gleb Mezentsev, Alexander Mikhalev, Alexander Panchenko, Ivan Oseledets

Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression

Large Language Models (LLMs) have achieved remarkable breakthroughs. However, the huge number of parameters in LLMs requires a significant amount of memory storage in inference, which prevents their practical deployment in many applications.…

Computation and Language · Computer Science 2024-10-08 Jingcun Wang, Yu-Guang Chen, Ing-Chao Lin, Bing Li, Grace Li Zhang

Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer

Foundation models and their checkpoints have significantly advanced deep learning, boosting performance across various applications. However, fine-tuned models often struggle outside their specific domains and exhibit considerable…

Machine Learning · Computer Science 2025-05-27 Guodong Du, Zitao Fang, Jing Li, Junlin Li, Runhua Jiang, Shuyang Yu, Yifei Guo, Yangneng Chen, Sim Kuan Goh, Ho-Kin Tang, Daojing He, Honghai Liu, Min Zhang

MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter

Parameter-Efficient Fine-tuning (PEFT) facilitates the fine-tuning of Large Language Models (LLMs) under limited resources. However, the fine-tuning performance with PEFT on complex, knowledge-intensive tasks is limited due to the…

Computation and Language · Computer Science 2024-06-10 Jitai Hao, WeiWei Sun, Xin Xin, Qi Meng, Zhumin Chen, Pengjie Ren, Zhaochun Ren

Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders

Multilayer perceptrons (MLPs) are an integral part of large language models, yet their dense representations render them difficult to understand, edit, and steer. Recent methods learn interpretable approximations via neuron-level sparsity,…

Machine Learning · Computer Science 2026-01-15 James Oldfield, Shawn Im, Sharon Li, Mihalis A. Nicolaou, Ioannis Patras, Grigorios G Chrysos

Group-Connected Multilayer Perceptron Networks

Despite the success of deep learning in domains such as image, voice, and graphs, there has been little progress in deep representation learning for domains without a known structure between features. For instance, a tabular dataset of…

Machine Learning · Computer Science 2020-11-26 Mohammad Kachuee, Sajad Darabi, Shayan Fazeli, Majid Sarrafzadeh

Merging Feed-Forward Sublayers for Compressed Transformers

With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques is growing in order to deploy these models widely. The sheer parameter count of these models makes it difficult to fit them into the…

Computation and Language · Computer Science 2025-04-01 Neha Verma, Kenton Murray, Kevin Duh

Optimizing Specific and Shared Parameters for Efficient Parameter Tuning

Foundation models, with a vast number of parameters and pretraining on massive datasets, achieve state-of-the-art performance across various applications. However, efficiently adapting them to downstream tasks with minimal computational…

Machine Learning · Computer Science 2025-04-07 Van-Anh Nguyen, Thanh-Toan Do, Mehrtash Harandi, Dinh Phung, Trung Le

Symbiosis: Multi-Adapter Inference and Fine-Tuning

Parameter-efficient fine-tuning (PEFT) allows model builders to capture the task-specific parameters into adapters, which are a fraction of the size of the original base model. Popularity of PEFT technique for fine-tuning has led to the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-24 Saransh Gupta, Umesh Deshpande, Travis Janssen, Swami Sundararaman

FIPER: Factorized Features for Robust Image Super-Resolution and Compression

In this work, we propose using a unified representation, termed Factorized Features, for low-level vision tasks, where we test on Single Image Super-Resolution (SISR) and Image Compression. Motivated by the shared principles…

Image and Video Processing · Electrical Eng. & Systems 2025-11-04 Yang-Che Sun, Cheng Yu Yeo, Ernie Chu, Jun-Cheng Chen, Yu-Lun Liu

Fine-grained Prompt Tuning: A Parameter and Memory Efficient Transfer Learning Method for High-resolution Medical Image Classification

Parameter-efficient transfer learning (PETL) is proposed as a cost-effective way to transfer pre-trained models to downstream tasks, avoiding the high cost of updating entire large-scale pre-trained models (LPMs). In this work, we present…

Computer Vision and Pattern Recognition · Computer Science 2024-07-03 Yijin Huang, Pujin Cheng, Roger Tam, Xiaoying Tang

Enhanced Transfer Learning with ImageNet Trained Classification Layer

Parameter fine tuning is a transfer learning approach whereby learned parameters from pre-trained source network are transferred to the target network followed by fine-tuning. Prior research has shown that this approach is capable of…

Computer Vision and Pattern Recognition · Computer Science 2019-09-20 Tasfia Shermin, Shyh Wei Teng, Manzur Murshed, Guojun Lu, Ferdous Sohel, Manoranjan Paul