Learning Fine-grained Parameter Sharing via Sparse Tensor Decomposition

Cem Üyük; Mike Lasby; Mohamed Yassin; Utku Evci; Yani Ioannou

Learning Fine-grained Parameter Sharing via Sparse Tensor Decomposition

Machine Learning 2026-05-26 v4

Authors: Cem Üyük , Mike Lasby , Mohamed Yassin , Utku Evci , Yani Ioannou

Abstract

Large neural networks achieve state-of-the-art performance on many tasks, yet their sheer size hinders deployment on resource-constrained devices. Among existing compression approaches, cross-layer parameter sharing remains relatively unexplored for transformer models. In this paper, we introduce Fine-grained Parameter Sharing (FiPS), a unified framework for compressing transformer Multi-Layer Perceptrons (MLPs) that combines cross-block parameter sharing, low-rank factorization, and sparsity in a single optimization. FiPS concatenates MLP weight matrices across a group of transformer blocks and factorizes them into a shared basis and sparse, layer-specific projection matrices. Both factors are initialized via singular value decomposition (SVD) and jointly optimized by block-wise reconstruction error minimization. FiPS compresses Vision Transformers (ViTs) by up to 33% with less than 1% top-1 accuracy loss on ImageNet-1k, and by up to 57% when combined with fine-tuning. It also compresses Large Language Models (LLMs) by up to 20% while outperforming existing SVD-based methods in perplexity and downstream benchmarks at matched compression. Combined with Quantization-Aware Training (QAT), 3-bit FiPS on Gemma-2-2B achieves lower perplexity than 2-bit QAT alone while matching the same 8x compression. These results establish fine-grained parameter sharing as a practical and effective approach for transformer MLP compression.

Keywords

sparse learning model compression image compression

Cite

@article{arxiv.2411.09816,
  title  = {Learning Fine-grained Parameter Sharing via Sparse Tensor Decomposition},
  author = {Cem Üyük and Mike Lasby and Mohamed Yassin and Utku Evci and Yani Ioannou},
  journal= {arXiv preprint arXiv:2411.09816},
  year   = {2026}
}

Comments

Accepted as is to Transactions on Machine Learning Research (TMLR), 2026. OpenReview: https://openreview.net/forum?id=vbS7Z8Zswe

Learning Fine-grained Parameter Sharing via Sparse Tensor Decomposition

Abstract

Keywords

Cite

Comments

Related papers