Related papers: Transfer Learning for Structured Pruning under Lim…

Structured Pruning for Multi-Task Deep Neural Networks

Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods are already…

Machine Learning · Computer Science 2023-04-17 Siddhant Garg , Lijun Zhang , Hui Guan

A Second-Order Perspective on Pruning at Initialization and Knowledge Transfer

The widespread availability of pre-trained vision models has enabled numerous deep learning applications through their transferable representations. However, their computational and storage costs often limit practical deployment.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Leonardo Iurada , Beatrice Occhiena , Tatiana Tommasi

Data-Efficient Structured Pruning via Submodular Optimization

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi , Suraj Srinivas , Simon Lacoste-Julien

Differentiable Architecture Pruning for Transfer Learning

We propose a new gradient-based approach for extracting sub-architectures from a given large model. Contrarily to existing pruning methods, which are unable to disentangle the network architecture and the corresponding weights, our…

Machine Learning · Computer Science 2021-07-08 Nicolo Colombo , Yang Gao

Exploring the Efficacy of Transfer Learning in Mining Image-Based Software Artifacts

Transfer learning allows us to train deep architectures requiring a large number of learned parameters, even if the amount of available data is limited, by leveraging existing models previously trained for another task. Here we explore the…

Software Engineering · Computer Science 2020-03-04 Natalie Best , Jordan Ott , Erik Linstead

Parameter-Efficient Transfer Learning with Diff Pruning

While task-specific finetuning of pretrained networks has led to significant empirical advances in NLP, the large size of networks makes finetuning difficult to deploy in multi-task, memory-constrained settings. We propose diff pruning as a…

Computation and Language · Computer Science 2021-06-10 Demi Guo , Alexander M. Rush , Yoon Kim

TransTailor: Pruning the Pre-trained Model for Improved Transfer Learning

The increasing of pre-trained models has significantly facilitated the performance on limited data tasks with transfer learning. However, progress on transfer learning mainly focuses on optimizing the weights of pre-trained models, which…

Computer Vision and Pattern Recognition · Computer Science 2021-03-03 Bingyan Liu , Yifeng Cai , Yao Guo , Xiangqun Chen

Can pruning make Large Language Models more efficient?

Transformer models have revolutionized natural language processing with their unparalleled ability to grasp complex contextual relationships. However, the vast number of parameters in these models has raised concerns regarding computational…

Machine Learning · Computer Science 2023-10-10 Sia Gholami , Marwan Omar

These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining

Transfer learning is widely used to adapt large pretrained models to new tasks with only a small amount of new data. However, a challenge persists -- the features from the original task often do not fully cover what is needed for unseen…

Machine Learning · Computer Science 2026-02-10 Xingyu Alice Yang , Jianyu Zhang , Léon Bottou

A Comparative Study of Pruning Methods in Transformer-based Time Series Forecasting

The current landscape in time-series forecasting is dominated by Transformer-based models. Their high parameter count and corresponding demand in computational resources pose a challenge to real-world deployment, especially for commercial…

Machine Learning · Computer Science 2024-12-18 Nicholas Kiefer , Arvid Weyrauch , Muhammed Öz , Achim Streit , Markus Götz , Charlotte Debus

Multi-Task and Transfer Learning for Federated Learning Applications

Federated learning enables many applications benefiting distributed and private datasets of a large number of potential data-holding clients. However, different clients usually have their own particular objectives in terms of the tasks to…

Machine Learning · Computer Science 2022-07-19 Cihat Keçeci , Mohammad Shaqfeh , Hayat Mbayed , Erchin Serpedin

A Meta-Learning Approach for Custom Model Training

Transfer-learning and meta-learning are two effective methods to apply knowledge learned from large data sources to new tasks. In few-class, few-shot target task settings (i.e. when there are only a few classes and training examples…

Machine Learning · Computer Science 2019-02-11 Amir Erfan Eshratifar , Mohammad Saeed Abrishami , David Eigen , Massoud Pedram

Task-oriented Memory-efficient Pruning-Adapter

The Outstanding performance and growing size of Large Language Models has led to increased attention in parameter efficient learning. The two predominant approaches are Adapters and Pruning. Adapters are to freeze the model and give it a…

Computation and Language · Computer Science 2023-04-07 Guorun Wang , Jun Yang , Yaoru Sun

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Most uses of machine learning today involve training a model from scratch for a particular task, or sometimes starting with a model pretrained on a related task and then fine-tuning on a downstream task. Both approaches offer limited…

Machine Learning · Computer Science 2022-05-26 Andrea Gesmundo , Jeff Dean

Frustratingly Easy Task-aware Pruning for Large Language Models

Pruning provides a practical solution to reduce the resources required to run large language models (LLMs) to benefit from their effective capabilities as well as control their cost for training and inference. Research on LLM pruning often…

Computation and Language · Computer Science 2025-10-28 Yuanhe Tian , Junjie Liu , Xican Yang , Haishan Ye , Yan Song

Pruning Pretrained Encoders with a Multitask Objective

The sizes of pretrained language models make them challenging and expensive to use when there are multiple desired downstream tasks. In this work, we adopt recent strategies for model pruning during finetuning to explore the question of…

Computation and Language · Computer Science 2021-12-13 Patrick Xia , Richard Shin

Scalable Transfer Learning with Expert Models

Transfer of pre-trained representations can improve sample efficiency and reduce computational requirements for new tasks. However, representations used for transfer are usually generic, and are not tailored to a particular distribution of…

Machine Learning · Computer Science 2020-09-29 Joan Puigcerver , Carlos Riquelme , Basil Mustafa , Cedric Renggli , André Susano Pinto , Sylvain Gelly , Daniel Keysers , Neil Houlsby

How Well Do Sparse Imagenet Models Transfer?

Transfer learning is a classic paradigm by which models pretrained on large "upstream" datasets are adapted to yield good results on "downstream" specialized datasets. Generally, more accurate models on the "upstream" dataset tend to…

Computer Vision and Pattern Recognition · Computer Science 2022-04-22 Eugenia Iofinova , Alexandra Peste , Mark Kurtz , Dan Alistarh

Transfer Learning for Finetuning Large Language Models

As the landscape of large language models expands, efficiently finetuning for specific tasks becomes increasingly crucial. At the same time, the landscape of parameter-efficient finetuning methods rapidly expands. Consequently,…

Computation and Language · Computer Science 2024-11-05 Tobias Strangmann , Lennart Purucker , Jörg K. H. Franke , Ivo Rapant , Fabio Ferreira , Frank Hutter

Data Fusion of Deep Learned Molecular Embeddings for Property Prediction

Data-driven approaches such as deep learning can result in predictive models for material properties with exceptional accuracy and efficiency. However, in many applications, data is sparse, severely limiting their accuracy and…

Machine Learning · Computer Science 2025-10-29 Robert J Appleton , Brian C Barnes , Alejandro Strachan