Related papers: Guided Transfer Learning

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models

Recent research has shown the existence of significant redundancy in large Transformer models. One can prune the redundant parameters without significantly sacrificing the generalization performance. However, we question whether the…

Computation and Language · Computer Science 2022-02-15 Chen Liang, Haoming Jiang, Simiao Zuo, Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen, Tuo Zhao

Transfer Learning for Structured Pruning under Limited Task Data

Large, pre-trained models are problematic to use in resource constrained applications. Fortunately, task-aware structured pruning methods offer a solution. These approaches reduce model size by dropping structural units like layers and…

Computation and Language · Computer Science 2023-11-14 Lucio Dery, David Grangier, Awni Hannun

Generalization Performance of Transfer Learning: Overparameterized and Underparameterized Regimes

Transfer learning is a useful technique for achieving improved performance and reducing training costs by leveraging the knowledge gained from source tasks and applying it to target tasks. Assessing the effectiveness of transfer learning…

Machine Learning · Computer Science 2023-06-12 Peizhong Ju, Sen Lin, Mark S. Squillante, Yingbin Liang, Ness B. Shroff

Limits of Transfer Learning

Transfer learning involves taking information and insight from one problem domain and applying it to a new problem domain. Although widely used in practice, theory for transfer learning remains less well-developed. To address this, we prove…

Machine Learning · Statistics 2020-06-24 Jake Williams, Abel Tadesse, Tyler Sam, Huey Sun, George D. Montanez

Transfer Learning for Performance Modeling of Configurable Systems: An Exploratory Analysis

Modern software systems provide many configuration options which significantly influence their non-functional properties. To understand and predict the effect of configuration options, several sampling and learning strategies have been…

Machine Learning · Statistics 2017-09-08 Pooyan Jamshidi, Norbert Siegmund, Miguel Velez, Christian Kästner, Akshay Patel, Yuvraj Agarwal

Learning Bound for Parameter Transfer Learning

We consider a transfer-learning problem by using the parameter transfer approach, where a suitable parameter of feature mapping is learned through one task and applied to another objective task. Then, we introduce the notion of the local…

Machine Learning · Statistics 2017-01-19 Wataru Kumagai

Transfer Learning for Algorithm Recommendation

Meta-Learning is a subarea of Machine Learning that aims to take advantage of prior knowledge to learn faster and with fewer data [1]. There are different scenarios where meta-learning can be applied, and one of the most common is algorithm…

Machine Learning · Computer Science 2019-10-17 Gean Trindade Pereira, Moisés dos Santos, Edesio Alcobaça, Rafael Mantovani, André Carvalho

These Are Not All the Features You Are Looking For: A Fundamental Bottleneck in Supervised Pretraining

Transfer learning is widely used to adapt large pretrained models to new tasks with only a small amount of new data. However, a challenge persists -- the features from the original task often do not fully cover what is needed for unseen…

Machine Learning · Computer Science 2026-02-10 Xingyu Alice Yang, Jianyu Zhang, Léon Bottou

K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning

We introduce a novel method that enables parameter-efficient transfer and multi-task learning with deep neural networks. The basic approach is to learn a model patch - a small set of parameters - that will specialize to each task, instead…

Machine Learning · Computer Science 2019-02-26 Pramod Kaushik Mudrakarta, Mark Sandler, Andrey Zhmoginov, Andrew Howard

Memory-based Parameter Adaptation

Deep neural networks have excelled on a wide range of problems, from vision to language and game playing. Neural networks very gradually incorporate information into weights as they process data, requiring very low learning rates. If the…

Machine Learning · Statistics 2018-03-01 Pablo Sprechmann, Siddhant M. Jayakumar, Jack W. Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Benigno Uria, Oriol Vinyals, Demis Hassabis, Razvan Pascanu, Charles Blundell

Transfer learning under latent space model

Latent space model plays a crucial role in network analysis, and accurate estimation of latent variables is essential for downstream tasks such as link prediction. However, the large number of parameters to be estimated presents a…

Methodology · Statistics 2025-09-22 Kuangnan Fang, Ruixuan Qin, Xinyan Fan

Transferring Learning Trajectories of Neural Networks

Training deep neural networks (DNNs) is computationally expensive, which is problematic especially when performing duplicated or similar training runs in model ensemble or fine-tuning pre-trained models, for example. Once we have trained…

Machine Learning · Computer Science 2023-10-04 Daiki Chijiwa

A Concise Review of Transfer Learning

The availability of abundant labeled data in recent years led the researchers to introduce a methodology called transfer learning, which utilizes existing data in situations where there are difficulties in collecting new annotated data.…

Machine Learning · Computer Science 2021-04-07 Abolfazl Farahani, Behrouz Pourshojae, Khaled Rasheed, Hamid R. Arabnia

Flexible Multi-task Networks by Learning Parameter Allocation

This paper proposes a novel learning method for multi-task applications. Multi-task neural networks can learn to transfer knowledge across different tasks by using parameter sharing. However, sharing parameters between unrelated tasks can…

Machine Learning · Computer Science 2020-07-21 Krzysztof Maziarz, Efi Kokiopoulou, Andrea Gesmundo, Luciano Sbaiz, Gabor Bartok, Jesse Berent

Transfer Learning of Linear Regression with Multiple Pretrained Models: Benefiting from More Pretrained Models via Overparameterization Debiasing

We study transfer learning for a linear regression task using several least-squares pretrained models that can be overparameterized. We formulate the target learning task as optimization that minimizes squared errors on the target dataset…

Machine Learning · Computer Science 2026-02-19 Daniel Boharon, Yehuda Dar

Predicting Parameters in Deep Learning

We demonstrate that there is significant redundancy in the parameterization of several deep learning models. Given only a few weight values for each feature it is possible to accurately predict the remaining values. Moreover, we show that…

Machine Learning · Computer Science 2014-10-28 Misha Denil, Babak Shakibi, Laurent Dinh, Marc'Aurelio Ranzato, Nando de Freitas

The Fast and the Flexible: training neural networks to learn to follow instructions from small data

Learning to follow human instructions is a long-pursued goal in artificial intelligence. The task becomes particularly challenging if no prior knowledge of the employed language is assumed while relying only on a handful of examples to…

Computation and Language · Computer Science 2019-04-03 Rezka Leonandya, Elia Bruni, Dieuwke Hupkes, Germán Kruszewski

Adaptive Transfer Learning on Graph Neural Networks

Graph neural networks (GNNs) are widely used to learn a powerful representation of graph-structured data. Recent work demonstrates that transferring knowledge from self-supervised tasks to downstream tasks could further improve graph…

Machine Learning · Computer Science 2021-07-21 Xueting Han, Zhenhuan Huang, Bang An, Jing Bai

Network Parameter Learning Using Nonlinear Transforms, Local Representation Goals and Local Propagation Constraints

In this paper, we introduce a novel concept for learning of the parameters in a neural network. Our idea is grounded on modeling a learning problem that addresses a trade-off between (i) satisfying local objectives at each node and (ii)…

Machine Learning · Computer Science 2019-02-04 Dimche Kostadinov, Behrooz Razdehi, Slava Voloshynovskiy

Domain Adaptive Transfer Learning with Specialist Models

Transfer learning is a widely used method to build high performing computer vision models. In this paper, we study the efficacy of transfer learning by examining how the choice of data impacts performance. We find that more pre-training…

Computer Vision and Pattern Recognition · Computer Science 2018-12-13 Jiquan Ngiam, Daiyi Peng, Vijay Vasudevan, Simon Kornblith, Quoc V. Le, Ruoming Pang