English
Related papers

Related papers: Guided Transfer Learning

200 papers

Recent research has shown the existence of significant redundancy in large Transformer models. One can prune the redundant parameters without significantly sacrificing the generalization performance. However, we question whether the…

Computation and Language · Computer Science 2022-02-15 Chen Liang , Haoming Jiang , Simiao Zuo , Pengcheng He , Xiaodong Liu , Jianfeng Gao , Weizhu Chen , Tuo Zhao

Large, pre-trained models are problematic to use in resource constrained applications. Fortunately, task-aware structured pruning methods offer a solution. These approaches reduce model size by dropping structural units like layers and…

Computation and Language · Computer Science 2023-11-14 Lucio Dery , David Grangier , Awni Hannun

Transfer learning is a useful technique for achieving improved performance and reducing training costs by leveraging the knowledge gained from source tasks and applying it to target tasks. Assessing the effectiveness of transfer learning…

Machine Learning · Computer Science 2023-06-12 Peizhong Ju , Sen Lin , Mark S. Squillante , Yingbin Liang , Ness B. Shroff

Transfer learning involves taking information and insight from one problem domain and applying it to a new problem domain. Although widely used in practice, theory for transfer learning remains less well-developed. To address this, we prove…

Machine Learning · Statistics 2020-06-24 Jake Williams , Abel Tadesse , Tyler Sam , Huey Sun , George D. Montanez

Modern software systems provide many configuration options which significantly influence their non-functional properties. To understand and predict the effect of configuration options, several sampling and learning strategies have been…

Machine Learning · Statistics 2017-09-08 Pooyan Jamshidi , Norbert Siegmund , Miguel Velez , Christian Kästner , Akshay Patel , Yuvraj Agarwal

We consider a transfer-learning problem by using the parameter transfer approach, where a suitable parameter of feature mapping is learned through one task and applied to another objective task. Then, we introduce the notion of the local…

Machine Learning · Statistics 2017-01-19 Wataru Kumagai

Meta-Learning is a subarea of Machine Learning that aims to take advantage of prior knowledge to learn faster and with fewer data [1]. There are different scenarios where meta-learning can be applied, and one of the most common is algorithm…

Machine Learning · Computer Science 2019-10-17 Gean Trindade Pereira , Moisés dos Santos , Edesio Alcobaça , Rafael Mantovani , André Carvalho

Transfer learning is widely used to adapt large pretrained models to new tasks with only a small amount of new data. However, a challenge persists -- the features from the original task often do not fully cover what is needed for unseen…

Machine Learning · Computer Science 2026-02-10 Xingyu Alice Yang , Jianyu Zhang , Léon Bottou

We introduce a novel method that enables parameter-efficient transfer and multi-task learning with deep neural networks. The basic approach is to learn a model patch - a small set of parameters - that will specialize to each task, instead…

Machine Learning · Computer Science 2019-02-26 Pramod Kaushik Mudrakarta , Mark Sandler , Andrey Zhmoginov , Andrew Howard

Deep neural networks have excelled on a wide range of problems, from vision to language and game playing. Neural networks very gradually incorporate information into weights as they process data, requiring very low learning rates. If the…

Latent space model plays a crucial role in network analysis, and accurate estimation of latent variables is essential for downstream tasks such as link prediction. However, the large number of parameters to be estimated presents a…

Methodology · Statistics 2025-09-22 Kuangnan Fang , Ruixuan Qin , Xinyan Fan

Training deep neural networks (DNNs) is computationally expensive, which is problematic especially when performing duplicated or similar training runs in model ensemble or fine-tuning pre-trained models, for example. Once we have trained…

Machine Learning · Computer Science 2023-10-04 Daiki Chijiwa

The availability of abundant labeled data in recent years led the researchers to introduce a methodology called transfer learning, which utilizes existing data in situations where there are difficulties in collecting new annotated data.…

Machine Learning · Computer Science 2021-04-07 Abolfazl Farahani , Behrouz Pourshojae , Khaled Rasheed , Hamid R. Arabnia

This paper proposes a novel learning method for multi-task applications. Multi-task neural networks can learn to transfer knowledge across different tasks by using parameter sharing. However, sharing parameters between unrelated tasks can…

Machine Learning · Computer Science 2020-07-21 Krzysztof Maziarz , Efi Kokiopoulou , Andrea Gesmundo , Luciano Sbaiz , Gabor Bartok , Jesse Berent

We study transfer learning for a linear regression task using several least-squares pretrained models that can be overparameterized. We formulate the target learning task as optimization that minimizes squared errors on the target dataset…

Machine Learning · Computer Science 2026-02-19 Daniel Boharon , Yehuda Dar

We demonstrate that there is significant redundancy in the parameterization of several deep learning models. Given only a few weight values for each feature it is possible to accurately predict the remaining values. Moreover, we show that…

Machine Learning · Computer Science 2014-10-28 Misha Denil , Babak Shakibi , Laurent Dinh , Marc'Aurelio Ranzato , Nando de Freitas

Learning to follow human instructions is a long-pursued goal in artificial intelligence. The task becomes particularly challenging if no prior knowledge of the employed language is assumed while relying only on a handful of examples to…

Computation and Language · Computer Science 2019-04-03 Rezka Leonandya , Elia Bruni , Dieuwke Hupkes , Germán Kruszewski

Graph neural networks (GNNs) is widely used to learn a powerful representation of graph-structured data. Recent work demonstrates that transferring knowledge from self-supervised tasks to downstream tasks could further improve graph…

Machine Learning · Computer Science 2021-07-21 Xueting Han , Zhenhuan Huang , Bang An , Jing Bai

In this paper, we introduce a novel concept for learning of the parameters in a neural network. Our idea is grounded on modeling a learning problem that addresses a trade-off between (i) satisfying local objectives at each node and (ii)…

Machine Learning · Computer Science 2019-02-04 Dimche Kostadinov , Behrooz Razdehi , Slava Voloshynovskiy

Transfer learning is a widely used method to build high performing computer vision models. In this paper, we study the efficacy of transfer learning by examining how the choice of data impacts performance. We find that more pre-training…

Computer Vision and Pattern Recognition · Computer Science 2018-12-13 Jiquan Ngiam , Daiyi Peng , Vijay Vasudevan , Simon Kornblith , Quoc V. Le , Ruoming Pang
‹ Prev 1 2 3 10 Next ›