Related papers: Efficient Neural Task Adaptation by Maximum Entrop…

Efficiently Tuned Parameters are Task Embeddings

Intermediate-task transfer can benefit a wide range of NLP tasks with properly selected source datasets. However, it is computationally infeasible to experiment with all intermediate transfer combinations, making choosing a useful source…

Computation and Language · Computer Science 2022-10-24 Wangchunshu Zhou , Canwen Xu , Julian McAuley

What to Pre-Train on? Efficient Intermediate Task Selection

Intermediate task fine-tuning has been shown to culminate in large transfer gains across many NLP tasks. With an abundance of candidate datasets as well as pre-trained language models, it has become infeasible to run the cross-product of…

Computation and Language · Computer Science 2021-09-13 Clifton Poth , Jonas Pfeiffer , Andreas Rücklé , Iryna Gurevych

Explicit Inductive Bias for Transfer Learning with Convolutional Networks

In inductive transfer learning, fine-tuning pre-trained convolutional networks substantially outperforms training from scratch. When using fine-tuning, the underlying assumption is that the pre-trained model extracts generic features, which…

Machine Learning · Computer Science 2018-06-07 Xuhong Li , Yves Grandvalet , Franck Davoine

Renofeation: A Simple Transfer Learning Method for Improved Adversarial Robustness

Fine-tuning through knowledge transfer from a pre-trained model on a large-scale dataset is a widely spread approach to effectively build models on small-scale datasets. In this work, we show that a recent adversarial attack designed for…

Machine Learning · Computer Science 2021-04-30 Ting-Wu Chin , Cha Zhang , Diana Marculescu

Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

Deep neural networks often exhibit poor performance on data that is unlikely under the train-time data distribution, for instance data affected by corruptions. Previous works demonstrate that test-time adaptation to data shift, for instance…

Machine Learning · Statistics 2021-06-30 Chaithanya Kumar Mummadi , Robin Hutmacher , Kilian Rambach , Evgeny Levinkov , Thomas Brox , Jan Hendrik Metzen

Self-Supervised Learning via Maximum Entropy Coding

A mainstream type of current self-supervised learning methods pursues a general-purpose representation that can be well transferred to downstream tasks, typically by optimizing on a given pretext task such as instance discrimination. In…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Xin Liu , Zhongdao Wang , Yali Li , Shengjin Wang

Transfer Learning Using Classification Layer Features of CNN

Although CNNs have gained the ability to transfer learned knowledge from source task to target task by virtue of large annotated datasets but consume huge processing time to fine-tune without GPU. In this paper, we propose a new…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Tasfia Shermin , Manzur Murshed , Guojun Lu , Shyh Wei Teng

Transfer Reinforcement Learning across Homotopy Classes

The ability for robots to transfer their learned knowledge to new tasks -- where data is scarce -- is a fundamental challenge for successful robot learning. While fine-tuning has been well-studied as a simple but effective transfer approach…

Robotics · Computer Science 2021-08-10 Zhangjie Cao , Minae Kwon , Dorsa Sadigh

Less is More: Selective Layer Finetuning with SubTuning

Finetuning a pretrained model has become a standard approach for training neural networks on novel tasks, resulting in fast convergence and improved performance. In this work, we study an alternative finetuning method, where instead of…

Machine Learning · Computer Science 2023-07-04 Gal Kaplun , Andrey Gurevich , Tal Swisa , Mazor David , Shai Shalev-Shwartz , Eran Malach

Learn Faster and Forget Slower via Fast and Stable Task Adaptation

Training Deep Neural Networks (DNNs) is still highly time-consuming and compute-intensive. It has been shown that adapting a pretrained model may significantly accelerate this process. With a focus on classification, we show that current…

Neural and Evolutionary Computing · Computer Science 2020-12-01 Farshid Varno , Lucas May Petry , Lisa Di Jorio , Stan Matwin

Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning

Intermediate task transfer learning can greatly improve model performance. If, for example, one has little training data for emotion detection, first fine-tuning a language model on a sentiment classification dataset may improve performance…

Computation and Language · Computer Science 2024-10-22 David Schulte , Felix Hamborg , Alan Akbik

Generalization in Transfer Learning

Agents trained with deep reinforcement learning algorithms are capable of performing highly complex tasks including locomotion in continuous environments. We investigate transferring the learning acquired in one task to a set of previously…

Machine Learning · Computer Science 2024-03-06 Suzan Ece Ada , Emre Ugur , H. Levent Akin

Attentive Multi-Task Deep Reinforcement Learning

Sharing knowledge between tasks is vital for efficient learning in a multi-task setting. However, most research so far has focused on the easier case where knowledge transfer is not harmful, i.e., where knowledge from one task cannot…

Machine Learning · Computer Science 2019-07-08 Timo Bram , Gino Brunner , Oliver Richter , Roger Wattenhofer

Incremental Learning with Maximum Entropy Regularization: Rethinking Forgetting and Intransigence

Incremental learning suffers from two challenging problems; forgetting of old knowledge and intransigence on learning new knowledge. Prediction by the model incrementally learned with a subset of the dataset are thus uncertain and the…

Machine Learning · Computer Science 2019-02-05 Dahyun Kim , Jihwan Bae , Yeonsik Jo , Jonghyun Choi

Transfer Learning for Finetuning Large Language Models

As the landscape of large language models expands, efficiently finetuning for specific tasks becomes increasingly crucial. At the same time, the landscape of parameter-efficient finetuning methods rapidly expands. Consequently,…

Computation and Language · Computer Science 2024-11-05 Tobias Strangmann , Lennart Purucker , Jörg K. H. Franke , Ivo Rapant , Fabio Ferreira , Frank Hutter

Enhanced Transfer Learning with ImageNet Trained Classification Layer

Parameter fine tuning is a transfer learning approach whereby learned parameters from pre-trained source network are transferred to the target network followed by fine-tuning. Prior research has shown that this approach is capable of…

Computer Vision and Pattern Recognition · Computer Science 2019-09-20 Tasfia Shermin , Shyh Wei Teng , Manzur Murshed , Guojun Lu , Ferdous Sohel , Manoranjan Paul

Interleaved Multitask Learning with Energy Modulated Learning Progress

As humans learn new skills and apply their existing knowledge while maintaining previously learned information, "continual learning" in machine learning aims to incorporate new data while retaining and utilizing past knowledge. However,…

Robotics · Computer Science 2025-07-29 Hanne Say , Suzan Ece Ada , Emre Ugur , Minoru Asada , Erhan Oztop

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Most uses of machine learning today involve training a model from scratch for a particular task, or sometimes starting with a model pretrained on a related task and then fine-tuning on a downstream task. Both approaches offer limited…

Machine Learning · Computer Science 2022-05-26 Andrea Gesmundo , Jeff Dean

Optimizing Information Loss Towards Robust Neural Networks

Neural Networks (NNs) are vulnerable to adversarial examples. Such inputs differ only slightly from their benign counterparts yet provoke misclassifications of the attacked NNs. The required perturbations to craft the examples are often…

Cryptography and Security · Computer Science 2020-09-30 Philip Sperl , Konstantin Böttinger

NoisyNN: Exploring the Impact of Information Entropy Change in Learning Systems

We investigate the impact of entropy change in deep learning systems by noise injection at different levels, including the embedding space and the image. The series of models that employ our methodology are collectively known as Noisy…

Artificial Intelligence · Computer Science 2025-09-09 Xiaowei Yu , Zhe Huang , Minheng Chen , Lu Zhang , Tianming Liu , Dajiang Zhu