English
Related papers

Related papers: Efficient Neural Task Adaptation by Maximum Entrop…

200 papers

Intermediate-task transfer can benefit a wide range of NLP tasks with properly selected source datasets. However, it is computationally infeasible to experiment with all intermediate transfer combinations, making choosing a useful source…

Computation and Language · Computer Science 2022-10-24 Wangchunshu Zhou , Canwen Xu , Julian McAuley

Intermediate task fine-tuning has been shown to culminate in large transfer gains across many NLP tasks. With an abundance of candidate datasets as well as pre-trained language models, it has become infeasible to run the cross-product of…

Computation and Language · Computer Science 2021-09-13 Clifton Poth , Jonas Pfeiffer , Andreas Rücklé , Iryna Gurevych

In inductive transfer learning, fine-tuning pre-trained convolutional networks substantially outperforms training from scratch. When using fine-tuning, the underlying assumption is that the pre-trained model extracts generic features, which…

Machine Learning · Computer Science 2018-06-07 Xuhong Li , Yves Grandvalet , Franck Davoine

Fine-tuning through knowledge transfer from a pre-trained model on a large-scale dataset is a widely spread approach to effectively build models on small-scale datasets. In this work, we show that a recent adversarial attack designed for…

Machine Learning · Computer Science 2021-04-30 Ting-Wu Chin , Cha Zhang , Diana Marculescu

Deep neural networks often exhibit poor performance on data that is unlikely under the train-time data distribution, for instance data affected by corruptions. Previous works demonstrate that test-time adaptation to data shift, for instance…

A mainstream type of current self-supervised learning methods pursues a general-purpose representation that can be well transferred to downstream tasks, typically by optimizing on a given pretext task such as instance discrimination. In…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Xin Liu , Zhongdao Wang , Yali Li , Shengjin Wang

Although CNNs have gained the ability to transfer learned knowledge from source task to target task by virtue of large annotated datasets but consume huge processing time to fine-tune without GPU. In this paper, we propose a new…

Computer Vision and Pattern Recognition · Computer Science 2019-03-28 Tasfia Shermin , Manzur Murshed , Guojun Lu , Shyh Wei Teng

The ability for robots to transfer their learned knowledge to new tasks -- where data is scarce -- is a fundamental challenge for successful robot learning. While fine-tuning has been well-studied as a simple but effective transfer approach…

Robotics · Computer Science 2021-08-10 Zhangjie Cao , Minae Kwon , Dorsa Sadigh

Finetuning a pretrained model has become a standard approach for training neural networks on novel tasks, resulting in fast convergence and improved performance. In this work, we study an alternative finetuning method, where instead of…

Machine Learning · Computer Science 2023-07-04 Gal Kaplun , Andrey Gurevich , Tal Swisa , Mazor David , Shai Shalev-Shwartz , Eran Malach

Training Deep Neural Networks (DNNs) is still highly time-consuming and compute-intensive. It has been shown that adapting a pretrained model may significantly accelerate this process. With a focus on classification, we show that current…

Neural and Evolutionary Computing · Computer Science 2020-12-01 Farshid Varno , Lucas May Petry , Lisa Di Jorio , Stan Matwin

Intermediate task transfer learning can greatly improve model performance. If, for example, one has little training data for emotion detection, first fine-tuning a language model on a sentiment classification dataset may improve performance…

Computation and Language · Computer Science 2024-10-22 David Schulte , Felix Hamborg , Alan Akbik

Agents trained with deep reinforcement learning algorithms are capable of performing highly complex tasks including locomotion in continuous environments. We investigate transferring the learning acquired in one task to a set of previously…

Machine Learning · Computer Science 2024-03-06 Suzan Ece Ada , Emre Ugur , H. Levent Akin

Sharing knowledge between tasks is vital for efficient learning in a multi-task setting. However, most research so far has focused on the easier case where knowledge transfer is not harmful, i.e., where knowledge from one task cannot…

Machine Learning · Computer Science 2019-07-08 Timo Bram , Gino Brunner , Oliver Richter , Roger Wattenhofer

Incremental learning suffers from two challenging problems; forgetting of old knowledge and intransigence on learning new knowledge. Prediction by the model incrementally learned with a subset of the dataset are thus uncertain and the…

Machine Learning · Computer Science 2019-02-05 Dahyun Kim , Jihwan Bae , Yeonsik Jo , Jonghyun Choi

As the landscape of large language models expands, efficiently finetuning for specific tasks becomes increasingly crucial. At the same time, the landscape of parameter-efficient finetuning methods rapidly expands. Consequently,…

Computation and Language · Computer Science 2024-11-05 Tobias Strangmann , Lennart Purucker , Jörg K. H. Franke , Ivo Rapant , Fabio Ferreira , Frank Hutter

Parameter fine tuning is a transfer learning approach whereby learned parameters from pre-trained source network are transferred to the target network followed by fine-tuning. Prior research has shown that this approach is capable of…

Computer Vision and Pattern Recognition · Computer Science 2019-09-20 Tasfia Shermin , Shyh Wei Teng , Manzur Murshed , Guojun Lu , Ferdous Sohel , Manoranjan Paul

As humans learn new skills and apply their existing knowledge while maintaining previously learned information, "continual learning" in machine learning aims to incorporate new data while retaining and utilizing past knowledge. However,…

Robotics · Computer Science 2025-07-29 Hanne Say , Suzan Ece Ada , Emre Ugur , Minoru Asada , Erhan Oztop

Most uses of machine learning today involve training a model from scratch for a particular task, or sometimes starting with a model pretrained on a related task and then fine-tuning on a downstream task. Both approaches offer limited…

Machine Learning · Computer Science 2022-05-26 Andrea Gesmundo , Jeff Dean

Neural Networks (NNs) are vulnerable to adversarial examples. Such inputs differ only slightly from their benign counterparts yet provoke misclassifications of the attacked NNs. The required perturbations to craft the examples are often…

Cryptography and Security · Computer Science 2020-09-30 Philip Sperl , Konstantin Böttinger

We investigate the impact of entropy change in deep learning systems by noise injection at different levels, including the embedding space and the image. The series of models that employ our methodology are collectively known as Noisy…

Artificial Intelligence · Computer Science 2025-09-09 Xiaowei Yu , Zhe Huang , Minheng Chen , Lu Zhang , Tianming Liu , Dajiang Zhu
‹ Prev 1 2 3 10 Next ›