English
Related papers

Related papers: Adaptable Adapters

200 papers

Transformer-based pre-trained models with millions of parameters require large storage. Recent approaches tackle this shortcoming by training adapters, but these approaches still require a relatively large number of parameters. In this…

Computation and Language · Computer Science 2023-01-31 Chin-Lun Fu , Zih-Ching Chen , Yun-Ru Lee , Hung-yi Lee

Adapter-based tuning has recently arisen as an alternative to fine-tuning. It works by adding light-weight adapter modules to a pretrained language model (PrLM) and only updating the parameters of adapter modules when learning on a…

Computation and Language · Computer Science 2021-06-08 Ruidan He , Linlin Liu , Hai Ye , Qingyu Tan , Bosheng Ding , Liying Cheng , Jia-Wei Low , Lidong Bing , Luo Si

The open-access dissemination of pretrained language models through online repositories has led to a democratization of state-of-the-art natural language processing (NLP) research. This also allows people outside of NLP to use such models…

Computation and Language · Computer Science 2022-04-20 Tilman Beck , Bela Bohlender , Christina Viehmann , Vincent Hane , Yanik Adamson , Jaber Khuri , Jonas Brossmann , Jonas Pfeiffer , Iryna Gurevych

We introduce Adapters, an open-source library that unifies parameter-efficient and modular transfer learning in large language models. By integrating 10 diverse adapter methods into a unified interface, Adapters offers ease of use and…

The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes…

Computation and Language · Computer Science 2020-10-07 Jonas Pfeiffer , Andreas Rücklé , Clifton Poth , Aishwarya Kamath , Ivan Vulić , Sebastian Ruder , Kyunghyun Cho , Iryna Gurevych

Transfer learning with large pretrained transformer-based language models like BERT has become a dominating approach for most NLP tasks. Simply fine-tuning those large language models on downstream tasks or combining it with task-specific…

Computation and Language · Computer Science 2021-08-06 Wenjuan Han , Bo Pang , Yingnian Wu

Fine-tuning large pre-trained models is an effective transfer mechanism in NLP. However, in the presence of many downstream tasks, fine-tuning is parameter inefficient: an entire new model is required for every task. As an alternative, we…

Fine-tuning of self-supervised models is a powerful transfer learning method in a variety of fields, including speech processing, since it can utilize generic feature representations obtained from large amounts of unlabeled data.…

Multimedia · Computer Science 2022-12-07 Shinta Otake , Rei Kawakami , Nakamasa Inoue

Adapters are widely popular parameter-efficient transfer learning approaches in natural language processing that insert trainable modules in between layers of a pre-trained language model. Apart from several heuristics, however, there has…

Computation and Language · Computer Science 2023-10-31 Rishabh Bhardwaj , Tushar Vaidya , Soujanya Poria

Adapter modules were recently introduced as an efficient alternative to fine-tuning in NLP. Adapter tuning consists in freezing pretrained parameters of a model and injecting lightweight modules between layers, resulting in the addition of…

Computation and Language · Computer Science 2021-07-14 Hang Le , Juan Pino , Changhan Wang , Jiatao Gu , Didier Schwab , Laurent Besacier

Parameter-Efficient transfer learning with Adapters have been studied in Natural Language Processing (NLP) as an alternative to full fine-tuning. Adapters are memory-efficient and scale well with downstream tasks by training small…

Information Retrieval · Computer Science 2023-03-24 Vaishali Pal , Carlos Lassance , Hervé Déjean , Stéphane Clinchant

Adapters have been positioned as a parameter-efficient fine-tuning (PEFT) approach, whereby a minimal number of parameters are added to the model and fine-tuned. However, adapters have not been sufficiently analyzed to understand if PEFT…

Computation and Language · Computer Science 2023-05-15 Nandini Mundra , Sumanth Doddapaneni , Raj Dabre , Anoop Kunchukuttan , Ratish Puduppully , Mitesh M. Khapra

Transformer-based pre-trained models have revolutionized NLP for superior performance and generality. Fine-tuning pre-trained models for downstream tasks often requires private data, for which federated learning is the de-facto approach…

Machine Learning · Computer Science 2023-05-10 Dongqi Cai , Yaozong Wu , Shangguang Wang , Felix Xiaozhu Lin , Mengwei Xu

Adapters, a plug-in neural network module with some tunable parameters, have emerged as a parameter-efficient transfer learning technique for adapting pre-trained models to downstream tasks, especially for natural language processing (NLP)…

Information Retrieval · Computer Science 2023-12-11 Junchen Fu , Fajie Yuan , Yu Song , Zheng Yuan , Mingyue Cheng , Shenghui Cheng , Jiaqi Zhang , Jie Wang , Yunzhu Pan

Adapters provide an efficient and lightweight mechanism for adapting trained transformer models to a variety of different tasks. However, they have often been found to be outperformed by other adaptation mechanisms, including low-rank…

Computer Vision and Pattern Recognition · Computer Science 2024-06-12 Jan-Martin O. Steitz , Stefan Roth

Large Language Models (LLMs) are computationally intensive, particularly during inference. Neuron-adaptive techniques, which selectively activate neurons in Multi-Layer Perceptron (MLP) layers, offer some speedups but suffer from…

Machine Learning · Computer Science 2025-05-08 Roberto Garcia , Jerry Liu , Daniel Sorvisto , Sabri Eyuboglu

Continual learning allows the system to learn and adapt to new tasks while retaining the knowledge acquired from previous tasks. However, deep learning models suffer from catastrophic forgetting of knowledge learned from earlier tasks while…

Machine Learning · Computer Science 2024-12-17 Dupati Srikar Chandra , P. K. Srijith , Dana Rezazadegan , Chris McCarthy

We present a novel resizing module for neural networks: shape adaptor, a drop-in enhancement built on top of traditional resizing layers, such as pooling, bilinear sampling, and strided convolution. Whilst traditional resizing layers have…

Machine Learning · Computer Science 2020-08-11 Shikun Liu , Zhe Lin , Yilin Wang , Jianming Zhang , Federico Perazzi , Edward Johns

Parameter-efficient transfer learning (PETL) aims to adapt pre-trained models to new downstream tasks while minimizing the number of fine-tuned parameters. Adapters, a popular approach in PETL, inject additional capacity into existing…

Machine Learning · Computer Science 2024-10-22 Aleksandra I. Nowak , Otniel-Bogdan Mercea , Anurag Arnab , Jonas Pfeiffer , Yann Dauphin , Utku Evci

Adapter layers are lightweight, learnable units inserted between transformer layers. Recent work explores using such layers for neural machine translation (NMT), to adapt pre-trained models to new domains or language pairs, training only a…

Computation and Language · Computer Science 2021-10-20 Asa Cooper Stickland , Alexandre Bérard , Vassilina Nikoulina
‹ Prev 1 2 3 10 Next ›