Related papers: Adaptor: Objective-Centric Adaptation Framework fo…

Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning

We introduce Adapters, an open-source library that unifies parameter-efficient and modular transfer learning in large language models. By integrating 10 diverse adapter methods into a unified interface, Adapters offers ease of use and…

Computation and Language · Computer Science 2023-11-21 Clifton Poth , Hannah Sterz , Indraneil Paul , Sukannya Purkayastha , Leon Engländer , Timo Imhof , Ivan Vulić , Sebastian Ruder , Iryna Gurevych , Jonas Pfeiffer

Towards a General Pre-training Framework for Adaptive Learning in MOOCs

Adaptive learning aims to stimulate and meet the needs of individual learners, which requires sophisticated system-level coordination of diverse tasks, including modeling learning resources, estimating student states, and making…

Computers and Society · Computer Science 2022-08-10 Qingyang Zhong , Jifan Yu , Zheyuan Zhang , Yiming Mao , Yuquan Wang , Yankai Lin , Lei Hou , Juanzi Li , Jie Tang

Efficient Adapter Tuning of Pre-trained Speech Models for Automatic Speaker Verification

With excellent generalization ability, self-supervised speech models have shown impressive performance on various downstream speech tasks in the pre-training and fine-tuning paradigm. However, as the growing size of pre-trained models,…

Audio and Speech Processing · Electrical Eng. & Systems 2024-03-04 Mufan Sang , John H. L. Hansen

UDapter: Language Adaptation for Truly Universal Dependency Parsing

Recent advances in multilingual dependency parsing have brought the idea of a truly universal parser closer to reality. However, cross-language interference and restrained model capacity remain major obstacles. To address this, we propose a…

Computation and Language · Computer Science 2020-10-07 Ahmet Üstün , Arianna Bisazza , Gosse Bouma , Gertjan van Noord

Multilingual Domain Adaptation for NMT: Decoupling Language and Domain Information with Adapters

Adapter layers are lightweight, learnable units inserted between transformer layers. Recent work explores using such layers for neural machine translation (NMT), to adapt pre-trained models to new domains or language pairs, training only a…

Computation and Language · Computer Science 2021-10-20 Asa Cooper Stickland , Alexandre Bérard , Vassilina Nikoulina

Legal Transformer Models May Not Always Help

Deep learning-based Natural Language Processing methods, especially transformers, have achieved impressive performance in the last few years. Applying those state-of-the-art NLP methods to legal activities to automate or simplify some…

Computation and Language · Computer Science 2021-09-16 Saibo Geng , Rémi Lebret , Karl Aberer

Simple, Scalable Adaptation for Neural Machine Translation

Fine-tuning pre-trained Neural Machine Translation (NMT) models is the dominant approach for adapting to new languages and domains. However, fine-tuning requires adapting and maintaining a separate model for each target task. We propose a…

Computation and Language · Computer Science 2019-09-19 Ankur Bapna , Naveen Arivazhagan , Orhan Firat

Cross-Modal Adapter for Vision-Language Retrieval

Vision-language retrieval is an important multi-modal learning topic, where the goal is to retrieve the most relevant visual candidate for a given text query. Recently, pre-trained models, e.g., CLIP, show great potential on retrieval…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Haojun Jiang , Jianke Zhang , Rui Huang , Chunjiang Ge , Zanlin Ni , Shiji Song , Gao Huang

Adaptive Neural Compilation

This paper proposes an adaptive neural-compilation framework to address the problem of efficient program learning. Traditional code optimisation strategies used in compilers are based on applying pre-specified set of transformations that…

Artificial Intelligence · Computer Science 2016-05-27 Rudy Bunel , Alban Desmaison , Pushmeet Kohli , Philip H. S. Torr , M. Pawan Kumar

Object-aware Video-language Pre-training for Retrieval

Recently, by introducing large-scale dataset and strong transformer network, video-language pre-training has shown great success especially for retrieval. Yet, existing video-language transformer models do not explicitly fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2022-05-19 Alex Jinpeng Wang , Yixiao Ge , Guanyu Cai , Rui Yan , Xudong Lin , Ying Shan , Xiaohu Qie , Mike Zheng Shou

An Adapter-Based Unified Model for Multiple Spoken Language Processing Tasks

Self-supervised learning models have revolutionized the field of speech processing. However, the process of fine-tuning these models on downstream tasks requires substantial computational resources, particularly when dealing with multiple…

Computation and Language · Computer Science 2024-06-24 Varsha Suresh , Salah Aït-Mokhtar , Caroline Brun , Ioan Calapodescu

AdaptFlow: Adaptive Workflow Optimization via Meta-Learning

Recent advances in large language models (LLMs) have sparked growing interest in agentic workflows, which are structured sequences of LLM invocations intended to solve complex tasks. However, existing approaches often rely on static…

Artificial Intelligence · Computer Science 2025-08-12 Runchuan Zhu , Bowen Jiang , Lingrui Mei , Fangkai Yang , Lu Wang , Haoxiang Gao , Fengshuo Bai , Pu Zhao , Qingwei Lin , Saravan Rajmohan , Dongmei Zhang

AdapterHub Playground: Simple and Flexible Few-Shot Learning with Adapters

The open-access dissemination of pretrained language models through online repositories has led to a democratization of state-of-the-art natural language processing (NLP) research. This also allows people outside of NLP to use such models…

Computation and Language · Computer Science 2022-04-20 Tilman Beck , Bela Bohlender , Christina Viehmann , Vincent Hane , Yanik Adamson , Jaber Khuri , Jonas Brossmann , Jonas Pfeiffer , Iryna Gurevych

The Hidden Space of Transformer Language Adapters

We analyze the operation of transformer language adapters, which are small modules trained on top of a frozen language model to adapt its predictions to new target languages. We show that adapted predictions mostly evolve in the source…

Computation and Language · Computer Science 2024-06-11 Jesujoba O. Alabi , Marius Mosbach , Matan Eyal , Dietrich Klakow , Mor Geva

MDAPT: Multilingual Domain Adaptive Pretraining in a Single Model

Domain adaptive pretraining, i.e. the continued unsupervised pretraining of a language model on domain-specific text, improves the modelling of text for downstream tasks within the domain. Numerous real-world applications are based on…

Computation and Language · Computer Science 2021-09-15 Rasmus Kær Jørgensen , Mareike Hartmann , Xiang Dai , Desmond Elliott

AdapterHub: A Framework for Adapting Transformers

The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes…

Computation and Language · Computer Science 2020-10-07 Jonas Pfeiffer , Andreas Rücklé , Clifton Poth , Aishwarya Kamath , Ivan Vulić , Sebastian Ruder , Kyunghyun Cho , Iryna Gurevych

Open Programming Language Interpreters

Context: This paper presents the concept of open programming language interpreters and the implementation of a framework-level metaobject protocol (MOP) to support them. Inquiry: We address the problem of dynamic interpreter adaptation to…

Programming Languages · Computer Science 2017-04-03 Walter Cazzola , Albert Shaqiri

Parameter-Efficient Fine-Tuning With Adapters

In the arena of language model fine-tuning, the traditional approaches, such as Domain-Adaptive Pretraining (DAPT) and Task-Adaptive Pretraining (TAPT), although effective, but computational intensive. This research introduces a novel…

Computation and Language · Computer Science 2024-05-10 Keyu Chen , Yuan Pang , Zi Yang

Adaptive Fine-Tuning of Transformer-Based Language Models for Named Entity Recognition

The current standard approach for fine-tuning transformer-based language models includes a fixed number of training epochs and a linear learning rate schedule. In order to obtain a near-optimal model for the given downstream task, a search…

Computation and Language · Computer Science 2022-02-08 Felix Stollenwerk

Cross-Domain Object Detection Using Unsupervised Image Translation

Unsupervised domain adaptation for object detection addresses the adaption of detectors trained in a source domain to work accurately in an unseen target domain. Recently, methods approaching the alignment of the intermediate features…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Vinicius F. Arruda , Rodrigo F. Berriel , Thiago M. Paixão , Claudine Badue , Alberto F. De Souza , Nicu Sebe , Thiago Oliveira-Santos