Related papers: Conditional Adapters: Parameter-efficient Transfer…

Cross-Modal Adapter: Parameter-Efficient Transfer Learning Approach for Vision-Language Models

Adapter-based parameter-efficient transfer learning has achieved exciting results in vision-language models. Traditional adapter methods often require training or fine-tuning, facing challenges such as insufficient samples or resource…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Juncheng Yang , Zuchao Li , Shuai Xie , Weiping Zhu , Wei Yu , Shijun Li

CoDA: Towards Effective Cross-domain Knowledge Transfer via CoT-guided Domain Adaptation

Large language models (LLMs) have achieved substantial advances in logical reasoning, yet they continue to lag behind human-level performance. In-context learning provides a viable solution that boosts the model's performance via prompting…

Artificial Intelligence · Computer Science 2026-04-22 Jianzhi Yan , Le Liu , Buzhou Tang , Yang Xiang , Dongning Sun , Zhiming Li

Generalizing to New Physical Systems via Context-Informed Dynamics Model

Data-driven approaches to modeling physical systems fail to generalize to unseen systems that share the same general dynamics with the learning domain, but correspond to different physical contexts. We propose a new framework for this key…

Machine Learning · Computer Science 2022-06-27 Matthieu Kirchmeyer , Yuan Yin , Jérémie Donà , Nicolas Baskiotis , Alain Rakotomamonjy , Patrick Gallinari

UDApter -- Efficient Domain Adaptation Using Adapters

We propose two methods to make unsupervised domain adaptation (UDA) more parameter efficient using adapters, small bottleneck layers interspersed with every layer of the large-scale pre-trained language model (PLM). The first method…

Computation and Language · Computer Science 2023-02-17 Bhavitvya Malik , Abhinav Ramesh Kashyap , Min-Yen Kan , Soujanya Poria

Parameter-Efficient Abstractive Question Answering over Tables or Text

A long-term ambition of information seeking QA systems is to reason over multi-modal contexts and generate natural answers to user queries. Today, memory intensive pre-trained language models are adapted to downstream tasks such as QA by…

Computation and Language · Computer Science 2023-07-12 Vaishali Pal , Evangelos Kanoulas , Maarten de Rijke

CoDA: Coding LM via Diffusion Adaptation

Diffusion language models promise bidirectional context and infilling capabilities that autoregressive coders lack, yet practical systems remain heavyweight. We introduce CoDA, a 1.7B-parameter diffusion coder trained on TPU with a fully…

Machine Learning · Computer Science 2025-10-07 Haolin Chen , Shiyu Wang , Can Qin , Bo Pang , Zuxin Liu , Jielin Qiu , Jianguo Zhang , Yingbo Zhou , Zeyuan Chen , Ran Xu , Shelby Heinecke , Silvio Savarese , Caiming Xiong , Huan Wang , Weiran Yao

TADA: Efficient Task-Agnostic Domain Adaptation for Transformers

Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive…

Computation and Language · Computer Science 2023-05-23 Chia-Chien Hung , Lukas Lange , Jannik Strötgen

Dynamic Domain Adaptation for Efficient Inference

Domain adaptation (DA) enables knowledge transfer from a labeled source domain to an unlabeled target domain by reducing the cross-domain distribution discrepancy. Most prior DA approaches leverage complicated and powerful deep neural…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Shuang Li , Jinming Zhang , Wenxuan Ma , Chi Harold Liu , Wei Li

CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning

Current parameter-efficient fine-tuning (PEFT) methods build adapters widely agnostic of the context of downstream task to learn, or the context of important knowledge to maintain. As a result, there is often a performance gap compared to…

Machine Learning · Computer Science 2025-03-11 Yibo Yang , Xiaojie Li , Zhongzhu Zhou , Shuaiwen Leon Song , Jianlong Wu , Liqiang Nie , Bernard Ghanem

Parameter-efficient transfer learning of pre-trained Transformer models for speaker verification using adapters

Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various downstream tasks. However, most fine-tuning approaches update all the parameters of the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-31 Junyi Peng , Themos Stafylakis , Rongzhi Gu , Oldřich Plchot , Ladislav Mošner , Lukáš Burget , Jan Černocký

Transfer Learning with Dynamic Distribution Adaptation

Transfer learning aims to learn robust classifiers for the target domain by leveraging knowledge from a source domain. Since the source and the target domains are usually from different distributions, existing methods mainly focus on…

Machine Learning · Computer Science 2019-09-19 Jindong Wang , Yiqiang Chen , Wenjie Feng , Han Yu , Meiyu Huang , Qiang Yang

Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis

Point cloud analysis has achieved outstanding performance by transferring point cloud pre-trained models. However, existing methods for model adaptation usually update all model parameters, i.e., full fine-tuning paradigm, which is…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Xin Zhou , Dingkang Liang , Wei Xu , Xingkui Zhu , Yihan Xu , Zhikang Zou , Xiang Bai

Training Adaptive Computation for Open-Domain Question Answering with Computational Constraints

Adaptive Computation (AC) has been shown to be effective in improving the efficiency of Open-Domain Question Answering (ODQA) systems. However, current AC approaches require tuning of all model parameters, and training state-of-the-art ODQA…

Computation and Language · Computer Science 2021-07-06 Yuxiang Wu , Pasquale Minervini , Pontus Stenetorp , Sebastian Riedel

Cross-Modal Adapter for Vision-Language Retrieval

Vision-language retrieval is an important multi-modal learning topic, where the goal is to retrieve the most relevant visual candidate for a given text query. Recently, pre-trained models, e.g., CLIP, show great potential on retrieval…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Haojun Jiang , Jianke Zhang , Rui Huang , Chunjiang Ge , Zanlin Ni , Shiji Song , Gao Huang

Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers

Parameter-efficient transfer learning (PETL) methods have emerged as a solid alternative to the standard full fine-tuning approach. They only train a few extra parameters for each downstream task, without sacrificing performance and…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-16 Umberto Cappellazzo , Daniele Falavigna , Alessio Brutti , Mirco Ravanelli

Continual Adaptation: Environment-Conditional Parameter Generation for Object Detection in Dynamic Scenarios

In practice, environments constantly change over time and space, posing significant challenges for object detectors trained based on a closed-set assumption, i.e., training and test data share the same distribution. To this end, continual…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Deng Li , Aming Wu , Yang Li , Yaowei Wang , Yahong Han

Evaluating Parameter-Efficient Transfer Learning Approaches on SURE Benchmark for Speech Understanding

Fine-tuning is widely used as the default algorithm for transfer learning from pre-trained models. Parameter inefficiency can however arise when, during transfer learning, all the parameters of a large pre-trained model need to be updated…

Computation and Language · Computer Science 2023-03-07 Yingting Li , Ambuj Mehrish , Shuai Zhao , Rishabh Bhardwaj , Amir Zadeh , Navonil Majumder , Rada Mihalcea , Soujanya Poria

Efficient Transfer Learning for Video-language Foundation Models

Pre-trained vision-language models provide a robust foundation for efficient transfer learning across various downstream tasks. In the field of video action recognition, mainstream approaches often introduce additional modules to capture…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Haoxing Chen , Zizheng Huang , Yan Hong , Yanshuo Wang , Zhongcai Lyu , Zhuoer Xu , Jun Lan , Zhangxuan Gu

Hybrid Decoding: Rapid Pass and Selective Detailed Correction for Sequence Models

Recently, Transformer-based encoder-decoder models have demonstrated strong performance in multilingual speech recognition. However, the decoder's autoregressive nature and large size introduce significant bottlenecks during inference.…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-28 Yunkyu Lim , Jihwan Park , Hyung Yong Kim , Hanbin Lee , Byeong-Yeol Kim

Efficient Speech Translation with Dynamic Latent Perceivers

Transformers have been the dominant architecture for Speech Translation in recent years, achieving significant improvements in translation quality. Since speech signals are longer than their textual counterparts, and due to the quadratic…

Computation and Language · Computer Science 2023-03-15 Ioannis Tsiamas , Gerard I. Gállego , José A. R. Fonollosa , Marta R. Costa-jussà