English
Related papers

Related papers: Conditional Adapters: Parameter-efficient Transfer…

200 papers

Adapter-based parameter-efficient transfer learning has achieved exciting results in vision-language models. Traditional adapter methods often require training or fine-tuning, facing challenges such as insufficient samples or resource…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Juncheng Yang , Zuchao Li , Shuai Xie , Weiping Zhu , Wei Yu , Shijun Li

Large language models (LLMs) have achieved substantial advances in logical reasoning, yet they continue to lag behind human-level performance. In-context learning provides a viable solution that boosts the model's performance via prompting…

Artificial Intelligence · Computer Science 2026-04-22 Jianzhi Yan , Le Liu , Buzhou Tang , Yang Xiang , Dongning Sun , Zhiming Li

Data-driven approaches to modeling physical systems fail to generalize to unseen systems that share the same general dynamics with the learning domain, but correspond to different physical contexts. We propose a new framework for this key…

Machine Learning · Computer Science 2022-06-27 Matthieu Kirchmeyer , Yuan Yin , Jérémie Donà , Nicolas Baskiotis , Alain Rakotomamonjy , Patrick Gallinari

We propose two methods to make unsupervised domain adaptation (UDA) more parameter efficient using adapters, small bottleneck layers interspersed with every layer of the large-scale pre-trained language model (PLM). The first method…

Computation and Language · Computer Science 2023-02-17 Bhavitvya Malik , Abhinav Ramesh Kashyap , Min-Yen Kan , Soujanya Poria

A long-term ambition of information seeking QA systems is to reason over multi-modal contexts and generate natural answers to user queries. Today, memory intensive pre-trained language models are adapted to downstream tasks such as QA by…

Computation and Language · Computer Science 2023-07-12 Vaishali Pal , Evangelos Kanoulas , Maarten de Rijke

Diffusion language models promise bidirectional context and infilling capabilities that autoregressive coders lack, yet practical systems remain heavyweight. We introduce CoDA, a 1.7B-parameter diffusion coder trained on TPU with a fully…

Intermediate training of pre-trained transformer-based language models on domain-specific data leads to substantial gains for downstream tasks. To increase efficiency and prevent catastrophic forgetting alleviated from full domain-adaptive…

Computation and Language · Computer Science 2023-05-23 Chia-Chien Hung , Lukas Lange , Jannik Strötgen

Domain adaptation (DA) enables knowledge transfer from a labeled source domain to an unlabeled target domain by reducing the cross-domain distribution discrepancy. Most prior DA approaches leverage complicated and powerful deep neural…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Shuang Li , Jinming Zhang , Wenxuan Ma , Chi Harold Liu , Wei Li

Current parameter-efficient fine-tuning (PEFT) methods build adapters widely agnostic of the context of downstream task to learn, or the context of important knowledge to maintain. As a result, there is often a performance gap compared to…

Machine Learning · Computer Science 2025-03-11 Yibo Yang , Xiaojie Li , Zhongzhu Zhou , Shuaiwen Leon Song , Jianlong Wu , Liqiang Nie , Bernard Ghanem

Recently, the pre-trained Transformer models have received a rising interest in the field of speech processing thanks to their great success in various downstream tasks. However, most fine-tuning approaches update all the parameters of the…

Audio and Speech Processing · Electrical Eng. & Systems 2022-10-31 Junyi Peng , Themos Stafylakis , Rongzhi Gu , Oldřich Plchot , Ladislav Mošner , Lukáš Burget , Jan Černocký

Transfer learning aims to learn robust classifiers for the target domain by leveraging knowledge from a source domain. Since the source and the target domains are usually from different distributions, existing methods mainly focus on…

Machine Learning · Computer Science 2019-09-19 Jindong Wang , Yiqiang Chen , Wenjie Feng , Han Yu , Meiyu Huang , Qiang Yang

Point cloud analysis has achieved outstanding performance by transferring point cloud pre-trained models. However, existing methods for model adaptation usually update all model parameters, i.e., full fine-tuning paradigm, which is…

Computer Vision and Pattern Recognition · Computer Science 2024-04-08 Xin Zhou , Dingkang Liang , Wei Xu , Xingkui Zhu , Yihan Xu , Zhikang Zou , Xiang Bai

Adaptive Computation (AC) has been shown to be effective in improving the efficiency of Open-Domain Question Answering (ODQA) systems. However, current AC approaches require tuning of all model parameters, and training state-of-the-art ODQA…

Computation and Language · Computer Science 2021-07-06 Yuxiang Wu , Pasquale Minervini , Pontus Stenetorp , Sebastian Riedel

Vision-language retrieval is an important multi-modal learning topic, where the goal is to retrieve the most relevant visual candidate for a given text query. Recently, pre-trained models, e.g., CLIP, show great potential on retrieval…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Haojun Jiang , Jianke Zhang , Rui Huang , Chunjiang Ge , Zanlin Ni , Shiji Song , Gao Huang

Parameter-efficient transfer learning (PETL) methods have emerged as a solid alternative to the standard full fine-tuning approach. They only train a few extra parameters for each downstream task, without sacrificing performance and…

Audio and Speech Processing · Electrical Eng. & Systems 2024-07-16 Umberto Cappellazzo , Daniele Falavigna , Alessio Brutti , Mirco Ravanelli

In practice, environments constantly change over time and space, posing significant challenges for object detectors trained based on a closed-set assumption, i.e., training and test data share the same distribution. To this end, continual…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Deng Li , Aming Wu , Yang Li , Yaowei Wang , Yahong Han

Fine-tuning is widely used as the default algorithm for transfer learning from pre-trained models. Parameter inefficiency can however arise when, during transfer learning, all the parameters of a large pre-trained model need to be updated…

Computation and Language · Computer Science 2023-03-07 Yingting Li , Ambuj Mehrish , Shuai Zhao , Rishabh Bhardwaj , Amir Zadeh , Navonil Majumder , Rada Mihalcea , Soujanya Poria

Pre-trained vision-language models provide a robust foundation for efficient transfer learning across various downstream tasks. In the field of video action recognition, mainstream approaches often introduce additional modules to capture…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Haoxing Chen , Zizheng Huang , Yan Hong , Yanshuo Wang , Zhongcai Lyu , Zhuoer Xu , Jun Lan , Zhangxuan Gu

Recently, Transformer-based encoder-decoder models have demonstrated strong performance in multilingual speech recognition. However, the decoder's autoregressive nature and large size introduce significant bottlenecks during inference.…

Audio and Speech Processing · Electrical Eng. & Systems 2025-08-28 Yunkyu Lim , Jihwan Park , Hyung Yong Kim , Hanbin Lee , Byeong-Yeol Kim

Transformers have been the dominant architecture for Speech Translation in recent years, achieving significant improvements in translation quality. Since speech signals are longer than their textual counterparts, and due to the quadratic…

Computation and Language · Computer Science 2023-03-15 Ioannis Tsiamas , Gerard I. Gállego , José A. R. Fonollosa , Marta R. Costa-jussà
‹ Prev 1 2 3 10 Next ›