English

Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference

Computation and Language 2023-11-28 v2

Abstract

We propose Conditional Adapter (CoDA), a parameter-efficient transfer learning method that also improves inference efficiency. CoDA generalizes beyond standard adapter approaches to enable a new way of balancing speed and accuracy using conditional computation. Starting with an existing dense pretrained model, CoDA adds sparse activation together with a small number of new parameters and a light-weight training phase. Our experiments demonstrate that the CoDA approach provides an unexpectedly efficient way to transfer knowledge. Across a variety of language, vision, and speech tasks, CoDA achieves a 2x to 8x inference speed-up compared to the state-of-the-art Adapter approaches with moderate to no accuracy loss and the same parameter efficiency.

Keywords

Cite

@article{arxiv.2304.04947,
  title  = {Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference},
  author = {Tao Lei and Junwen Bai and Siddhartha Brahma and Joshua Ainslie and Kenton Lee and Yanqi Zhou and Nan Du and Vincent Y. Zhao and Yuexin Wu and Bo Li and Yu Zhang and Ming-Wei Chang},
  journal= {arXiv preprint arXiv:2304.04947},
  year   = {2023}
}

Comments

NeurIPS camera ready version

R2 v1 2026-06-28T09:58:44.453Z