HyperLoader: Integrating Hypernetwork-Based LoRA and Adapter Layers into Multi-Task Transformers for Sequence Labelling

Jesus-German Ortiz-Barajas; Helena Gomez-Adorno; Thamar Solorio

HyperLoader: Integrating Hypernetwork-Based LoRA and Adapter Layers into Multi-Task Transformers for Sequence Labelling

Computation and Language 2024-08-27 v3

Authors: Jesus-German Ortiz-Barajas , Helena Gomez-Adorno , Thamar Solorio

Abstract

We present HyperLoader, a simple approach that combines different parameter-efficient fine-tuning methods in a multi-task setting. To achieve this goal, our model uses a hypernetwork to generate the weights of these modules based on the task, the transformer layer, and its position within this layer. Our method combines the benefits of multi-task learning by capturing the structure of all tasks while reducing the task interference problem by encapsulating the task-specific knowledge in the generated weights and the benefits of combining different parameter-efficient methods to outperform full-fine tuning. We provide empirical evidence that HyperLoader outperforms previous approaches in most datasets and obtains the best average performance across tasks in high-resource and low-resource scenarios.

Keywords

parameter-efficient fine-tuning multi-task learning model transformation

Cite

@article{arxiv.2407.01411,
  title  = {HyperLoader: Integrating Hypernetwork-Based LoRA and Adapter Layers into Multi-Task Transformers for Sequence Labelling},
  author = {Jesus-German Ortiz-Barajas and Helena Gomez-Adorno and Thamar Solorio},
  journal= {arXiv preprint arXiv:2407.01411},
  year   = {2024}
}

HyperLoader: Integrating Hypernetwork-Based LoRA and Adapter Layers into Multi-Task Transformers for Sequence Labelling

Abstract

Keywords

Cite

Related papers