English

Language Modelling as a Multi-Task Problem

Computation and Language 2021-01-28 v1 Machine Learning

Abstract

In this paper, we propose to study language modelling as a multi-task problem, bringing together three strands of research: multi-task learning, linguistics, and interpretability. Based on hypotheses derived from linguistic theory, we investigate whether language models adhere to learning principles of multi-task learning during training. To showcase the idea, we analyse the generalisation behaviour of language models as they learn the linguistic concept of Negative Polarity Items (NPIs). Our experiments demonstrate that a multi-task setting naturally emerges within the objective of the more general task of language modelling.We argue that this insight is valuable for multi-task learning, linguistics and interpretability research and can lead to exciting new findings in all three domains.

Keywords

Cite

@article{arxiv.2101.11287,
  title  = {Language Modelling as a Multi-Task Problem},
  author = {Lucas Weber and Jaap Jumelet and Elia Bruni and Dieuwke Hupkes},
  journal= {arXiv preprint arXiv:2101.11287},
  year   = {2021}
}

Comments

Accepted for publication at EACL 2021

R2 v1 2026-06-23T22:34:37.752Z