English

Conditional Language Learning with Context

Computation and Language 2024-06-05 v1

Abstract

Language models can learn sophisticated language understanding skills from fitting raw text. They also unselectively learn useless corpus statistics and biases, especially during finetuning on domain-specific corpora. In this paper, we propose a simple modification to causal language modeling called conditional finetuning, which performs language modeling conditioned on a context. We show that a context can "explain away" certain corpus statistics and make the model avoid learning them. In this fashion, conditional finetuning achieves selective learning from a corpus, learning knowledge useful for downstream tasks while avoiding learning useless corpus statistics like topic biases. This selective learning effect leads to less forgetting and better stability-plasticity tradeoff in domain finetuning, potentially benefitting lifelong learning with language models.

Keywords

Cite

@article{arxiv.2406.01976,
  title  = {Conditional Language Learning with Context},
  author = {Xiao Zhang and Miao Li and Ji Wu},
  journal= {arXiv preprint arXiv:2406.01976},
  year   = {2024}
}

Comments

To appear at the 41st International Conference on Machine Learning (ICML 2024)

R2 v1 2026-06-28T16:52:23.591Z