Large Language Model Programs

Imanol Schlag; Sainbayar Sukhbaatar; Asli Celikyilmaz; Wen-tau Yih; Jason Weston; Jürgen Schmidhuber; Xian Li

Large Language Model Programs

Machine Learning 2023-05-10 v1 Artificial Intelligence Computation and Language

Authors: Imanol Schlag , Sainbayar Sukhbaatar , Asli Celikyilmaz , Wen-tau Yih , Jason Weston , Jürgen Schmidhuber , Xian Li

View on arXiv ↗ PDF ↗

Abstract

In recent years, large pre-trained language models (LLMs) have demonstrated the ability to follow instructions and perform novel tasks from a few examples. The possibility to parameterise an LLM through such in-context examples widens their capability at a much lower cost than finetuning. We extend this line of reasoning and present a method which further expands the capabilities of an LLM by embedding it within an algorithm or program. To demonstrate the benefits of this approach, we present an illustrative example of evidence-supported question-answering. We obtain a 6.4\% improvement over the chain of thought baseline through a more algorithmic approach without any finetuning. Furthermore, we highlight recent work from this perspective and discuss the advantages and disadvantages in comparison to the standard approaches.

Keywords

large language model instruction tuning large language model reasoning

Cite

@article{arxiv.2305.05364,
  title  = {Large Language Model Programs},
  author = {Imanol Schlag and Sainbayar Sukhbaatar and Asli Celikyilmaz and Wen-tau Yih and Jason Weston and Jürgen Schmidhuber and Xian Li},
  journal= {arXiv preprint arXiv:2305.05364},
  year   = {2023}
}

Large Language Model Programs

Abstract

Keywords

Cite

Related papers