Language Model Cascades

David Dohan; Winnie Xu; Aitor Lewkowycz; Jacob Austin; David Bieber; Raphael Gontijo Lopes; Yuhuai Wu; Henryk Michalewski; Rif A. Saurous; Jascha Sohl-dickstein; Kevin Murphy; Charles Sutton

Language Model Cascades

Computation and Language 2022-07-29 v2 Artificial Intelligence

Authors: David Dohan , Winnie Xu , Aitor Lewkowycz , Jacob Austin , David Bieber , Raphael Gontijo Lopes , Yuhuai Wu , Henryk Michalewski , Rif A. Saurous , Jascha Sohl-dickstein , Kevin Murphy , Charles Sutton

View on arXiv ↗ PDF ↗

Abstract

Prompted models have demonstrated impressive few-shot learning abilities. Repeated interactions at test-time with a single model, or the composition of multiple models together, further expands capabilities. These compositions are probabilistic models, and may be expressed in the language of graphical models with random variables whose values are complex data types such as strings. Cases with control flow and dynamic structure require techniques from probabilistic programming, which allow implementing disparate model structures and inference strategies in a unified language. We formalize several existing techniques from this perspective, including scratchpads / chain of thought, verifiers, STaR, selection-inference, and tool use. We refer to the resulting programs as language model cascades.

Keywords

probabilistic programming language modeling natural language processing

Cite

@article{arxiv.2207.10342,
  title  = {Language Model Cascades},
  author = {David Dohan and Winnie Xu and Aitor Lewkowycz and Jacob Austin and David Bieber and Raphael Gontijo Lopes and Yuhuai Wu and Henryk Michalewski and Rif A. Saurous and Jascha Sohl-dickstein and Kevin Murphy and Charles Sutton},
  journal= {arXiv preprint arXiv:2207.10342},
  year   = {2022}
}

Comments

Presented as spotlight at the Beyond Bases workshop at ICML 2022 (https://beyond-bayes.github.io)

Language Model Cascades

Abstract

Keywords

Cite

Comments

Related papers