Large Language Models
Computation and Language
2023-10-09 v2 High Energy Physics - Theory
History and Overview
Computational Physics
Abstract
Artificial intelligence is making spectacular progress, and one of the best examples is the development of large language models (LLMs) such as OpenAI's GPT series. In these lectures, written for readers with a background in mathematics or physics, we give a brief history and survey of the state of the art, and describe the underlying transformer architecture in detail. We then explore some current ideas on how LLMs work and how models trained to predict the next word in a text are able to perform other tasks displaying intelligence.
Cite
@article{arxiv.2307.05782,
title = {Large Language Models},
author = {Michael R. Douglas},
journal= {arXiv preprint arXiv:2307.05782},
year = {2023}
}
Comments
47 pages (v2: added references, corrected typos)