Captioning Visualizations with Large Language Models (CVLLM): A Tutorial

Computation and Language 2024-07-01 v1 Artificial Intelligence Human-Computer Interaction

Abstract

Automatically captioning visualizations is not new, but recent advances in large language models (LLMs) open exciting new possibilities. In this tutorial, after providing a brief review of Information Visualization (InfoVis) principles and past work in captioning, we introduce neural models and the transformer architecture used in generic LLMs. We then discuss their recent applications in InfoVis, with a focus on captioning. Additionally, we explore promising future directions in this field.

Cite

@article{arxiv.2406.19512,
  title  = {Captioning Visualizations with Large Language Models (CVLLM): A Tutorial},
  author = {Giuseppe Carenini and Jordon Johnson and Ali Salamatian},
  journal= {arXiv preprint arXiv:2406.19512},
  year   = {2024}
}

Comments

6 pages, 4 figures
