English

The Vector Grounding Problem

Computation and Language 2025-12-11 v3

Abstract

Large language models (LLMs) produce seemingly meaningful outputs, yet they are trained on text alone without direct interaction with the world. This leads to a modern variant of the classical symbol grounding problem in AI: can LLMs' internal states and outputs be about extra-linguistic reality, independently of the meaning human interpreters project onto them? We argue that they can. We first distinguish referential grounding -- the connection between a representation and its worldly referent -- from other forms of grounding and argue it is the only kind essential to solving the problem. We contend that referential grounding is achieved when a system's internal states satisfy two conditions derived from teleosemantic theories of representation: (1) they stand in appropriate causal-informational relations to the world, and (2) they have a history of selection that has endowed them with the function of carrying this information. We argue that LLMs can meet both conditions, even without multimodality or embodiment.

Keywords

Cite

@article{arxiv.2304.01481,
  title  = {The Vector Grounding Problem},
  author = {Dimitri Coelho Mollo and Raphaël Millière},
  journal= {arXiv preprint arXiv:2304.01481},
  year   = {2025}
}

Comments

Accepted for publication in Philosophy and the Mind Sciences