English

Multimodal Grounding for Language Processing

Computation and Language 2019-07-04 v2 Artificial Intelligence

Abstract

This survey discusses how recent developments in multimodal processing facilitate conceptual grounding of language. We categorize the information flow in multimodal processing with respect to cognitive models of human information processing and analyze different methods for combining multimodal representations. Based on this methodological inventory, we discuss the benefit of multimodal grounding for a variety of language processing tasks and the challenges that arise. We particularly focus on multimodal grounding of verbs which play a crucial role for the compositional power of language.

Keywords

Cite

@article{arxiv.1806.06371,
  title  = {Multimodal Grounding for Language Processing},
  author = {Lisa Beinborn and Teresa Botschen and Iryna Gurevych},
  journal= {arXiv preprint arXiv:1806.06371},
  year   = {2019}
}

Comments

The paper has been published in the Proceedings of the 27 Conference of Computational Linguistics. Please refer to this version for citations: https://www.aclweb.org/anthology/papers/C/C18/C18-1197/

R2 v1 2026-06-23T02:32:21.275Z