English

Natural Language Processing: Structure and Complexity

cmp-lg 2016-08-31 v1 Computation and Language

Abstract

We introduce a method for analyzing the complexity of natural language processing tasks, and for predicting the difficulty new NLP tasks. Our complexity measures are derived from the Kolmogorov complexity of a class of automata --- {\it meaning automata}, whose purpose is to extract relevant pieces of information from sentences. Natural language semantics is defined only relative to the set of questions an automaton can answer. The paper shows examples of complexity estimates for various NLP programs and tasks, and some recipes for complexity management. It positions natural language processing as a subdomain of software engineering, and lays down its formal foundation.

Keywords

Cite

@article{arxiv.cmp-lg/9607017,
  title  = {Natural Language Processing: Structure and Complexity},
  author = {Wlodek Zadrozny},
  journal= {arXiv preprint arXiv:cmp-lg/9607017},
  year   = {2016}
}

Comments

8 pp. Latex (documentstyle[ijcai89,named]). In: "Proc. SEKE'96, 8th Int. Conf. on Software Engineering and Knowledge Engineering", Lake Tahoe, 1996, pages 595-602