This paper is about the technical design of a large computational lexicon, its storage, and its access from a Prolog environment. Traditionally, efficient access and storage of data structures is implemented by a relational database management system. In Delilah, a lexicon-based NLP system, efficient access to the lexicon by the semantic generator is vital. We show that our highly detailed HPSG-style lexical specifications do not fit well in the Relational Model, and that they cannot be efficiently retrieved. We argue that they fit more naturally in the Object-Oriented Model. Although storage of objects is redundant, we claim that efficient access is still possible by applying indexing, and compression techniques from the Relational Model to the Object-Oriented Model. We demonstrate that it is possible to implement object-oriented storage and fast access in ISO Prolog.
@article{arxiv.0905.3318,
title = {An Object-Oriented and Fast Lexicon for Semantic Generation},
author = {Maarten Hijzelendoorn and Crit Cremers},
journal= {arXiv preprint arXiv:0905.3318},
year = {2009}
}
Comments
Paper presented at the 18th Computational Linguistics In the Netherlands Meeting (CLIN), Nijmegen, 10 December 2007, 15pp