English

Corpus sp{\'e}cialis{\'e} et ressource de sp{\'e}cialit{\'e}

Information Retrieval 2015-06-22 v2 Computation and Language

Abstract

"Semantic Atlas" is a mathematic and statistic model to visualise word senses according to relations between words. The model, that has been applied to proximity relations from a corpus, has shown its ability to distinguish word senses as the corpus' contributors comprehend them. We propose to use the model and a specialised corpus in order to create automatically a specialised dictionary relative to the corpus' domain. A morpho-syntactic analysis performed on the corpus makes it possible to create the dictionary from syntactic relations between lexical units. The semantic resource can be used to navigate semantically - and not only lexically - through the corpus, to create classical dictionaries or for diachronic studies of the language.

Keywords

Cite

@article{arxiv.0801.1179,
  title  = {Corpus sp{\'e}cialis{\'e} et ressource de sp{\'e}cialit{\'e}},
  author = {Bernard Jacquemin and Sabine Ploux},
  journal= {arXiv preprint arXiv:0801.1179},
  year   = {2015}
}

Comments

16 pages, in French

R2 v1 2026-06-21T10:00:39.168Z