Approximate textual retrieval

Information Retrieval 2007-05-23 v1 Digital Libraries

Abstract

An approximate textual retrieval algorithm for searching sources with high levels of defects is presented. The algorithm splits each word in a query into two overlapping segments and then builds composite regular expressions from interlacing subsets of those segments. This procedure reduces the probability of missing occurrences because of source defects, while also limiting the retrieval of irrelevant, non-contextual occurrences.
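
The abstract does not spell out how the segments are combined, so the following Python sketch is only one simple reading of the idea, not the paper's construction. It assumes that each word is split around its midpoint with a one-character overlap on each side, that a word counts as found when either of its segments is intact, and that consecutive query words must appear within a bounded gap. The names `split_overlapping`, `build_pattern`, `overlap`, and `max_gap` are hypothetical and chosen for illustration.

```python
import re


def split_overlapping(word, overlap=1):
    """Split a word into two segments that share characters around its midpoint.

    Assumption: the overlap width and the midpoint split are illustrative
    choices; the abstract only states that the two segments overlap.
    """
    mid = len(word) // 2
    left = word[: mid + overlap]
    right = word[max(mid - overlap, 0):]
    return left, right


def build_pattern(query, overlap=1, max_gap=12):
    """Build a composite regular expression for a multi-word query.

    Each word contributes an alternation of its two segments, so a defect
    confined to one half of the word does not hide the occurrence. Words are
    joined by a bounded, lazy gap so that matches stay in context.
    (Hypothetical construction, not taken from the paper.)
    """
    gap = ".{0,%d}?" % max_gap
    word_patterns = []
    for word in query.split():
        left, right = split_overlapping(word, overlap)
        word_patterns.append("(?:%s|%s)" % (re.escape(left), re.escape(right)))
    return re.compile(gap.join(word_patterns), re.IGNORECASE | re.DOTALL)


if __name__ == "__main__":
    pattern = build_pattern("approximate textual")
    samples = [
        "an approxlmate textual retrieval",   # defect in the first word
        "an aproximate textual retrieval",    # dropped character
        "approximate solutions of differential equations in a textual form",
    ]
    for text in samples:
        print(bool(pattern.search(text)), text)
```

In this sketch a defect that falls inside the shared overlap damages both segments of a word, so the overlap width trades tolerance to defects against the length of each segment. The bounded gap is what suppresses non-contextual hits: the query words must occur close together, not merely somewhere in the same source.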

Cite

@article{arxiv.0705.0751,
  title  = {Approximate textual retrieval},
  author = {Pere Constans},
  journal= {arXiv preprint arXiv:0705.0751},
  year   = {2007}
}