English

Document Relevance Evaluation via Term Distribution Analysis Using Fourier Series Expansion

Information Retrieval 2009-07-18 v1

Abstract

In addition to the frequency of terms in a document collection, the distribution of terms plays an important role in determining the relevance of documents for a given search query. In this paper, term distribution analysis using Fourier series expansion as a novel approach for calculating an abstract representation of term positions in a document corpus is introduced. Based on this approach, two methods for improving the evaluation of document relevance are proposed: (a) a function-based ranking optimization representing a user defined document region, and (b) a query expansion technique based on overlapping the term distributions in the top-ranked documents. Experimental results demonstrate the effectiveness of the proposed approach in providing new possibilities for optimizing the retrieval process.

Keywords

Cite

@article{arxiv.0903.0153,
  title  = {Document Relevance Evaluation via Term Distribution Analysis Using Fourier Series Expansion},
  author = {Patricio Galeas and Ralph Kretschmer and Bernd Freisleben},
  journal= {arXiv preprint arXiv:0903.0153},
  year   = {2009}
}

Comments

9 pages, submitted to proceedings of JCDL-2009

R2 v1 2026-06-21T12:17:00.421Z