Semantic classifier approach to document classification

Information Retrieval 2017-01-17 v1 Computation and Language

Abstract

In this paper we propose a new document classification method that bridges discrepancies (the so-called semantic gap) between the training set and the application sets of textual data. We demonstrate its superiority over classical text classification approaches, including traditional classifier ensembles. The method combines a document categorization technique with a single classifier or a classifier ensemble (the SEMCOM algorithm, Committee with Semantic Categorizer).

Cite

@article{arxiv.1701.04292,
  title  = {Semantic classifier approach to document classification},
  author = {Piotr Borkowski and Krzysztof Ciesielski and Mieczysław A. Kłopotek},
  journal= {arXiv preprint arXiv:1701.04292},
  year   = {2017}
}