Document Clustering with K-tree

Christopher M. De Vries; Shlomo Geva

doi:10.1007/978-3-642-03761-0_43

Document Clustering with K-tree

Information Retrieval 2010-01-07 v1 Artificial Intelligence Data Structures and Algorithms

Authors: Christopher M. De Vries , Shlomo Geva

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

This paper describes the approach taken to the XML Mining track at INEX 2008 by a group at the Queensland University of Technology. We introduce the K-tree clustering algorithm in an Information Retrieval context by adapting it for document clustering. Many large scale problems exist in document clustering. K-tree scales well with large inputs due to its low complexity. It offers promising results both in terms of efficiency and quality. Document classification was completed using Support Vector Machines.

Keywords

cluster analysis decision tree information retrieval

Cite

@article{arxiv.1001.0827,
  title  = {Document Clustering with K-tree},
  author = {Christopher M. De Vries and Shlomo Geva},
  journal= {arXiv preprint arXiv:1001.0827},
  year   = {2010}
}

Comments

12 pages, INEX 2008

Related papers

View all related →