English

Feature Selection For High-Dimensional Clustering

Statistics Theory 2014-06-10 v1 Machine Learning Statistics Theory

Abstract

We present a nonparametric method for selecting informative features in high-dimensional clustering problems. We start with a screening step that uses a test for multimodality. Then we apply kernel density estimation and mode clustering to the selected features. The output of the method consists of a list of relevant features, and cluster assignments. We provide explicit bounds on the error rate of the resulting clustering. In addition, we provide the first error bounds on mode based clustering.

Keywords

Cite

@article{arxiv.1406.2240,
  title  = {Feature Selection For High-Dimensional Clustering},
  author = {Larry Wasserman and Martin Azizyan and Aarti Singh},
  journal= {arXiv preprint arXiv:1406.2240},
  year   = {2014}
}

Comments

11 pages, 2 figures

R2 v1 2026-06-22T04:34:10.959Z