Variable selection in functional data classification: a maxima-hunting proposal
Abstract
Variable selection is considered in the setting of supervised binary classification with functional data $\{X(t),\ t\in[0,1]\}$. By "variable selection" we mean any dimension-reduction method that replaces the whole trajectory $\{X(t),\ t\in[0,1]\}$ with a low-dimensional vector $(X(t_1),\ldots,X(t_k))$ while keeping a similar classification error. Our proposal for variable selection is based on the idea of selecting the local maxima $(t_1,\ldots,t_k)$ of the function $\mathcal{V}_X^2(t)=\mathcal{V}^2(X(t),Y)$, where $\mathcal{V}$ denotes the "distance covariance" association measure for random variables due to Székely, Rizzo and Bakirov (2007). This method provides a simple, natural way to deal with the relevance vs. redundancy trade-off that typically appears in variable selection. This paper includes (a) some theoretical motivation: a consistency result for the estimation of the maxima of $\mathcal{V}_X^2$ is shown, together with several theoretical models for the underlying process under which the relevant information is concentrated in the maxima of $\mathcal{V}_X^2$; and (b) an extensive empirical study, including about 400 simulated models and real data examples, aimed at comparing our variable selection method with other standard proposals for dimension reduction.
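The selection idea described in the abstract can be illustrated with a minimal sketch. It assumes that the trajectories are observed on a common discrete grid, that the squared distance covariance between the value at each grid point and the class label is estimated with the usual V-statistic form (double-centered distance matrices), and that a grid point counts as a local maximum when it attains the largest value within a window of `h` neighbours on each side. The names `dcov2` and `maxima_hunting`, the window parameter `h`, and the simulated data are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def dcov2(x, y):
    """Squared sample distance covariance of two 1-D samples (V-statistic form)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Pairwise absolute distances within each sample.
    a = np.abs(x[:, None] - x[None, :])
    b = np.abs(y[:, None] - y[None, :])
    # Double centering: subtract row and column means, add back the grand mean.
    A = a - a.mean(axis=0, keepdims=True) - a.mean(axis=1, keepdims=True) + a.mean()
    B = b - b.mean(axis=0, keepdims=True) - b.mean(axis=1, keepdims=True) + b.mean()
    return float((A * B).mean())


def maxima_hunting(X, y, h=1):
    """Return grid indices that are local maxima of j -> dcov2(X[:, j], y).

    X has shape (n_samples, n_grid_points); y holds the class labels.
    Indices are sorted by decreasing value of the estimated function,
    which is also returned.
    """
    X = np.asarray(X, dtype=float)
    v = np.array([dcov2(X[:, j], y) for j in range(X.shape[1])])
    # A point is kept if it attains the maximum over its window of radius h.
    peaks = [j for j in range(v.size) if v[j] == v[max(0, j - h): j + h + 1].max()]
    return sorted(peaks, key=lambda j: -v[j]), v


# Hypothetical toy data: the two classes differ by a bump centred at t = 0.5.
rng = np.random.default_rng(0)
n, m = 200, 50
t = np.linspace(0, 1, m)
y = rng.integers(0, 2, size=n)
bump = np.exp(-((t - 0.5) / 0.1) ** 2)
X = 0.5 * rng.normal(size=(n, m)) + 2.0 * y[:, None] * bump

selected, v = maxima_hunting(X, y, h=3)
```

In this toy setting the first entry of `selected` should fall near the centre of the grid, where the class means differ most. Keeping only the first few entries of `selected` gives the low-dimensional vector used for classification; a larger `h` discards more neighbouring (and typically redundant) grid points.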
Cite
@article{arxiv.1309.6697,
title = {Variable selection in functional data classification: a maxima-hunting proposal},
author = {José R. Berrendero and Antonio Cuevas and José L. Torrecilla},
journal= {arXiv preprint arXiv:1309.6697},
year = {2016}
}