Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data
Abstract
We congratulate Lee, Nadler and Wasserman (henceforth LNW) on a very interesting paper on new methodology and supporting theory [arXiv:0707.0481]. Treelets seem to tackle two important problems of modern data analysis at once. For datasets with many variables, treelets give powerful predictions even if variables are highly correlated and redundant. Maybe more importantly, interpretation of the results is intuitive. Useful insights about relevant groups of variables can be gained. Our comments and questions include: (i) Could the success of treelets be replicated by a combination of hierarchical clustering and PCA? (ii) When choosing a suitable basis, treelets seem to be largely an unsupervised method. Could the results be even more interpretable and powerful if treelets would take into account some supervised response variable? (iii) Interpretability of the result hinges on the sparsity of the final basis. Do we expect that the selected groups of variables will always be sufficiently small to be amenable for interpretation?
Cite
@article{arxiv.0807.4018,
title = {Discussion of: Treelets--An adaptive multi-scale basis for sparse unordered data},
author = {Nicolai Meinshausen and Peter Bühlmann},
journal= {arXiv preprint arXiv:0807.4018},
year = {2008}
}
Comments
Published in at http://dx.doi.org/10.1214/08-AOAS137C the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)