Rodeo: Sparse, greedy nonparametric regression
Abstract
We present a greedy method for simultaneously performing local bandwidth selection and variable selection in nonparametric regression. The method starts with a local linear estimator with large bandwidths, and incrementally decreases the bandwidth of variables for which the gradient of the estimator with respect to bandwidth is large. The method--called rodeo (regularization of derivative expectation operator)--conducts a sequence of hypothesis tests to threshold derivatives, and is easy to implement. Under certain assumptions on the regression function and sampling density, it is shown that the rodeo applied to local linear smoothing avoids the curse of dimensionality, achieving near optimal minimax rates of convergence in the number of relevant variables, as if these variables were isolated in advance.
Cite
@article{arxiv.0803.1709,
title = {Rodeo: Sparse, greedy nonparametric regression},
author = {John Lafferty and Larry Wasserman},
journal= {arXiv preprint arXiv:0803.1709},
year = {2008}
}
Comments
Published in at http://dx.doi.org/10.1214/009053607000000811 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)