Characterizing $L_2$Boosting

John Ehrlinger; Hemant Ishwaran

doi:10.1214/12-AOS997

Characterizing $L_2$Boosting

Statistics Theory 2012-07-24 v1 Statistics Theory

Authors: John Ehrlinger , Hemant Ishwaran

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

We consider $L_2$ Boosting, a special case of Friedman's generic boosting algorithm applied to linear regression under $L_2$ -loss. We study $L_2$ Boosting for an arbitrary regularization parameter and derive an exact closed form expression for the number of steps taken along a fixed coordinate direction. This relationship is used to describe $L_2$ Boosting's solution path, to describe new tools for studying its path, and to characterize some of the algorithm's unique properties, including active set cycling, a property where the algorithm spends lengthy periods of time cycling between the same coordinates when the regularization parameter is arbitrarily small. Our fixed descent analysis also reveals a repressible condition that limits the effectiveness of $L_2$ Boosting in correlated problems by preventing desirable variables from entering the solution path. As a simple remedy, a data augmentation method similar to that used for the elastic net is used to introduce $L_2$ -penalization and is shown, in combination with decorrelation, to reverse the repressible condition and circumvents $L_2$ Boosting's deficiencies in correlated problems. In itself, this presents a new explanation for why the elastic net is successful in correlated problems and why methods like LAR and lasso can perform poorly in such settings.

Cite

@article{arxiv.1207.5367,
  title  = {Characterizing $L_2$Boosting},
  author = {John Ehrlinger and Hemant Ishwaran},
  journal= {arXiv preprint arXiv:1207.5367},
  year   = {2012}
}

Comments

Published in at http://dx.doi.org/10.1214/12-AOS997 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

Characterizing $L_2$Boosting

Abstract

Cite

Comments

Related papers