English

Gaussian model selection with an unknown variance

Statistics Theory 2009-04-03 v3 Statistics Theory

Abstract

Let YY be a Gaussian vector whose components are independent with a common unknown variance. We consider the problem of estimating the mean μ\mu of YY by model selection. More precisely, we start with a collection S={Sm,mM}\mathcal{S}=\{S_m,m\in\mathcal{M}\} of linear subspaces of Rn\mathbb{R}^n and associate to each of these the least-squares estimator of μ\mu on SmS_m. Then, we use a data driven penalized criterion in order to select one estimator among these. Our first objective is to analyze the performance of estimators associated to classical criteria such as FPE, AIC, BIC and AMDL. Our second objective is to propose better penalties that are versatile enough to take into account both the complexity of the collection S\mathcal{S} and the sample size. Then we apply those to solve various statistical problems such as variable selection, change point detections and signal estimation among others. Our results are based on a nonasymptotic risk bound with respect to the Euclidean loss for the selected estimator. Some analogous results are also established for the Kullback loss.

Keywords

Cite

@article{arxiv.math/0701250,
  title  = {Gaussian model selection with an unknown variance},
  author = {Yannick Baraud and Christophe Giraud and Sylvie Huet},
  journal= {arXiv preprint arXiv:math/0701250},
  year   = {2009}
}

Comments

Published in at http://dx.doi.org/10.1214/07-AOS573 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)