Nonparametric Bayesian Aggregation for Massive Data

Zuofeng Shang; Botao Hao; Guang Cheng

Nonparametric Bayesian Aggregation for Massive Data

Statistics Theory 2019-09-05 v3 Statistics Theory

Authors: Zuofeng Shang , Botao Hao , Guang Cheng

Abstract

We develop a set of scalable Bayesian inference procedures for a general class of nonparametric regression models. Specifically, nonparametric Bayesian inferences are separately performed on each subset randomly split from a massive dataset, and then the obtained local results are aggregated into global counterparts. This aggregation step is explicit without involving any additional computation cost. By a careful partition, we show that our aggregated inference results obtain an oracle rule in the sense that they are equivalent to those obtained directly from the entire data (which are computationally prohibitive). For example, an aggregated credible ball achieves desirable credibility level and also frequentist coverage while possessing the same radius as the oracle ball.

Keywords

bayesian inference statistical inference nonparametric regression

Cite

@article{arxiv.1508.04175,
  title  = {Nonparametric Bayesian Aggregation for Massive Data},
  author = {Zuofeng Shang and Botao Hao and Guang Cheng},
  journal= {arXiv preprint arXiv:1508.04175},
  year   = {2019}
}

Comments

To appear in Journal of Machine Learning Research. arXiv admin note: text overlap with arXiv:1411.3686

Nonparametric Bayesian Aggregation for Massive Data

Abstract

Keywords

Cite

Comments

Related papers