English

Big Data and model-based survey sampling

Computation 2020-02-12 v1

Abstract

Big Data are huge amounts of digital information that are automatically accrued or merged from several sources and rarely result from properly planned surveys. A Big Dataset is herein conceived of as a collection of information concerning a finite population. We suggest selecting a sample of observations to get the inferential goal. We assume a super-population model has generated the Big Dataset. With this assumption, we can apply the theory of optimal design to draw a sample from the Big Dataset that contains the majority of the information about the unknown parameters.

Keywords

Cite

@article{arxiv.2002.04255,
  title  = {Big Data and model-based survey sampling},
  author = {Deldossi Laura and Tommasi Chiara},
  journal= {arXiv preprint arXiv:2002.04255},
  year   = {2020}
}