Scalable Multivariate Histograms
Computation
2021-01-01 v1 Distributed, Parallel, and Cluster Computing
Optimization and Control
Abstract
We give a distributed variant of an adaptive histogram estimation procedure previously developed by the first author. The procedure is based on regular pavings and is known to have numerous appealing statistical and arithmetical properties. The distributed version makes it possible to process data sets significantly bigger than previously. We provide prototype implementation under a permissive license.
Cite
@article{arxiv.2012.14847,
title = {Scalable Multivariate Histograms},
author = {Raazesh Sainudiin and Warwick Tucker and Tilo Wiklund},
journal= {arXiv preprint arXiv:2012.14847},
year = {2021}
}