English

ChauBoxplot and AdaptiveBoxplot: Two R packages for boxplot-based outlier detection

Methodology 2026-03-04 v2

Abstract

Tukey's boxplot is widely used for outlier detection; however, its classic fixed-fence rule tends to flag an excessive number of outliers as the sample size grows. To address this, we introduce two new R packages, ChauBoxplot and AdaptiveBoxplot, which implement more robust and statistically principled outlier detection methods. We illustrate their advantages and practical implications through comprehensive simulation studies and a real-world analysis of provincial university admission rates from China's National College Entrance Examination. Based on these findings, we provide practical guidance to help practitioners select appropriate boxplot methods, achieving a balance between interpretability and statistical reliability.

Cite

@article{arxiv.2601.13759,
  title  = {ChauBoxplot and AdaptiveBoxplot: Two R packages for boxplot-based outlier detection},
  author = {Tiejun Tong and Hongmei Lin and Bowen Gang and Riquan Zhang},
  journal= {arXiv preprint arXiv:2601.13759},
  year   = {2026}
}

Comments

11 pages, 2 figures, 2 tables

R2 v1 2026-07-01T09:12:07.668Z