English

Kernel Discrepancy-Based Rerandomization for Controlled Experiments

Methodology 2025-11-05 v2

Abstract

This paper introduces a kernel discrepancy-based framework for rerandomization to enhance the precision of causal inference in controlled experiments. We demonstrate that the kernel discrepancy is the key part of the variance upper bound for the difference-in-means estimator, thereby establishing a theoretical rationale for its use. It quantifies the difference between empirical covariate distributions of treatment groups. We can choose a suitable kernel function and the corresponding discrepancy to accommodate simple or complex relationships between the outcome and the covariates. The proposed framework efficiently applies to any number of treatment groups, overcoming a significant limitation of existing methods. Furthermore, we develop a computationally efficient composite strategy for factorial experiments by recursively applying two- or multi-group rerandomizations. Numerical studies demonstrate that our approach significantly reduces estimator variance, with the linear kernel being optimal for linear relationships and the L2\mathcal{L}_2-discrepancy offering robust performance under model uncertainty.

Keywords

Cite

@article{arxiv.1901.08984,
  title  = {Kernel Discrepancy-Based Rerandomization for Controlled Experiments},
  author = {Yiou Li and Lulu Kang},
  journal= {arXiv preprint arXiv:1901.08984},
  year   = {2025}
}

Comments

39 pages, 7 figures, 1 table