English

Forecasting Generative Amplification

High Energy Physics - Phenomenology 2025-10-17 v3 Machine Learning

Abstract

Generative networks are perfect tools to enhance the speed and precision of LHC simulations. It is important to understand their statistical precision, especially when generating events beyond the size of the training dataset. We present two complementary methods to estimate the amplification factor without large holdout datasets. Averaging amplification uses Bayesian networks or ensembling to estimate amplification from the precision of integrals over given phase-space volumes. Differential amplification uses hypothesis testing to quantify amplification without any resolution loss. Applied to state-of-the-art event generators, both methods indicate that amplification is possible in specific regions of phase space, but not yet across the entire distribution.

Keywords

Cite

@article{arxiv.2509.08048,
  title  = {Forecasting Generative Amplification},
  author = {Henning Bahl and Sascha Diefenbacher and Nina Elmer and Tilman Plehn and Jonas Spinner},
  journal= {arXiv preprint arXiv:2509.08048},
  year   = {2025}
}

Comments

23 pages, 15 figures. v2: added link to github repo, extended acknowledgements. v3: updated conventions and refined text, now 25 pages

R2 v1 2026-07-01T05:29:00.624Z