English

Batch learning equals online learning in Bayesian supervised learning

Statistics Theory 2026-05-22 v5 Statistics Theory

Abstract

In this paper we study Bayesian supervised learning models proposed by L\^e in \cite{Le2025}. We show the existence of Bayesian inversions on universal Bayesian supervised learning models (P(Y)X,μ,IdP(Y)X,P(Y)X(\mathcal{P}(\mathcal{Y})^{\mathcal{X}}, \mu, \mathrm{Id}_{\mathcal{P}(\mathcal{Y})^{\mathcal{X}}}, \mathcal{P}(\mathcal{Y})^{\mathcal{X}} for arbitrary input space X\mathcal{X}, Souslin label space Y\mathcal{Y}, and prior probability measure μP(P(Y)X)\mu \in \mathcal{P}( \mathcal{P}(\mathcal{Y})^{\mathcal{X}}). Using functoriality of probabilistic morphisms, we prove that sequential and batch Bayesian inversions coincide in supervised learning models with conditionally independent (possibly non-i.i.d.) data \cite{Le2025}. This equivalence holds without domination or discreteness assumptions on sampling operators. We derive a recursive formula for posterior predictive distributions, which reduces to the Kalman filter in Gaussian process regression. For Souslin label spaces Y\mathcal{Y} and arbitrary input sets X\mathcal{X}, we characterize probability measures on P(Y)X\mathcal{P}(\mathcal{Y})^{\mathcal{X}} via projective systems, generalizing Orbanz \cite{Orbanz2011}. We revisit MacEachern's Dependent Dirichlet Processes (DDP) \cite{MacEachern2000} using copula-based constructions \cite{BJQ2012} and show how to compute posterior predictive distributions in universal Bayesian supervised models with DDP priors.

Keywords

Cite

@article{arxiv.2510.16892,
  title  = {Batch learning equals online learning in Bayesian supervised learning},
  author = {Hông Vân Lê},
  journal= {arXiv preprint arXiv:2510.16892},
  year   = {2026}
}

Comments

Version 5: T. 31 pages, a chracterization of probability measures on $\mathcal{P}(\mathcal{Y})^{\mathcal{X}}$ extended to Souslin spaces (Theorem 5.4), typo correction in Subsection 6.2

R2 v1 2026-07-01T06:45:55.117Z