English

Optimizing Computational-Statistical Runtime for Wasserstein Distance Estimation

Machine Learning 2026-05-20 v1 Computational Complexity Machine Learning

Abstract

Squared Wasserstein distance is a frequently used tool to measure discrepancy between probability distributions. This distance is typically computed between empirical measures of size nn from two underlying random samples. Unfortunately, even in lower dimensional Euclidean space problems (d{2,3})\left( d \in \{2,3\} \right), algorithms for Wasserstein distance computation with approximate or exact precision guarantees scale poorly in the runtime as a function of nn and the desired precision. In response, we consider the computational-statistical runtime, where the goal is to estimate from samples the Wasserstein distance between potentially smooth measures up to ϵ\epsilon-additive error in expectation with respect to the sampling; we allow O(1)O(1) computational cost for collecting a sample. Towards this, we develop a Sample-Sketch-Solve paradigm where we introduce a regular cartesian grid sketch of the samples. We show that (especially under α\alpha-H\"older smooth distributions) this can compress the data without increasing asymptotic error, and also regularizes the structure which enables faster exact algorithms. Ultimately, we approximate W22(P,Q)W_2^2(P,Q) within ϵ\epsilon error in ϵmax(2,d+1+o(1)1+α)\epsilon^{-\max(2,\frac{d+1+o(1)}{1+\alpha})} time for 0<α<10 < \alpha < 1 H\"older smooth distributions P,QP,Q on (0,1)d(0,1)^{d}; an optimal Θ(ϵ2)\Theta(\epsilon^{-2}) for α>1/2\alpha > 1/2 when d=2d=2 and nearly optimal as α1\alpha \to 1 when d=3d = 3.

Keywords

Cite

@article{arxiv.2605.20122,
  title  = {Optimizing Computational-Statistical Runtime for Wasserstein Distance Estimation},
  author = {Peter Matthew Jacobs and Jeff M. Phillips},
  journal= {arXiv preprint arXiv:2605.20122},
  year   = {2026}
}