English

ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models

Machine Learning 2026-04-30 v3 Artificial Intelligence Computer Vision and Pattern Recognition Graphics

Abstract

In this paper, we study an under-explored but important factor of diffusion generative models, i.e., the combinatorial complexity. Data samples are generally high-dimensional, and for various structured generation tasks, additional attributes are combined to associate with data samples. We show that the space spanned by the combination of dimensions and attributes can be insufficiently covered by existing training schemes of diffusion generative models, potentially limiting test time performance. We present a simple fix to this problem by constructing stochastic processes that fully exploit the combinatorial structures, hence the name ComboStoc. Using this simple strategy, we show that network training is significantly accelerated across diverse data modalities, including images and 3D structured shapes. Moreover, ComboStoc enables a new way of test time generation which uses asynchronous time steps for different dimensions and attributes, thus allowing for varying degrees of control over them. Our code is available at: https://github.com/Xrvitd/ComboStoc

Keywords

Cite

@article{arxiv.2405.13729,
  title  = {ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models},
  author = {Rui Xu and Jiepeng Wang and Hao Pan and Yang Liu and Xin Tong and Shiqing Xin and Changhe Tu and Taku Komura and Wenping Wang},
  journal= {arXiv preprint arXiv:2405.13729},
  year   = {2026}
}

Comments

ACM Transactions on Graphics, SIGGRAPH 2026