English

Optical Diffusion Models for Image Generation

Optics 2024-11-01 v2 Computer Vision and Pattern Recognition Machine Learning

Abstract

Diffusion models generate new samples by progressively decreasing the noise from the initially provided random distribution. This inference procedure generally utilizes a trained neural network numerous times to obtain the final output, creating significant latency and energy consumption on digital electronic hardware such as GPUs. In this study, we demonstrate that the propagation of a light beam through a semi-transparent medium can be programmed to implement a denoising diffusion model on image samples. This framework projects noisy image patterns through passive diffractive optical layers, which collectively only transmit the predicted noise term in the image. The optical transparent layers, which are trained with an online training approach, backpropagating the error to the analytical model of the system, are passive and kept the same across different steps of denoising. Hence this method enables high-speed image generation with minimal power consumption, benefiting from the bandwidth and energy efficiency of optical information processing.

Keywords

Cite

@article{arxiv.2407.10897,
  title  = {Optical Diffusion Models for Image Generation},
  author = {Ilker Oguz and Niyazi Ulas Dinc and Mustafa Yildirim and Junjie Ke and Innfarn Yoo and Qifei Wang and Feng Yang and Christophe Moser and Demetri Psaltis},
  journal= {arXiv preprint arXiv:2407.10897},
  year   = {2024}
}

Comments

17 pages, 10 figures

R2 v1 2026-06-28T17:41:35.334Z