English

Consistent View Synthesis with Pose-Guided Diffusion Models

Computer Vision and Pattern Recognition 2023-03-31 v1

Abstract

Novel view synthesis from a single image has been a cornerstone problem for many Virtual Reality applications that provide immersive experiences. However, most existing techniques can only synthesize novel views within a limited range of camera motion or fail to generate consistent and high-quality novel views under significant camera movement. In this work, we propose a pose-guided diffusion model to generate a consistent long-term video of novel views from a single image. We design an attention layer that uses epipolar lines as constraints to facilitate the association between different viewpoints. Experimental results on synthetic and real-world datasets demonstrate the effectiveness of the proposed diffusion model against state-of-the-art transformer-based and GAN-based approaches.

Keywords

Cite

@article{arxiv.2303.17598,
  title  = {Consistent View Synthesis with Pose-Guided Diffusion Models},
  author = {Hung-Yu Tseng and Qinbo Li and Changil Kim and Suhib Alsisan and Jia-Bin Huang and Johannes Kopf},
  journal= {arXiv preprint arXiv:2303.17598},
  year   = {2023}
}

Comments

CVPR 2023. Project page: https://poseguided-diffusion.github.io/

R2 v1 2026-06-28T09:41:53.415Z