English

Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control

Computer Vision and Pattern Recognition 2024-10-10 v1 Graphics

Abstract

Multi-view consistency remains a challenge for image diffusion models. Even within the Text-to-Texture problem, where perfect geometric correspondences are known a priori, many methods fail to yield aligned predictions across views, necessitating non-trivial fusion methods to incorporate the results onto the original mesh. We explore this issue for a Collaborative Control workflow specifically in PBR Text-to-Texture. Collaborative Control directly models PBR image probability distributions, including normal bump maps; to our knowledge, the only diffusion model to directly output full PBR stacks. We discuss the design decisions involved in making this model multi-view consistent, and demonstrate the effectiveness of our approach in ablation studies, as well as practical applications.

Keywords

Cite

@article{arxiv.2410.06985,
  title  = {Jointly Generating Multi-view Consistent PBR Textures using Collaborative Control},
  author = {Shimon Vainer and Konstantin Kutsy and Dante De Nigris and Ciara Rowles and Slava Elizarov and Simon Donné},
  journal= {arXiv preprint arXiv:2410.06985},
  year   = {2024}
}

Comments

19 pages, 13 figures

R2 v1 2026-06-28T19:14:35.955Z