English

GAN-Supervised Dense Visual Alignment

Computer Vision and Pattern Recognition 2022-04-06 v2

Abstract

We propose GAN-Supervised Learning, a framework for learning discriminative models and their GAN-generated training data jointly end-to-end. We apply our framework to the dense visual alignment problem. Inspired by the classic Congealing method, our GANgealing algorithm trains a Spatial Transformer to map random samples from a GAN trained on unaligned data to a common, jointly-learned target mode. We show results on eight datasets, all of which demonstrate our method successfully aligns complex data and discovers dense correspondences. GANgealing significantly outperforms past self-supervised correspondence algorithms and performs on-par with (and sometimes exceeds) state-of-the-art supervised correspondence algorithms on several datasets -- without making use of any correspondence supervision or data augmentation and despite being trained exclusively on GAN-generated data. For precise correspondence, we improve upon state-of-the-art supervised methods by as much as 3×3\times. We show applications of our method for augmented reality, image editing and automated pre-processing of image datasets for downstream GAN training.

Keywords

Cite

@article{arxiv.2112.05143,
  title  = {GAN-Supervised Dense Visual Alignment},
  author = {William Peebles and Jun-Yan Zhu and Richard Zhang and Antonio Torralba and Alexei A. Efros and Eli Shechtman},
  journal= {arXiv preprint arXiv:2112.05143},
  year   = {2022}
}

Comments

An updated version of our CVPR 2022 paper (oral); v2 features additional references and minor text changes. Code available at https://www.github.com/wpeebles/gangealing . Project page and videos available at https://www.wpeebles.com/gangealing

R2 v1 2026-06-24T08:11:20.082Z