English

Learning Gaze-aware Compositional GAN

Computer Vision and Pattern Recognition 2024-06-03 v1 Artificial Intelligence

Abstract

Gaze-annotated facial data is crucial for training deep neural networks (DNNs) for gaze estimation. However, obtaining these data is labor-intensive and requires specialized equipment due to the challenge of accurately annotating the gaze direction of a subject. In this work, we present a generative framework to create annotated gaze data by leveraging the benefits of labeled and unlabeled data sources. We propose a Gaze-aware Compositional GAN that learns to generate annotated facial images from a limited labeled dataset. Then we transfer this model to an unlabeled data domain to take advantage of the diversity it provides. Experiments demonstrate our approach's effectiveness in generating within-domain image augmentations in the ETH-XGaze dataset and cross-domain augmentations in the CelebAMask-HQ dataset domain for gaze estimation DNN training. We also show additional applications of our work, which include facial image editing and gaze redirection.

Keywords

Cite

@article{arxiv.2405.20643,
  title  = {Learning Gaze-aware Compositional GAN},
  author = {Nerea Aranjuelo and Siyu Huang and Ignacio Arganda-Carreras and Luis Unzueta and Oihana Otaegui and Hanspeter Pfister and Donglai Wei},
  journal= {arXiv preprint arXiv:2405.20643},
  year   = {2024}
}

Comments

Accepted by ETRA 2024 as Full paper, and as journal paper in Proceedings of the ACM on Computer Graphics and Interactive Techniques

R2 v1 2026-06-28T16:48:08.539Z