Related papers: Annotated Hands for Generative Models

Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation

Recent years have seen significant progress in human image generation, particularly with the advancements in diffusion models. However, existing diffusion methods encounter challenges when producing consistent hand anatomy and the generated…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Anton Pelykh , Ozge Mercanoglu Sincan , Richard Bowden

Human Annotations Improve GAN Performances

Generative Adversarial Networks (GANs) have shown great success in many applications. In this work, we present a novel method that leverages human annotations to improve the quality of generated images. Unlike previous paradigms that…

Computer Vision and Pattern Recognition · Computer Science 2019-11-18 Juanyong Duan , Sim Heng Ong , Qi Zhao

HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances

Text-to-image generative models can generate high-quality humans, but realism is lost when generating hands. Common artifacts include irregular hand poses, shapes, incorrect numbers of fingers, and physically implausible finger…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Supreeth Narasimhaswamy , Uttaran Bhattacharya , Xiang Chen , Ishita Dasgupta , Saayan Mitra , Minh Hoai

An unobtrusive quality supervision approach for medical image annotation

Image annotation is one essential prior step to enable data-driven algorithms. In medical imaging, having large and reliably annotated data sets is crucial to recognize various diseases robustly. However, annotator performance varies…

Image and Video Processing · Electrical Eng. & Systems 2022-11-23 Sonja Kunzmann , Mathias Öttl , Prathmesh Madhu , Felix Denzinger , Andreas Maier

Improving image generative models with human interactions

GANs provide a framework for training generative models which mimic a data distribution. However, in many cases we wish to train these generative models to optimize some auxiliary objective function within the data it generates, such as…

Computer Vision and Pattern Recognition · Computer Science 2017-10-02 Andrew Kyle Lampinen , David So , Douglas Eck , Fred Bertsch

Generating Images with 3D Annotations Using Diffusion Models

Diffusion models have emerged as a powerful generative method, capable of producing stunning photo-realistic images from natural language descriptions. However, these models lack explicit control over the 3D structure in the generated…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Wufei Ma , Qihao Liu , Jiahao Wang , Angtian Wang , Xiaoding Yuan , Yi Zhang , Zihao Xiao , Guofeng Zhang , Beijia Lu , Ruxiao Duan , Yongrui Qi , Adam Kortylewski , Yaoyao Liu , Alan Yuille

Latent Diffusion for Guided Document Table Generation

Obtaining annotated table structure data for complex tables is a challenging task due to the inherent diversity and complexity of real-world document layouts. The scarcity of publicly available datasets with comprehensive annotations for…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Syed Jawwad Haider Hamdani , Saifullah Saifullah , Stefan Agne , Andreas Dengel , Sheraz Ahmed

Conditional Image Generation with Pretrained Generative Model

In recent years, diffusion models have gained popularity for their ability to generate higher-quality images in comparison to GAN models. However, like any other large generative models, these models require a huge amount of data,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Rajesh Shrestha , Bowen Xie

HandsOff: Labeled Dataset Generation With No Additional Human Annotations

Recent work leverages the expressive power of generative adversarial networks (GANs) to generate labeled synthetic datasets. These dataset generation methods often require new annotations of synthetic images, which forces practitioners to…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Austin Xu , Mariya I. Vasileva , Achal Dave , Arjun Seshadri

Improving novelty detection with generative adversarial networks on hand gesture data

We propose a novel way of solving the issue of classification of out-of-vocabulary gestures using Artificial Neural Networks (ANNs) trained in the Generative Adversarial Network (GAN) framework. A generative model augments the data set in…

Machine Learning · Computer Science 2023-04-14 Miguel Simão , Pedro Neto , Olivier Gibaru

GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models

The rapid advancement in image generation models has predominantly been driven by diffusion models, which have demonstrated unparalleled success in generating high-fidelity, diverse images from textual prompts. Despite their success,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Yusuf Dalva , Hidir Yesiltepe , Pinar Yanardag

FoundHand: Large-Scale Domain-Specific Learning for Controllable Hand Image Generation

Despite remarkable progress in image generation models, generating realistic hands remains a persistent challenge due to their complex articulation, varying viewpoints, and frequent occlusions. We present FoundHand, a large-scale…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Kefan Chen , Chaerin Min , Linguang Zhang , Shreyas Hampali , Cem Keskin , Srinath Sridhar

Generating Realistic Training Images Based on Tonality-Alignment Generative Adversarial Networks for Hand Pose Estimation

Hand pose estimation from a monocular RGB image is an important but challenging task. The main factor affecting its performance is the lack of a sufficiently large training dataset with accurate hand-keypoint annotations. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2020-12-15 Liangjian Chen , Shih-Yao Lin , Yusheng Xie , Hui Tang , Yufan Xue , Xiaohui Xie , Yen-Yu Lin , Wei Fan

InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion

We present InterHandGen, a novel framework that learns the generative prior of two-hand interaction. Sampling from our model yields plausible and diverse two-hand shapes in close interaction with or without an object. Our prior can be…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Jihyun Lee , Shunsuke Saito , Giljoo Nam , Minhyuk Sung , Tae-Kyun Kim

A Tiered GAN Approach for Monet-Style Image Generation

Generative Adversarial Networks (GANs) have proven to be a powerful tool in generating artistic images, capable of mimicking the styles of renowned painters, such as Claude Monet. This paper introduces a tiered GAN model to progressively…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 FNU Neha , Deepshikha Bhati , Deepak Kumar Shukla , Md Amiruzzaman

Attributing Fake Images to GANs: Learning and Analyzing GAN Fingerprints

Recent advances in Generative Adversarial Networks (GANs) have shown increasing success in generating photorealistic images. But they also raise challenges to visual forensics and model attribution. We present the first study of learning…

Computer Vision and Pattern Recognition · Computer Science 2019-08-19 Ning Yu , Larry Davis , Mario Fritz

Affordance Diffusion: Synthesizing Hand-Object Interactions

Recent successes in image synthesis are powered by large-scale diffusion models. However, most methods are currently limited to either text- or image-conditioned generation for synthesizing an entire image, texture transfer or inserting…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Yufei Ye , Xueting Li , Abhinav Gupta , Shalini De Mello , Stan Birchfield , Jiaming Song , Shubham Tulsiani , Sifei Liu

MM-Hand: 3D-Aware Multi-Modal Guided Hand Generative Network for 3D Hand Pose Synthesis

Estimating the 3D hand pose from a monocular RGB image is important but challenging. A solution is training on large-scale RGB hand images with accurate 3D hand keypoint annotations. However, it is too expensive in practice. Instead, we…

Computer Vision and Pattern Recognition · Computer Science 2020-10-06 Zhenyu Wu , Duc Hoang , Shih-Yao Lin , Yusheng Xie , Liangjian Chen , Yen-Yu Lin , Zhangyang Wang , Wei Fan

DeLiGAN : Generative Adversarial Networks for Diverse and Limited Data

A class of recent approaches for generating images, called Generative Adversarial Networks (GAN), have been used to generate impressively realistic images of objects, bedrooms, handwritten digits and a variety of other image modalities.…

Computer Vision and Pattern Recognition · Computer Science 2017-06-08 Swaminathan Gurumurthy , Ravi Kiran Sarvadevabhatla , Venkatesh Babu Radhakrishnan

On Leveraging Pretrained GANs for Generation with Limited Data

Recent work has shown generative adversarial networks (GANs) can generate highly realistic images, that are often indistinguishable (by humans) from real images. Most images so generated are not contained in the training dataset, suggesting…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Miaoyun Zhao , Yulai Cong , Lawrence Carin