Related papers: Patch-enhanced Mask Encoder Prompt Image Generatio…

RefAdGen: High-Fidelity Advertising Image Generation

The rapid advancement of Artificial Intelligence Generated Content (AIGC) techniques has unlocked opportunities in generating diverse and compelling advertising images based on referenced product images and textual scene descriptions. This…

Graphics · Computer Science 2025-08-19 Yiyun Chen, Weikai Yang

Mask Embedding in conditional GAN for Guided Synthesis of High Resolution Images

Recent advancements in conditional Generative Adversarial Networks (cGANs) have shown promises in label guided image synthesis. Semantic masks, such as sketches and label maps, are another intuitive and effective form of guidance in image…

Computer Vision and Pattern Recognition · Computer Science 2019-07-04 Yinhao Ren, Zhe Zhu, Yingzhou Li, Joseph Lo

Generative Visual Compression: A Review

Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the acquisition of digital content and impelling the progress of visual compression towards competitive performance gains and diverse functionalities…

Computer Vision and Pattern Recognition · Computer Science 2024-02-07 Bolin Chen, Shanzhi Yin, Peilin Chen, Shiqi Wang, Yan Ye

Scalable AI Generative Content for Vehicular Network Semantic Communication

Perceiving vehicles in a driver's blind spot is vital for safe driving. The detection of potentially dangerous vehicles in these blind spots can benefit from vehicular network semantic communication technology. However, efficient semantic…

Artificial Intelligence · Computer Science 2023-11-27 Hao Feng, Yi Yang, Zhu Han

PatchCraft: Exploring Texture Patch for Efficient AI-generated Image Detection

Recent generative models show impressive performance in generating photographic images. Humans can hardly distinguish such incredibly realistic-looking AI-generated images from real ones. AI-generated images may lead to ubiquitous…

Computer Vision and Pattern Recognition · Computer Science 2024-03-08 Nan Zhong, Yiran Xu, Sheng Li, Zhenxing Qian, Xinpeng Zhang

PointCG: Self-supervised Point Cloud Learning via Joint Completion and Generation

The core of self-supervised point cloud learning lies in setting up appropriate pretext tasks, to construct a pre-training framework that enables the encoder to perceive 3D objects effectively. In this paper, we integrate two prevalent…

Computer Vision and Pattern Recognition · Computer Science 2025-04-07 Yun Liu, Peng Li, Xuefeng Yan, Liangliang Nan, Bing Wang, Honghua Chen, Lina Gong, Wei Zhao, Mingqiang Wei

Iterative Facial Image Inpainting Based on an Encoder-Generator Architecture

Facial image inpainting is a challenging problem as it requires generating new pixels that include semantic information for masked key components in a face, e.g., eyes and nose. Recently, remarkable methods have been proposed in this field.…

Image and Video Processing · Electrical Eng. & Systems 2022-02-15 Yahya Dogan, Hacer Yalim Keles

Semantics-Guided Generative Image Compression

Advancements in text-to-image generative AI with large multimodal models are spreading into the field of image compression, creating high-quality representation of images at extremely low bit rates. This work introduces novel components to…

Image and Video Processing · Electrical Eng. & Systems 2025-06-02 Cheng-Lin Wu, Hyomin Choi, Ivan V. Bajić

MAGIC: Mask-Guided Image Synthesis by Inverting a Quasi-Robust Classifier

We offer a method for one-shot mask-guided image synthesis that allows controlling manipulations of a single image by inverting a quasi-robust classifier equipped with strong regularizers. Our proposed method, entitled MAGIC, leverages…

Computer Vision and Pattern Recognition · Computer Science 2023-07-03 Mozhdeh Rouhsedaghat, Masoud Monajatipoor, C.-C. Jay Kuo, Iacopo Masi

Mask-Guided Portrait Editing with Conditional GANs

Portrait editing is a popular subject in photo manipulation. The Generative Adversarial Network (GAN) advances the generating of realistic faces and allows more face editing. In this paper, we argue about three issues in existing…

Computer Vision and Pattern Recognition · Computer Science 2019-05-27 Shuyang Gu, Jianmin Bao, Hao Yang, Dong Chen, Fang Wen, Lu Yuan

PCQA: A Strong Baseline for AIGC Quality Assessment Based on Prompt Condition

The development of Large Language Models (LLM) and Diffusion Models brings the boom of Artificial Intelligence Generated Content (AIGC). It is essential to build an effective quality assessment framework to provide a quantifiable evaluation…

Computer Vision and Pattern Recognition · Computer Science 2024-04-23 Xi Fang, Weigang Wang, Xiaoxin Lv, Jun Yan

MCGM: Mask Conditional Text-to-Image Generative Model

Recent advancements in generative models have revolutionized the field of artificial intelligence, enabling the creation of highly-realistic and detailed images. In this study, we propose a novel Mask Conditional Text-to-Image Generative…

Computer Vision and Pattern Recognition · Computer Science 2024-10-02 Rami Skaik, Leonardo Rossi, Tomaso Fontanini, Andrea Prati

Detecting AI-Generated Images via Contextual Anomaly Estimation in Masked AutoEncoders

Context-based detection methods such as DetectGPT achieve strong generalization in identifying AI-generated text by evaluating content compatibility with a model's learned distribution. In contrast, existing image detectors rely on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Minsuk Jang, Hyunseo Jeong, Minseok Son, Changick Kim

Trinity Detector: text-assisted and attention mechanisms based spectral fusion for diffusion generation image detection

Artificial Intelligence Generated Content (AIGC) techniques, represented by text-to-image generation, have led to a malicious use of deep forgeries, raising concerns about the trustworthiness of multimedia content. Adapting traditional…

Computer Vision and Pattern Recognition · Computer Science 2024-04-29 Jiawei Song, Dengpan Ye, Yunming Zhang

MaskGIT: Masked Generative Image Transformer

Generative transformers have experienced rapid popularity growth in the computer vision community in synthesizing high-fidelity and high-resolution images. The best generative transformer models so far, however, still treat an image naively…

Computer Vision and Pattern Recognition · Computer Science 2022-02-10 Huiwen Chang, Han Zhang, Lu Jiang, Ce Liu, William T. Freeman

Fine-grained Semantic Constraint in Image Synthesis

In this paper, we propose a multi-stage and high-resolution model for image synthesis that uses fine-grained attributes and masks as input. With a fine-grained attribute, the proposed model can detailedly constrain the features of the…

Computer Vision and Pattern Recognition · Computer Science 2021-01-13 Pengyang Li, Donghui Wang

AGIC: Attention-Guided Image Captioning to Improve Caption Relevance

Despite significant progress in image captioning, generating accurate and descriptive captions remains a long-standing challenge. In this study, we propose Attention-Guided Image Captioning (AGIC), which amplifies salient visual regions…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 L. D. M. S. Sai Teja, Ashok Urlana, Pruthwik Mishra

Mask Guided Attention For Fine-Grained Patchy Image Classification

In this work, we present a novel mask guided attention (MGA) method for fine-grained patchy image classification. The key challenge of fine-grained patchy image classification lies in two folds, ultra-fine-grained inter-category variances…

Computer Vision and Pattern Recognition · Computer Science 2021-09-23 Jun Wang, Xiaohan Yu, Yongsheng Gao

MagGAN: High-Resolution Face Attribute Editing with Mask-Guided Generative Adversarial Network

We present Mask-guided Generative Adversarial Network (MagGAN) for high-resolution face attribute editing, in which semantic facial masks from a pre-trained face parser are used to guide the fine-grained image editing process. With the…

Computer Vision and Pattern Recognition · Computer Science 2020-10-06 Yi Wei, Zhe Gan, Wenbo Li, Siwei Lyu, Ming-Ching Chang, Lei Zhang, Jianfeng Gao, Pengchuan Zhang

Fast Training of Diffusion Models with Masked Transformers

We propose an efficient approach to train large diffusion models with masked transformers. While masked transformers have been extensively explored for representation learning, their application to generative learning is less explored in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Hongkai Zheng, Weili Nie, Arash Vahdat, Anima Anandkumar