Related papers: MureObjectStitch: Multi-reference Image Compositio…

CareCom: Generative Image Composition with Calibrated Reference Features

Image composition aims to seamlessly insert foreground object into background. Despite the huge progress in generative image composition, the existing methods are still struggling with simultaneous detail preservation and foreground…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Jiaxuan Chen , Bo Zhang , Qingdong He , Jinlong Peng , Li Niu

OSInsert: Towards High-authenticity and High-fidelity Image Composition

Generative image composition aims to regenerate the given foreground object in the background image to produce a realistic composite image. Some high-authenticity methods can adjust foreground pose/view to be compatible with background,…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Jingyuan Wang , Li Niu

DreamCom: Finetuning Text-guided Inpainting Model for Image Composition

The goal of image composition is merging a foreground object into a background image to obtain a realistic composite image. Recently, generative composition methods are built on large pretrained diffusion models, due to their unprecedented…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Lingxiao Lu , Jiangtong Li , Bo Zhang , Li Niu

PostureObjectstitch: Anomaly Image Generation Considering Assembly Relationships in Industrial Scenarios

Image generation technology can synthesize condition-specific images to supplement real-world industrial anomaly data and enhance anomaly detection model performance. Existing generation techniques rarely account for the pose and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Zebei Tong , Hongchang Chen , Yujie Lei , Gang Chen , Yushi Liu , Zhi Zheng , Hao Chen , Jieming Zhang , Ying Li , Dongpu Cao

ControlCom: Controllable Image Composition using Diffusion Model

Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Bo Zhang , Yuxuan Duan , Jun Lan , Yan Hong , Huijia Zhu , Weiqiang Wang , Li Niu

Making Images Real Again: A Comprehensive Survey on Deep Image Composition

As a common image editing operation, image composition (object insertion) aims to combine the foreground from one image and another background image, to produce a composite image. However, there are many issues that could make the composite…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Li Niu , Wenyan Cong , Liu Liu , Yan Hong , Bo Zhang , Jing Liang , Liqing Zhang

ObjectStitch: Generative Object Compositing

Object compositing based on 2D images is a challenging problem since it typically involves multiple processing stages such as color harmonization, geometry correction and shadow generation to generate realistic results. Furthermore,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Yizhi Song , Zhifei Zhang , Zhe Lin , Scott Cohen , Brian Price , Jianming Zhang , Soo Ye Kim , Daniel Aliaga

Generative Panoramic Image Stitching

We introduce the task of generative panoramic image stitching, which aims to synthesize seamless panoramas that are faithful to the content of multiple reference images containing parallax effects and strong variations in lighting, camera…

Graphics · Computer Science 2025-07-11 Mathieu Tuli , Kaveh Kamali , David B. Lindell

Infusing Definiteness into Randomness: Rethinking Composition Styles for Deep Image Matting

We study the composition style in deep image matting, a notion that characterizes a data generation flow on how to exploit limited foregrounds and random backgrounds to form a training dataset. Prior art executes this flow in a completely…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Zixuan Ye , Yutong Dai , Chaoyi Hong , Zhiguo Cao , Hao Lu

Image Harmonization by Matching Regional References

To achieve visual consistency in composite images, recent image harmonization methods typically summarize the appearance pattern of global background and apply it to the global foreground without location discrepancy. However, for a real…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Ziyue Zhu , Zhao Zhang , Zheng Lin , Ruiqi Wu , Zhi Chai , Chun-Le Guo

Reference-based Image Composition with Sketch via Structure-aware Diffusion Model

Recent remarkable improvements in large-scale text-to-image generative models have shown promising results in generating high-fidelity images. To further enhance editability and enable fine-grained generation, we introduce a…

Computer Vision and Pattern Recognition · Computer Science 2023-04-20 Kangyeol Kim , Sunghyun Park , Junsoo Lee , Jaegul Choo

Thinking the Fusion Strategy of Multi-reference Face Reenactment

In recent advances of deep generative models, face reenactment -manipulating and controlling human face, including their head movement-has drawn much attention for its wide range of applicability. Despite its strong expressiveness, it is…

Computer Vision and Pattern Recognition · Computer Science 2022-02-23 Takuya Yashima , Takuya Narihira , Tamaki Kojima

Where and Who? Automatic Semantic-Aware Person Composition

Image compositing is a method used to generate realistic yet fake imagery by inserting contents from one image to another. Previous work in compositing has focused on improving appearance compatibility of a user selected foreground segment…

Graphics · Computer Science 2017-12-05 Fuwen Tan , Crispin Bernier , Benjamin Cohen , Vicente Ordonez , Connelly Barnes

Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation

Personalized text-to-image generation methods can generate customized images based on the reference images, which have garnered wide research interest. Recent methods propose a finetuning-free approach with a decoupled cross-attention…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Qihan Huang , Siming Fu , Jinlong Liu , Hao Jiang , Yipeng Yu , Jie Song

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

Recent text-to-image generative models can generate high-fidelity images from text prompts. However, these models struggle to consistently generate the same objects in different contexts with the same appearance. Consistent object…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Alec Helbling , Evan Montoya , Duen Horng Chau

Diverse Image Harmonization

Image harmonization aims to adjust the foreground illumination in a composite image to make it harmonious. The existing harmonization methods can only produce one deterministic result for a composite image, ignoring that a composite image…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Xinhao Tao , Tianyuan Qiu , Junyan Cao , Li Niu

FreeCompose: Generic Zero-Shot Image Composition with Diffusion Prior

We offer a novel approach to image composition, which integrates multiple input images into a single, coherent image. Rather than concentrating on specific use cases such as appearance editing (image harmonization) or semantic editing…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Zhekai Chen , Wen Wang , Zhen Yang , Zeqing Yuan , Hao Chen , Chunhua Shen

Generative Photomontage

Text-to-image models are powerful tools for image creation. However, the generation process is akin to a dice roll and makes it difficult to achieve a single image that captures everything a user wants. In this paper, we propose a framework…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Sean J. Liu , Nupur Kumari , Ariel Shamir , Jun-Yan Zhu

Fine-grained Image-to-Image Transformation towards Visual Recognition

Existing image-to-image transformation approaches primarily focus on synthesizing visually pleasing data. Generating images with correct identity labels is challenging yet much less explored. It is even more challenging to deal with image…

Computer Vision and Pattern Recognition · Computer Science 2020-06-16 Wei Xiong , Yutong He , Yixuan Zhang , Wenhan Luo , Lin Ma , Jiebo Luo

NEUCORE: Neural Concept Reasoning for Composed Image Retrieval

Composed image retrieval which combines a reference image and a text modifier to identify the desired target image is a challenging task, and requires the model to comprehend both vision and language modalities and their interactions.…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Shu Zhao , Huijuan Xu