Related papers: ObjectStitch: Generative Object Compositing

Thinking Outside the BBox: Unconstrained Generative Object Compositing

Compositing an object into an image involves multiple non-trivial sub-tasks such as object placement and scaling, color/lighting harmonization, viewpoint/geometry adjustment, and shadow/reflection generation. Recent generative image…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Gemma Canet Tarrés , Zhe Lin , Zhifei Zhang , Jianming Zhang , Yizhi Song , Dan Ruta , Andrew Gilbert , John Collomosse , Soo Ye Kim

ObjectComposer: Consistent Generation of Multiple Objects Without Fine-tuning

Recent text-to-image generative models can generate high-fidelity images from text prompts. However, these models struggle to consistently generate the same objects in different contexts with the same appearance. Consistent object…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Alec Helbling , Evan Montoya , Duen Horng Chau

IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation

Generative object compositing emerges as a promising new avenue for compositional image editing. However, the requirement of object identity preservation poses a significant challenge, limiting practical usage of most existing methods. In…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Yizhi Song , Zhifei Zhang , Zhe Lin , Scott Cohen , Brian Price , Jianming Zhang , Soo Ye Kim , He Zhang , Wei Xiong , Daniel Aliaga

CatalogStitch: Dimension-Aware and Occlusion-Preserving Object Compositing for Catalog Image Generation

Generative object compositing methods have shown remarkable ability to seamlessly insert objects into scenes. However, when applied to real-world catalog image generation, these methods require tedious manual intervention: users must…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Sanyam Jain , Pragya Kandari , Manit Singhal , He Zhang , Soo Ye Kim

3D Object Manipulation in a Single Image using Generative Models

Object manipulation in images aims to not only edit the object's presentation but also gift objects with motion. Previous methods encountered challenges in concurrently handling static editing and dynamic generation, while also struggling…

Computer Vision and Pattern Recognition · Computer Science 2025-01-23 Ruisi Zhao , Zechuan Zhang , Zongxin Yang , Yi Yang

MultiShadow: Multi-Object Shadow Generation for Image Compositing via Diffusion Model

Realistic shadow generation is crucial for achieving seamless image compositing, yet existing methods primarily focus on single-object insertion and often fail to generalize when multiple foreground objects are composited into a background…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Waqas Ahmed , Dean Diepeveen , Ferdous Sohel

PostureObjectstitch: Anomaly Image Generation Considering Assembly Relationships in Industrial Scenarios

Image generation technology can synthesize condition-specific images to supplement real-world industrial anomaly data and enhance anomaly detection model performance. Existing generation techniques rarely account for the pose and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Zebei Tong , Hongchang Chen , Yujie Lei , Gang Chen , Yushi Liu , Zhi Zheng , Hao Chen , Jieming Zhang , Ying Li , Dongpu Cao

Emergence of Object Segmentation in Perturbed Generative Models

We introduce a novel framework to build a model that can learn how to segment objects from a collection of images without any human annotation. Our method builds on the observation that the location of object segments can be perturbed…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Adam Bielski , Paolo Favaro

Object-Centric Relational Representations for Image Generation

Conditioning image generation on specific features of the desired output is a key ingredient of modern generative models. However, existing approaches lack a general and unified way of representing structural and semantic conditioning at…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Luca Butera , Andrea Cini , Alberto Ferrante , Cesare Alippi

Investigating Object Compositionality in Generative Adversarial Networks

Deep generative models seek to recover the process with which the observed data was generated. They may be used to synthesize new samples or to subsequently extract representations. Successful approaches in the domain of images are driven…

Computer Vision and Pattern Recognition · Computer Science 2020-07-27 Sjoerd van Steenkiste , Karol Kurach , Jürgen Schmidhuber , Sylvain Gelly

ObjectMover: Generative Object Movement with Video Prior

Simple as it seems, moving an object to another location within an image is, in fact, a challenging image-editing task that requires re-harmonizing the lighting, adjusting the pose based on perspective, accurately filling occluded regions,…

Graphics · Computer Science 2025-03-12 Xin Yu , Tianyu Wang , Soo Ye Kim , Paul Guerrero , Xi Chen , Qing Liu , Zhe Lin , Xiaojuan Qi

BlenderFusion: 3D-Grounded Visual Editing and Generative Compositing

We present BlenderFusion, a generative visual compositing framework that synthesizes new scenes by recomposing objects, camera, and background. It follows a layering-editing-compositing pipeline: (i) segmenting and converting visual inputs…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Jiacheng Chen , Ramin Mehran , Xuhui Jia , Saining Xie , Sanghyun Woo

Cross-domain Compositing with Pretrained Diffusion Models

Diffusion models have enabled high-quality, conditional image editing capabilities. We propose to expand their arsenal, and demonstrate that off-the-shelf diffusion models can be used for a wide range of cross-domain compositing tasks.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Roy Hachnochi , Mingrui Zhao , Nadav Orzech , Rinon Gal , Ali Mahdavi-Amiri , Daniel Cohen-Or , Amit Haim Bermano

Composing Parts for Expressive Object Generation

Image composition and generation are processes where the artists need control over various parts of the generated images. However, the current state-of-the-art generation models, like Stable Diffusion, cannot handle fine-grained part-level…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Harsh Rangwani , Aishwarya Agarwal , Kuldeep Kulkarni , R. Venkatesh Babu , Srikrishna Karanam

Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Whie Jung , Jaehoon Yoo , Sungjin Ahn , Seunghoon Hong

Scene-Conditional 3D Object Stylization and Composition

Recently, 3D generative models have made impressive progress, enabling the generation of almost arbitrary 3D assets from text or image inputs. However, these approaches generate objects in isolation without any consideration for the scene…

Computer Vision and Pattern Recognition · Computer Science 2025-05-02 Jinghao Zhou , Tomas Jakab , Philip Torr , Christian Rupprecht

Multitwine: Multi-Object Compositing with Text and Layout Control

We introduce the first generative model capable of simultaneous multi-object compositing, guided by both text and layout. Our model allows for the addition of multiple objects within a scene, capturing a range of interactions from simple…

Computer Vision and Pattern Recognition · Computer Science 2025-02-10 Gemma Canet Tarrés , Zhe Lin , Zhifei Zhang , He Zhang , Andrew Gilbert , John Collomosse , Soo Ye Kim

A Simple Background Augmentation Method for Object Detection with Diffusion Model

In computer vision, it is well-known that a lack of data diversity will impair model performance. In this study, we address the challenges of enhancing the dataset diversity problem in order to benefit various downstream tasks such as…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Yuhang Li , Xin Dong , Chen Chen , Weiming Zhuang , Lingjuan Lyu

gCoRF: Generative Compositional Radiance Fields

3D generative models of objects enable photorealistic image synthesis with 3D control. Existing methods model the scene as a global scene representation, ignoring the compositional aspect of the scene. Compositional reasoning can enable a…

Graphics · Computer Science 2022-11-01 Mallikarjun BR , Ayush Tewari , Xingang Pan , Mohamed Elgharib , Christian Theobalt

ObjectMate: A Recurrence Prior for Object Insertion and Subject-Driven Generation

This paper introduces a tuning-free method for both object insertion and subject-driven generation. The task involves composing an object, given multiple views, into a scene specified by either an image or text. Existing methods struggle to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-12 Daniel Winter , Asaf Shul , Matan Cohen , Dana Berman , Yael Pritch , Alex Rav-Acha , Yedid Hoshen