English
Related papers

Related papers: Compositional Image Decomposition with Diffusion M…

200 papers

Given a factorization of an image into a sum of linear components, we present a zero-shot method to control each individual component through diffusion model sampling. For example, we can decompose an image into low and high spatial…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Daniel Geng , Inbum Park , Andrew Owens

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang , Justin Dauwels , Yilun Du

Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Bo Zhang , Yuxuan Duan , Jun Lan , Yan Hong , Huijia Zhu , Weiqiang Wang , Li Niu

Denoising diffusion models enable conditional generation and density modeling of complex relationships like images and text. However, the nature of the learned relationships is opaque making it difficult to understand precisely what…

Machine Learning · Computer Science 2024-05-21 Xianghao Kong , Ollie Liu , Han Li , Dani Yogatama , Greg Ver Steeg

For an artist or a graphic designer, the spatial layout of a scene is a critical design choice. However, existing text-to-image diffusion models provide limited support for incorporating spatial information. This paper introduces Composite…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Vikram Jamwal , Ramaneswaran S

While score based generative models, or diffusion models, have found success in image synthesis, they are often coupled with text data or image label to be able to manipulate and conditionally generate images. Even though manipulation of…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Sandesh Ghimire , Armand Comas , Davin Hill , Aria Masoomi , Octavia Camps , Jennifer Dy

Large text-guided diffusion models, such as DALLE-2, are able to generate stunning photorealistic images given natural language descriptions. While such models are highly flexible, they struggle to understand the composition of certain…

Computer Vision and Pattern Recognition · Computer Science 2023-01-18 Nan Liu , Shuang Li , Yilun Du , Antonio Torralba , Joshua B. Tenenbaum

Image composition in image editing involves merging a foreground image with a background image to create a composite. Inconsistent lighting conditions between the foreground and background often result in unrealistic composites. Image…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Jiajie Li , Jian Wang , Chen Wang , Jinjun Xiong

Conditional image synthesis based on user-specified requirements is a key component in creating complex visual content. In recent years, diffusion-based generative modeling has become a highly effective way for conditional image synthesis,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Zheyuan Zhan , Defang Chen , Jian-Ping Mei , Zhenghe Zhao , Jiawei Chen , Chun Chen , Siwei Lyu , Can Wang

Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Michael Niemeyer , Andreas Geiger

Diffusion models are capable of generating photo-realistic images that combine elements which likely do not appear together in the training set, demonstrating the ability to \textit{compositionally generalize}. Nonetheless, the precise…

Artificial Intelligence · Computer Science 2024-10-14 Qiyao Liang , Ziming Liu , Mitchell Ostrow , Ila Fiete

Many visual scenes can be described as compositions of latent factors. Effective recognition, reasoning, and editing often require not only forming such compositional representations, but also solving the decomposition problem. One popular…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Calvin Yeung , Ali Zakeri , Zhuowen Zou , Mohsen Imani

Image composition involves inserting a foreground object into the background while synthesizing environment-consistent effects such as shadows and reflections. Although shadow generation has been extensively studied, reflection generation…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Haonan Zhao , Qingyang Liu , Jiaxuan Chen , Li Niu

We propose Context Diffusion, a diffusion-based framework that enables image generation models to learn from visual examples presented in context. Recent work tackles such in-context learning for image generation, where a query image is…

Computer Vision and Pattern Recognition · Computer Science 2025-07-24 Ivona Najdenkoska , Animesh Sinha , Abhimanyu Dubey , Dhruv Mahajan , Vignesh Ramanathan , Filip Radenovic

We present Intrinsic Image Diffusion, a generative model for appearance decomposition of indoor scenes. Given a single input view, we sample multiple possible material explanations represented as albedo, roughness, and metallic maps.…

Computer Vision and Pattern Recognition · Computer Science 2024-03-22 Peter Kocsis , Vincent Sitzmann , Matthias Nießner

Image decomposition is a crucial subject in the field of image processing. It can extract salient features from the source image. We propose a new image decomposition method based on convolutional neural network. This method can be applied…

Computer Vision and Pattern Recognition · Computer Science 2022-08-04 Yu Fu , Xiao-Jun Wu , Josef Kittler

The appearance of the same object may vary in different scene images due to perspectives and occlusions between objects. Humans can easily identify the same object, even if occlusions exist, by completing the occluded parts based on its…

Computer Vision and Pattern Recognition · Computer Science 2022-11-28 Tonglin Chen , Bin Li , Zhimeng Shen , Xiangyang Xue

Diffusion models generate images with an unprecedented level of quality, but how can we freely rearrange image layouts? Recent works generate controllable scenes via learning spatially disentangled latent codes, but these methods do not…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Jiawei Ren , Mengmeng Xu , Jui-Chieh Wu , Ziwei Liu , Tao Xiang , Antoine Toisoul

Image composition and generation are processes where the artists need control over various parts of the generated images. However, the current state-of-the-art generation models, like Stable Diffusion, cannot handle fine-grained part-level…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Harsh Rangwani , Aishwarya Agarwal , Kuldeep Kulkarni , R. Venkatesh Babu , Srikrishna Karanam

Images can be viewed as layered compositions, foreground objects over background, with potential occlusions. This layered representation enables independent editing of elements, offering greater flexibility for content creation. Despite the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Jingxi Chen , Yixiao Zhang , Xiaoye Qian , Zongxia Li , Cornelia Fermuller , Caren Chen , Yiannis Aloimonos
‹ Prev 1 2 3 10 Next ›