Related papers: Transparent Image Layer Diffusion using Latent Tra…

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment

Transparent image layer generation plays a significant role in digital art and design workflows. Existing methods typically decompose transparent layers from a single RGB image using a set of tools or generate multiple transparent layers…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Dingbang Huang, Wenbo Li, Yifei Zhao, Xinyu Pan, Chun Wang, Yanhong Zeng, Bo Dai

Text2Layer: Layered Image Generation using Latent Diffusion Model

Layer compositing is one of the most popular image editing workflows among both amateurs and professionals. Motivated by the success of diffusion models, we explore layer compositing from a layered image generation perspective. Instead of…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Xinyang Zhang, Wentian Zhao, Xin Lu, Jeff Chien

Optical Diffusion Models for Image Generation

Diffusion models generate new samples by progressively decreasing the noise from the initially provided random distribution. This inference procedure generally utilizes a trained neural network numerous times to obtain the final output,…

Optics · Physics 2024-11-01 Ilker Oguz, Niyazi Ulas Dinc, Mustafa Yildirim, Junjie Ke, Innfarn Yoo, Qifei Wang, Feng Yang, Christophe Moser, Demetri Psaltis

LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors

Large-scale diffusion models have achieved remarkable success in generating high-quality images from textual descriptions, gaining popularity across various applications. However, the generation of layered content, such as transparent…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Yusuf Dalva, Yijun Li, Qing Liu, Nanxuan Zhao, Jianming Zhang, Zhe Lin, Pinar Yanardag

High-Resolution Image Synthesis with Latent Diffusion Models

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a…

Computer Vision and Pattern Recognition · Computer Science 2022-04-14 Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer

LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model

Despite the success of generating high-quality images given any text prompts by diffusion-based generative models, prior works directly generate the entire images, but cannot provide object-wise manipulation capability. To support wider…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu

Enabling Competitive Performance of Medical Imaging with Diffusion Model-generated Images without Privacy Leakage

Deep learning methods have impacted almost every research field, demonstrating notable successes in medical imaging tasks such as denoising and super-resolution. However, the prerequisite for deep learning is data at scale, but data sharing…

Medical Physics · Physics 2024-02-16 Yongyi Shi, Wenjun Xia, Chuang Niu, Christopher Wiedeman, Ge Wang

DreamLayer: Simultaneous Multi-Layer Generation via Diffusion Mode

Text-driven image generation using diffusion models has recently gained significant attention. To enable more flexible image manipulation and editing, recent research has expanded from single image generation to transparent layer generation…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Junjia Huang, Pengxiang Yan, Jinhang Cai, Jiyang Liu, Zhao Wang, Yitong Wang, Xinglong Wu, Guanbin Li

Diverse Diffusion: Enhancing Image Diversity in Text-to-Image Generation

Latent diffusion models excel at producing high-quality images from text. Yet, concerns appear about the lack of diversity in the generated imagery. To tackle this, we introduce Diverse Diffusion, a method for boosting image diversity…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Mariia Zameshina, Olivier Teytaud, Laurent Najman

Interactive Fashion Content Generation Using LLMs and Latent Diffusion Models

Fashionable image generation aims to synthesize images of diverse fashion prevalent around the globe, helping fashion designers in real-time visualization by giving them a basic customized structure of how a specific design preference would…

Computer Vision and Pattern Recognition · Computer Science 2023-06-14 Krishna Sri Ipsit Mantri, Nevasini Sasikumar

LayeringDiff: Layered Image Synthesis via Generation, then Disassembly with Generative Knowledge

Layers have become indispensable tools for professional artists, allowing them to build a hierarchical structure that enables independent control over individual visual elements. In this paper, we propose LayeringDiff, a novel pipeline for…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Kyoungkook Kang, Gyujin Sim, Geonung Kim, Donguk Kim, Seungho Nam, Sunghyun Cho

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li, Qinxuan Huang, Yikang Ding, Zhiheng Li

High-Resolution Image Editing via Multi-Stage Blended Diffusion

Diffusion models have shown great results in image generation and in image editing. However, current approaches are limited to low resolutions due to the computational cost of training diffusion models for high-resolution generation. We…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Johannes Ackermann, Minjun Li

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

Large-scale generative models, such as text-to-image diffusion models, have garnered widespread attention across diverse domains due to their creative and high-fidelity image generation. Nonetheless, existing large-scale diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Younghyun Kim, Geunmin Hwang, Junyu Zhang, Eunbyung Park

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation

Layout-to-image generation refers to the task of synthesizing photo-realistic images based on semantic layouts. In this paper, we propose LayoutDiffuse that adapts a foundational diffusion model pretrained on large-scale image or text-image…

Computer Vision and Pattern Recognition · Computer Science 2023-02-20 Jiaxin Cheng, Xiao Liang, Xingjian Shi, Tong He, Tianjun Xiao, Mu Li

Diffusion-based Holistic Texture Rectification and Synthesis

We present a novel framework for rectifying occlusions and distortions in degraded texture samples from natural images. Traditional texture synthesis approaches focus on generating textures from pristine samples, which necessitate…

Graphics · Computer Science 2023-09-27 Guoqing Hao, Satoshi Iizuka, Kensho Hara, Edgar Simo-Serra, Hirokatsu Kataoka, Kazuhiro Fukui

DiffuseTrace: A Transparent and Flexible Watermarking Scheme for Latent Diffusion Model

Latent Diffusion Models (LDMs) enable a wide range of applications but raise ethical concerns regarding illegal utilization. Adding watermarks to generative model outputs is a vital technique employed for copyright tracking and mitigating…

Cryptography and Security · Computer Science 2025-06-02 Liangqi Lei, Keke Gai, Jing Yu, Liehuang Zhu

StereoDiffusion: Training-Free Stereo Image Generation Using Latent Diffusion Models

The demand for stereo images increases as manufacturers launch more XR devices. To meet this demand, we introduce StereoDiffusion, a method that, unlike traditional inpainting pipelines, is training-free, remarkably straightforward to use,…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Lezhong Wang, Jeppe Revall Frisvad, Mark Bo Jensen, Siavash Arjomand Bigdeli

DiffHarmony: Latent Diffusion Model Meets Image Harmonization

Image harmonization, which involves adjusting the foreground of a composite image to attain a unified visual consistency with the background, can be conceptualized as an image-to-image translation task. Diffusion models have recently…

Computer Vision and Pattern Recognition · Computer Science 2024-04-10 Pengfei Zhou, Fangxiang Feng, Xiaojie Wang

ZoomLDM: Latent Diffusion Model for multi-scale image generation

Diffusion models have revolutionized image generation, yet several challenges restrict their application to large-image domains, such as digital pathology and satellite imagery. Given that it is infeasible to directly train a model on…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Srikar Yellapragada, Alexandros Graikos, Kostas Triaridis, Prateek Prasanna, Rajarsi R. Gupta, Joel Saltz, Dimitris Samaras