Related papers: Structured Pattern Expansion with Diffusion Models

SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation

In recent years, the development of diffusion models has led to significant progress in image and video generation tasks, with pre-trained models like the Stable Diffusion series playing a crucial role. Inspired by model pruning which…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Teng Hu , Jiangning Zhang , Ran Yi , Hongrui Huang , Yabiao Wang , Lizhuang Ma

Structured 3D Latents Are Surprisingly Powerful: Unleashing Generalizable Style with 2D Diffusion

3D asset generation plays a pivotal role in fields such as gaming and virtual reality, enabling the rapid synthesis of high-fidelity 3D objects from a single or multiple images. Building on this capability, enabling style-controllable…

Computer Vision and Pattern Recognition · Computer Science 2026-05-08 Yiran Qiao , Yiren Lu , Yunlai Zhou , Disheng Liu , Linlin Hou , Rui Yang , Yu Yin , Jing Ma

Cross-domain Compositing with Pretrained Diffusion Models

Diffusion models have enabled high-quality, conditional image editing capabilities. We propose to expand their arsenal, and demonstrate that off-the-shelf diffusion models can be used for a wide range of cross-domain compositing tasks.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Roy Hachnochi , Mingrui Zhao , Nadav Orzech , Rinon Gal , Ali Mahdavi-Amiri , Daniel Cohen-Or , Amit Haim Bermano

Diffusion models for multivariate subsurface generation and efficient probabilistic inversion

Diffusion models offer stable training and state-of-the-art performance for deep generative modeling tasks. Here, we consider their use in the context of multivariate subsurface modeling and probabilistic inversion. We first demonstrate…

Computer Vision and Pattern Recognition · Computer Science 2026-01-28 Roberto Miele , Niklas Linde

FashionSD-X: Multimodal Fashion Garment Synthesis using Latent Diffusion

The rapid evolution of the fashion industry increasingly intersects with technological advancements, particularly through the integration of generative AI. This study introduces a novel generative pipeline designed to transform the fashion…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Abhishek Kumar Singh , Ioannis Patras

Precise Parameter Localization for Textual Generation in Diffusion Models

Novel diffusion models can synthesize photo-realistic images with integrated high-quality text. Surprisingly, we demonstrate through attention activation patching that only less than $1$% of diffusion models' parameters, all contained in…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Łukasz Staniszewski , Bartosz Cywiński , Franziska Boenisch , Kamil Deja , Adam Dziedzic

Training-Free Constrained Generation With Stable Diffusion Models

Stable diffusion models represent the state-of-the-art in data synthesis across diverse domains and hold transformative potential for applications in science and engineering, e.g., by facilitating the discovery of novel solutions and…

Machine Learning · Computer Science 2025-10-23 Stefano Zampini , Jacob K. Christopher , Luca Oneto , Davide Anguita , Ferdinando Fioretto

An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization

Diffusion models, a powerful and universal generative AI technology, have achieved tremendous success in computer vision, audio, reinforcement learning, and computational biology. In these applications, diffusion models provide flexible…

Machine Learning · Computer Science 2024-04-12 Minshuo Chen , Song Mei , Jianqing Fan , Mengdi Wang

Diffusion Models for Time Series Applications: A Survey

Diffusion models, a family of generative models based on deep learning, have become increasingly prominent in cutting-edge machine learning research. With a distinguished performance in generating samples that resemble the observed data,…

Machine Learning · Computer Science 2023-05-02 Lequan Lin , Zhengkun Li , Ruikun Li , Xuliang Li , Junbin Gao

Parameter-aware high-fidelity microstructure generation using stable diffusion

Synthesizing realistic microstructure images conditioned on processing parameters is crucial for understanding process-structure relationships in materials design. However, this task remains challenging due to limited training micrographs…

Materials Science · Physics 2025-11-21 Hoang Cuong Phan , Minh Tien Tran , Chihun Lee , Hoheok Kim , Sehyeok Oh , Dong-Kyu Kim , Ho Won Lee

LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation

Recently, diffusion models have achieved great success in image synthesis. However, when it comes to the layout-to-image generation where an image often has a complex scene of multiple objects, how to make strong control over both the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Guangcong Zheng , Xianpan Zhou , Xuewei Li , Zhongang Qi , Ying Shan , Xi Li

Few-Shot Synthetic Data Generation with Diffusion Models for Downstream Vision Tasks

Class imbalance is a persistent challenge in visual recognition, particularly in safety-critical domains where collecting positive examples is expensive and rare events are inherently underrepresented. We propose a lightweight synthetic…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Daniil Dushenev , Nazariy Karpov , Daniil Zinovjev , Alexander Gorin , Konstantin Kulikov

STAY Diffusion: Styled Layout Diffusion Model for Diverse Layout-to-Image Generation

In layout-to-image (L2I) synthesis, controlled complex scenes are generated from coarse information like bounding boxes. Such a task is exciting to many downstream applications because the input layouts offer strong guidance to the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Ruyu Wang , Xuefeng Hou , Sabrina Schmedding , Marco F. Huber

Latent Diffusion Models for Structural Component Design

Recent advances in generative modeling, namely Diffusion models, have revolutionized generative modeling, enabling high-quality image generation tailored to user needs. This paper proposes a framework for the generative design of structural…

Machine Learning · Computer Science 2023-09-26 Ethan Herron , Jaydeep Rade , Anushrut Jignasu , Baskar Ganapathysubramanian , Aditya Balu , Soumik Sarkar , Adarsh Krishnamurthy

Steered Diffusion: A Generalized Framework for Plug-and-Play Conditional Image Synthesis

Conditional generative models typically demand large annotated training sets to achieve high-quality synthesis. As a result, there has been significant interest in designing models that perform plug-and-play generation, i.e., to use a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-03 Nithin Gopalakrishnan Nair , Anoop Cherian , Suhas Lohit , Ye Wang , Toshiaki Koike-Akino , Vishal M. Patel , Tim K. Marks

LoRA-Enhanced Distillation on Guided Diffusion Models

Diffusion models, such as Stable Diffusion (SD), offer the ability to generate high-resolution images with diverse features, but they come at a significant computational and memory cost. In classifier-free guided diffusion models, prolonged…

Computer Vision and Pattern Recognition · Computer Science 2023-12-13 Pareesa Ameneh Golnari

DiffuseHigh: Training-free Progressive High-Resolution Image Synthesis through Structure Guidance

Large-scale generative models, such as text-to-image diffusion models, have garnered widespread attention across diverse domains due to their creative and high-fidelity image generation. Nonetheless, existing large-scale diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Younghyun Kim , Geunmin Hwang , Junyu Zhang , Eunbyung Park

Text Semantics to Image Generation: A method of building facades design base on Stable Diffusion model

Stable Diffusion model has been extensively employed in the study of archi-tectural image generation, but there is still an opportunity to enhance in terms of the controllability of the generated image content. A multi-network combined…

Computer Vision and Pattern Recognition · Computer Science 2023-04-10 Haoran Ma

Stylized Text-to-Motion Generation via Hypernetwork-Driven Low-Rank Adaptation

Text-driven motion diffusion models are capable of generating realistic human motions, but text alone often struggles to express fine-level nuances of motion, commonly referred to as style. Recent approaches have tackled this challenge by…

Computer Vision and Pattern Recognition · Computer Science 2026-05-14 Junhyuk Jeon , Seokhyeon Hong , Junyong Noh

Enhancing Diffusion Models with 3D Perspective Geometry Constraints

While perspective is a well-studied topic in art, it is generally taken for granted in images. However, for the recent wave of high-quality image synthesis methods such as latent diffusion models, perspective accuracy is not an explicit…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Rishi Upadhyay , Howard Zhang , Yunhao Ba , Ethan Yang , Blake Gella , Sicheng Jiang , Alex Wong , Achuta Kadambi