Related papers: Gradient-free Decoder Inversion in Latent Diffusio…

High-Resolution Image Synthesis with Latent Diffusion Models

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a…

Computer Vision and Pattern Recognition · Computer Science 2022-04-14 Robin Rombach , Andreas Blattmann , Dominik Lorenz , Patrick Esser , Björn Ommer

Representation Learning with Diffusion Models

Diffusion models (DMs) have achieved state-of-the-art results for image synthesis tasks as well as density estimation. Applied in the latent space of a powerful pretrained autoencoder (LDM), their immense computational requirements can be…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Jeremias Traub

Communication-Efficient Serving for Video Diffusion Models with Latent Parallelism

Video diffusion models (VDMs) perform attention computation over the 3D spatio-temporal domain. Compared to large language models (LLMs) processing 1D sequences, their memory consumption scales cubically, necessitating parallel serving…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-09 Zhiyuan Wu , Shuai Wang , Li Chen , Kaihui Gao , Dan Li , Yanyu Ren , Qiming Zhang , Yong Wang

How to Train Your Latent Diffusion Language Model Jointly With the Latent Space

Latent diffusion models offer an attractive alternative to discrete diffusion for non-autoregressive text generation by operating on continuous text representations and denoising entire sequences in parallel. The major challenge in latent…

Computation and Language · Computer Science 2026-05-11 Viacheslav Meshchaninov , Alexander Shabalin , Egor Chimbulatov , Nikita Gushchin , Ilya Koziev , Alexander Korotin , Dmitry Vetrov

There and Back Again: On the relation between Noise and Image Inversions in Diffusion Models

Diffusion Models achieve state-of-the-art performance in generating new samples but lack a low-dimensional latent space that encodes the data into editable features. Inversion-based methods address this by reversing the denoising…

Computer Vision and Pattern Recognition · Computer Science 2026-01-28 Łukasz Staniszewski , Łukasz Kuciński , Kamil Deja

Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation

We investigate methods to reduce inference time and memory footprint in stable diffusion models by introducing lightweight decoders for both image and video synthesis. Traditional latent diffusion pipelines rely on large Variational…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Alexey Buzovkin , Evgeny Shilov

Controlling Latent Diffusion Using Latent CLIP

Instead of performing text-conditioned denoising in the image domain, latent diffusion models (LDMs) operate in latent space of a variational autoencoder (VAE), enabling more efficient processing at reduced computational costs. However,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Jason Becker , Chris Wendler , Peter Baylies , Robert West , Christian Wressnegger

LVMark: Robust Watermark for Latent Video Diffusion Models

Rapid advancements in video diffusion models have enabled the creation of realistic videos, raising concerns about unauthorized use and driving the demand for techniques to protect model ownership. Existing watermarking methods, while…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 MinHyuk Jang , Youngdong Jang , JaeHyeok Lee , Feng Yang , Gyeongrok Oh , Jongheon Jeong , Sangpil Kim

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Latent Diffusion models (LDMs) have achieved remarkable results in synthesizing high-resolution images. However, the iterative sampling process is computationally intensive and leads to slow generation. Inspired by Consistency Models (song…

Computer Vision and Pattern Recognition · Computer Science 2023-10-09 Simian Luo , Yiqin Tan , Longbo Huang , Jian Li , Hang Zhao

Latent Diffusion Prior Enhanced Deep Unfolding for Snapshot Spectral Compressive Imaging

Snapshot compressive spectral imaging reconstruction aims to reconstruct three-dimensional spatial-spectral images from a single-shot two-dimensional compressed measurement. Existing state-of-the-art methods are mostly based on deep…

Image and Video Processing · Electrical Eng. & Systems 2024-08-27 Zongliang Wu , Ruiying Lu , Ying Fu , Xin Yuan

Latent Diffusion Inversion Requires Understanding the Latent Space

The recovery of training data from generative models ("model inversion") has been extensively studied for diffusion models in the data domain as a memorization/overfitting phenomenon. Latent diffusion models (LDMs), which operate on the…

Machine Learning · Computer Science 2026-03-26 Mingxing Rao , Bowen Qu , Daniel Moyer

Gradient-Free Textual Inversion

Recent works on personalized text-to-image generation usually learn to bind a special token with specific subjects or styles of a few given images by tuning its embedding through gradient descent. It is natural to question whether we can…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Zhengcong Fei , Mingyuan Fan , Junshi Huang

Blind Inversion using Latent Diffusion Priors

Diffusion models have emerged as powerful tools for solving inverse problems due to their exceptional ability to model complex prior distributions. However, existing methods predominantly assume known forward operators (i.e., non-blind),…

Computer Vision and Pattern Recognition · Computer Science 2024-07-02 Weimin Bai , Siyi Chen , Wenzheng Chen , He Sun

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding excessive compute demands by training a diffusion model in a compressed lower-dimensional latent space. Here, we apply the LDM paradigm to high-resolution…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Andreas Blattmann , Robin Rombach , Huan Ling , Tim Dockhorn , Seung Wook Kim , Sanja Fidler , Karsten Kreis

Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

One of the main drawback of diffusion models is the slow inference time for image generation. Among the most successful approaches to addressing this problem are distillation methods. However, these methods require considerable…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Senmao Li , Taihang Hu , Joost van de Weijer , Fahad Shahbaz Khan , Tao Liu , Linxuan Li , Shiqi Yang , Yaxing Wang , Ming-Ming Cheng , Jian Yang

DP-LDMs: Differentially Private Latent Diffusion Models

Diffusion models (DMs) are one of the most widely used generative models for producing high quality images. However, a flurry of recent papers points out that DMs are least private forms of image generators, by extracting a significant…

Machine Learning · Statistics 2025-03-06 Michael F. Liu , Saiyue Lyu , Margarita Vinaroz , Mijung Park

Principled Latent Diffusion for Graphs via Laplacian Autoencoders

Graph diffusion models achieve state-of-the-art performance in graph generation but suffer from quadratic complexity in the number of nodes -- and much of their capacity is wasted modeling the absence of edges in sparse graphs. Inspired by…

Machine Learning · Computer Science 2026-05-13 Antoine Siraudin , Christopher Morris

Gradient flow encoding with distance optimization adaptive step size

The autoencoder model uses an encoder to map data samples to a lower dimensional latent space and then a decoder to map the latent space representations back to the data space. Implicitly, it relies on the encoder to approximate the inverse…

Machine Learning · Statistics 2021-05-12 Kyriakos Flouris , Anna Volokitin , Gustav Bredell , Ender Konukoglu

Differentiable Inverse Modeling with Physics-Constrained Latent Diffusion for Heterogeneous Subsurface Parameter Fields

We present a latent diffusion-based differentiable inversion method (LD-DIM) for PDE-constrained inverse problems involving high-dimensional spatially distributed coefficients. LD-DIM couples a pretrained latent diffusion prior with an…

Numerical Analysis · Mathematics 2025-12-30 Zihan Lin , QiZhi He

LatentINDIGO: An INN-Guided Latent Diffusion Algorithm for Image Restoration

There is a growing interest in the use of latent diffusion models (LDMs) for image restoration (IR) tasks due to their ability to model effectively the distribution of natural images. While significant progress has been made, there are…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Di You , Daniel Siromani , Pier Luigi Dragotti