Related papers: Controllable Stylistic Text Generation with Train-…

CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation

Although autoregressive models have dominated language modeling in recent years, there has been a growing interest in exploring alternative paradigms to the conventional next-token prediction framework. Diffusion-based language models have…

Computation and Language · Computer Science 2025-10-23 Chihan Huang , Hao Tang

Air-Decoding: Attribute Distribution Reconstruction for Decoding-Time Controllable Text Generation

Controllable text generation (CTG) aims to generate text with desired attributes, and decoding-time-based methods have shown promising performance on this task. However, in this paper, we identify the phenomenon of Attribute Collapse for…

Computation and Language · Computer Science 2023-11-03 Tianqi Zhong , Quan Wang , Jingxuan Han , Yongdong Zhang , Zhendong Mao

ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

Data-driven deep learning models have enabled tremendous progress in change detection (CD) with the support of pixel-level annotations. However, collecting diverse data and manually annotating them is costly, laborious, and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Qi Zang , Jiayi Yang , Shuang Wang , Dong Zhao , Wenjun Yi , Zhun Zhong

Constrained Discrete Diffusion

Discrete diffusion models are a class of generative models that construct sequences by progressively denoising samples from a categorical noise distribution. Beyond their rapidly growing ability to generate coherent natural language, these…

Computation and Language · Computer Science 2025-12-11 Michael Cardei , Jacob K Christopher , Thomas Hartvigsen , Bhavya Kailkhura , Ferdinando Fioretto

Controllable Graph Generation with Diffusion Models via Inference-Time Tree Search Guidance

Graph generation is a fundamental problem in graph learning with broad applications across Web-scale systems, knowledge graphs, and scientific domains such as drug and material discovery. Recent approaches leverage diffusion models for…

Machine Learning · Computer Science 2026-03-18 Jiachi Zhao , Zehong Wang , Yamei Liao , Chuxu Zhang , Yanfang Ye

An Invariant Learning Characterization of Controlled Text Generation

Controlled generation refers to the problem of creating text that contains stylistic or semantic attributes of interest. Many approaches reduce this problem to training a predictor of the desired attribute. For example, researchers hoping…

Computation and Language · Computer Science 2023-06-02 Carolina Zheng , Claudia Shi , Keyon Vafa , Amir Feder , David M. Blei

From Diffusion to Rectified Flow: Rethinking Text-Based Segmentation

Text-based image segmentation aims to delineate object boundaries within an image from text prompts, offering higher flexibility and broader application scope compared to traditional fixed-category segmentation tasks. Recent studies have…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Zishen Qu , Xuesong Li , Haijian Gu , Hongwei Kang , Quan Meng , Tianrui Niu , Xin Yang , Ruidong Pan

Rectified Diffusion Guidance for Conditional Generation

Classifier-Free Guidance (CFG), which combines the conditional and unconditional score functions with two coefficients summing to one, serves as a practical technique for diffusion model sampling. Theoretically, however, denoising with CFG…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Mengfei Xia , Nan Xue , Yujun Shen , Ran Yi , Tieliang Gong , Yong-Jin Liu

TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation

Designing protein sequences with specific biological functions and structural stability is crucial in biology and chemistry. Generative models already demonstrated their capabilities for reliable protein design. However, previous models are…

Machine Learning · Computer Science 2024-02-28 Lin Zongying , Li Hao , Lv Liuzhenghao , Lin Bin , Zhang Junwu , Chen Calvin Yu-Chian , Yuan Li , Tian Yonghong

SGDiff: A Style Guided Diffusion Model for Fashion Synthesis

This paper reports on the development of \textbf{a novel style guided diffusion model (SGDiff)} which overcomes certain weaknesses inherent in existing models for image synthesis. The proposed SGDiff combines image modality with a…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Zhengwentai Sun , Yanghong Zhou , Honghong He , P. Y. Mok

CFG++: Manifold-constrained Classifier Free Guidance for Diffusion Models

Classifier-free guidance (CFG) is a fundamental tool in modern diffusion models for text-guided generation. Although effective, CFG has notable drawbacks. For instance, DDIM with CFG lacks invertibility, complicating image editing;…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Hyungjin Chung , Jeongsol Kim , Geon Yeong Park , Hyelin Nam , Jong Chul Ye

Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation

Diffusion models have emerged as powerful generative models for graph generation, yet their use for conditional graph generation remains a fundamental challenge. In particular, guiding diffusion models on graphs under arbitrary reward…

Machine Learning · Computer Science 2025-05-27 Victor M. Tenorio , Nicolas Zilberstein , Santiago Segarra , Antonio G. Marques

Stylistic Attribute Control in Latent Diffusion Models

Text-to-image diffusion models have revolutionized image synthesis and editing, but precise control over stylistic attributes remains a challenge, often causing unintended content modifications. We propose an approach for fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Max Reimann , Benito Buchheim , Jürgen Döllner

DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models

In this paper, we present DesignDiffusion, a simple yet effective framework for the novel task of synthesizing design images from textual descriptions. A primary challenge lies in generating accurate and style-consistent textual and visual…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Zhendong Wang , Jianmin Bao , Shuyang Gu , Dong Chen , Wengang Zhou , Houqiang Li

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors

Recently, the multimedia community has witnessed the rise of diffusion models trained on large-scale multi-modal data for visual content creation, particularly in the field of text-to-image generation. In this paper, we propose a new task…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Jingwen Chen , Yingwei Pan , Ting Yao , Tao Mei

LatRef-Diff: Latent and Reference-Guided Diffusion for Facial Attribute Editing and Style Manipulation

Facial attribute editing and style manipulation are crucial for applications like virtual avatars and photo editing. However, achieving precise control over facial attributes without altering unrelated features is challenging due to the…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Wenmin Huang , Weiqi Luo , Xiaochun Cao , Jiwu Huang

DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations

The diffusion-based text-to-image model harbors immense potential in transferring reference style. However, current encoder-based approaches significantly impair the text controllability of text-to-image models while transferring styles. In…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Tianhao Qi , Shancheng Fang , Yanze Wu , Hongtao Xie , Jiawei Liu , Lang Chen , Qian He , Yongdong Zhang

CustomText: Customized Textual Image Generation using Diffusion Models

Textual image generation spans diverse fields like advertising, education, product packaging, social media, information visualization, and branding. Despite recent strides in language-guided image synthesis using diffusion models, current…

Computer Vision and Pattern Recognition · Computer Science 2024-05-22 Shubham Paliwal , Arushi Jain , Monika Sharma , Vikram Jamwal , Lovekesh Vig

TreeDiff: AST-Guided Code Generation with Diffusion LLMs

Code generation is increasingly critical for real-world applications. Still, diffusion-based large language models continue to struggle with this demand. Unlike free-form text, code requires syntactic precision; even minor structural…

Computation and Language · Computer Science 2026-01-07 Yiming Zeng , Jinghan Cao , Zexin Li , Yiming Chen , Tao Ren , Zhuochun Li , Dawei Xiang , Xidong Wu , Shangqian Gao , Tingting Yu

GlyphDiffusion: Text Generation as Image Generation

Diffusion models have become a new generative paradigm for text generation. Considering the discrete categorical nature of text, in this paper, we propose GlyphDiffusion, a novel diffusion approach for text generation via text-guided image…

Computation and Language · Computer Science 2023-05-09 Junyi Li , Wayne Xin Zhao , Jian-Yun Nie , Ji-Rong Wen