Related papers: DiffStyler: Controllable Dual Diffusion for Text-D…

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors

Recently, the multimedia community has witnessed the rise of diffusion models trained on large-scale multi-modal data for visual content creation, particularly in the field of text-to-image generation. In this paper, we propose a new task…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Jingwen Chen , Yingwei Pan , Ting Yao , Tao Mei

FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models

The rapid development of generative diffusion models has significantly advanced the field of style transfer. However, most current style transfer methods based on diffusion models typically involve a slow iterative optimization process,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Feihong He , Gang Li , Fuhui Sun , Mengyuan Zhang , Lingyu Si , Xiaoyan Wang , Li Shen

D2Styler: Advancing Arbitrary Style Transfer with Discrete Diffusion Methods

In image processing, one of the most challenging tasks is to render an image's semantic meaning using a variety of artistic approaches. Existing techniques for arbitrary style transfer (AST) frequently experience mode-collapse,…

Computer Vision and Pattern Recognition · Computer Science 2024-08-08 Onkar Susladkar , Gayatri Deshmukh , Sparsh Mittal , Parth Shastri

Inversion-Free Style Transfer with Dual Rectified Flows

Style transfer, a pivotal task in image processing, synthesizes visually compelling images by seamlessly blending realistic content with artistic styles, enabling applications in photo editing and creative design. While mainstream…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yingying Deng , Xiangyu He , Fan Tang , Weiming Dong , Xucheng Yin

DiffuseST: Unleashing the Capability of the Diffusion Model for Style Transfer

Style transfer aims to fuse the artistic representation of a style image with the structural information of a content image. Existing methods train specific networks or utilize pre-trained models to learn content and style features.…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ying Hu , Chenyi Zhuang , Pan Gao

DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models

Recent data-driven image colorization methods have enabled automatic or reference-based colorization, while still suffering from unsatisfactory and inaccurate object-level color control. To address these issues, we propose a new method…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Jianxin Lin , Peng Xiao , Yijun Wang , Rongju Zhang , Xiangxiang Zeng

Diffusion-based Image Translation using Disentangled Style and Content Representation

Diffusion-based image translation guided by semantic texts or a single target image has enabled flexible style transfer which is not limited to the specific domains. Unfortunately, due to the stochastic nature of diffusion models, it is…

Computer Vision and Pattern Recognition · Computer Science 2023-02-02 Gihyun Kwon , Jong Chul Ye

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Improving Diffusion Models for Scene Text Editing with Dual Encoders

Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance. Most previous approaches to this task rely on style-transfer models that crop…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Jiabao Ji , Guanhua Zhang , Zhaowen Wang , Bairu Hou , Zhifei Zhang , Brian Price , Shiyu Chang

DreamWalk: Style Space Exploration using Diffusion Guidance

Text-conditioned diffusion models can generate impressive images, but fall short when it comes to fine-grained control. Unlike direct-editing tools like Photoshop, text conditioned models require the artist to perform "prompt engineering,"…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Michelle Shu , Charles Herrmann , Richard Strong Bowen , Forrester Cole , Ramin Zabih

DiffFashion: Reference-based Fashion Design with Structure-aware Transfer by Diffusion Models

Image-based fashion design with AI techniques has attracted increasing attention in recent years. We focus on a new fashion design task, where we aim to transfer a reference appearance image onto a clothing image while preserving the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-15 Shidong Cao , Wenhao Chai , Shengyu Hao , Yanting Zhang , Hangyue Chen , Gaoang Wang

DiffStyleTS: Diffusion Model for Style Transfer in Time Series

Style transfer combines the content of one signal with the style of another. It supports applications such as data augmentation and scenario simulation, helping machine learning models generalize in data-scarce domains. While well developed…

Machine Learning · Computer Science 2025-10-14 Mayank Nagda , Phil Ostheimer , Justus Arweiler , Indra Jungjohann , Jennifer Werner , Dennis Wagner , Aparna Muraleedharan , Pouya Jafari , Jochen Schmid , Fabian Jirasek , Jakob Burger , Michael Bortz , Hans Hasse , Stephan Mandt , Marius Kloft , Sophie Fellenz

Leveraging Diffusion Models for Stylization using Multiple Style Images

Recent advances in latent diffusion models have enabled exciting progress in image style transfer. However, several key issues remain. For example, existing methods still struggle to accurately match styles. They are often limited in the…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Dan Ruta , Abdelaziz Djelouah , Raphael Ortiz , Christopher Schroers

DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations

The diffusion-based text-to-image model harbors immense potential in transferring reference style. However, current encoder-based approaches significantly impair the text controllability of text-to-image models while transferring styles. In…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Tianhao Qi , Shancheng Fang , Yanze Wu , Hongtao Xie , Jiawei Liu , Lang Chen , Qian He , Yongdong Zhang

DIFF-NST: Diffusion Interleaving For deFormable Neural Style Transfer

Neural Style Transfer (NST) is the field of study applying neural techniques to modify the artistic appearance of a content image to match the style of a reference style image. Traditionally, NST methods have focused on texture-based image…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Dan Ruta , Gemma Canet Tarrés , Andrew Gilbert , Eli Shechtman , Nicholas Kolkin , John Collomosse

DiffArtist: Towards Structure and Appearance Controllable Image Stylization

Artistic styles are defined by both their structural and appearance elements. Existing neural stylization techniques primarily focus on transferring appearance-level features such as color and texture, often neglecting the equally crucial…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Ruixiang Jiang , Changwen Chen

Diff-Tracker: Text-to-Image Diffusion Models are Unsupervised Trackers

We introduce Diff-Tracker, a novel approach for the challenging unsupervised visual tracking task leveraging the pre-trained text-to-image diffusion model. Our main idea is to leverage the rich knowledge encapsulated within the pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Zhengbo Zhang , Li Xu , Duo Peng , Hossein Rahmani , Jun Liu

TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Diffusion models have opened the path to a wide range of text-based image editing frameworks. However, these typically build on the multi-step nature of the diffusion backwards process, and adapting them to distilled, fast-sampling methods…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Gilad Deutch , Rinon Gal , Daniel Garibi , Or Patashnik , Daniel Cohen-Or

Style-A-Video: Agile Diffusion for Arbitrary Text-based Video Style Transfer

Large-scale text-to-video diffusion models have demonstrated an exceptional ability to synthesize diverse videos. However, due to the lack of extensive text-to-video datasets and the necessary computational resources for training, directly…

Computer Vision and Pattern Recognition · Computer Science 2023-05-10 Nisha Huang , Yuxin Zhang , Weiming Dong

3DStyle-Diffusion: Pursuing Fine-grained Text-driven 3D Stylization with 2D Diffusion Models

3D content creation via text-driven stylization has played a fundamental challenge to multimedia and graphics community. Recent advances of cross-modal foundation models (e.g., CLIP) have made this problem feasible. Those approaches…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Haibo Yang , Yang Chen , Yingwei Pan , Ting Yao , Zhineng Chen , Tao Mei