English
Related papers

Related papers: DiffStyler: Controllable Dual Diffusion for Text-D…

200 papers

Recently, the multimedia community has witnessed the rise of diffusion models trained on large-scale multi-modal data for visual content creation, particularly in the field of text-to-image generation. In this paper, we propose a new task…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Jingwen Chen , Yingwei Pan , Ting Yao , Tao Mei

The rapid development of generative diffusion models has significantly advanced the field of style transfer. However, most current style transfer methods based on diffusion models typically involve a slow iterative optimization process,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Feihong He , Gang Li , Fuhui Sun , Mengyuan Zhang , Lingyu Si , Xiaoyan Wang , Li Shen

In image processing, one of the most challenging tasks is to render an image's semantic meaning using a variety of artistic approaches. Existing techniques for arbitrary style transfer (AST) frequently experience mode-collapse,…

Computer Vision and Pattern Recognition · Computer Science 2024-08-08 Onkar Susladkar , Gayatri Deshmukh , Sparsh Mittal , Parth Shastri

Style transfer, a pivotal task in image processing, synthesizes visually compelling images by seamlessly blending realistic content with artistic styles, enabling applications in photo editing and creative design. While mainstream…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yingying Deng , Xiangyu He , Fan Tang , Weiming Dong , Xucheng Yin

Style transfer aims to fuse the artistic representation of a style image with the structural information of a content image. Existing methods train specific networks or utilize pre-trained models to learn content and style features.…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ying Hu , Chenyi Zhuang , Pan Gao

Recent data-driven image colorization methods have enabled automatic or reference-based colorization, while still suffering from unsatisfactory and inaccurate object-level color control. To address these issues, we propose a new method…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Jianxin Lin , Peng Xiao , Yijun Wang , Rongju Zhang , Xiangxiang Zeng

Diffusion-based image translation guided by semantic texts or a single target image has enabled flexible style transfer which is not limited to the specific domains. Unfortunately, due to the stochastic nature of diffusion models, it is…

Computer Vision and Pattern Recognition · Computer Science 2023-02-02 Gihyun Kwon , Jong Chul Ye

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Scene text editing is a challenging task that involves modifying or inserting specified texts in an image while maintaining its natural and realistic appearance. Most previous approaches to this task rely on style-transfer models that crop…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Jiabao Ji , Guanhua Zhang , Zhaowen Wang , Bairu Hou , Zhifei Zhang , Brian Price , Shiyu Chang

Text-conditioned diffusion models can generate impressive images, but fall short when it comes to fine-grained control. Unlike direct-editing tools like Photoshop, text conditioned models require the artist to perform "prompt engineering,"…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Michelle Shu , Charles Herrmann , Richard Strong Bowen , Forrester Cole , Ramin Zabih

Image-based fashion design with AI techniques has attracted increasing attention in recent years. We focus on a new fashion design task, where we aim to transfer a reference appearance image onto a clothing image while preserving the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-15 Shidong Cao , Wenhao Chai , Shengyu Hao , Yanting Zhang , Hangyue Chen , Gaoang Wang

Style transfer combines the content of one signal with the style of another. It supports applications such as data augmentation and scenario simulation, helping machine learning models generalize in data-scarce domains. While well developed…

Recent advances in latent diffusion models have enabled exciting progress in image style transfer. However, several key issues remain. For example, existing methods still struggle to accurately match styles. They are often limited in the…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Dan Ruta , Abdelaziz Djelouah , Raphael Ortiz , Christopher Schroers

The diffusion-based text-to-image model harbors immense potential in transferring reference style. However, current encoder-based approaches significantly impair the text controllability of text-to-image models while transferring styles. In…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Tianhao Qi , Shancheng Fang , Yanze Wu , Hongtao Xie , Jiawei Liu , Lang Chen , Qian He , Yongdong Zhang

Neural Style Transfer (NST) is the field of study applying neural techniques to modify the artistic appearance of a content image to match the style of a reference style image. Traditionally, NST methods have focused on texture-based image…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Dan Ruta , Gemma Canet Tarrés , Andrew Gilbert , Eli Shechtman , Nicholas Kolkin , John Collomosse

Artistic styles are defined by both their structural and appearance elements. Existing neural stylization techniques primarily focus on transferring appearance-level features such as color and texture, often neglecting the equally crucial…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Ruixiang Jiang , Changwen Chen

We introduce Diff-Tracker, a novel approach for the challenging unsupervised visual tracking task leveraging the pre-trained text-to-image diffusion model. Our main idea is to leverage the rich knowledge encapsulated within the pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Zhengbo Zhang , Li Xu , Duo Peng , Hossein Rahmani , Jun Liu

Diffusion models have opened the path to a wide range of text-based image editing frameworks. However, these typically build on the multi-step nature of the diffusion backwards process, and adapting them to distilled, fast-sampling methods…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Gilad Deutch , Rinon Gal , Daniel Garibi , Or Patashnik , Daniel Cohen-Or

Large-scale text-to-video diffusion models have demonstrated an exceptional ability to synthesize diverse videos. However, due to the lack of extensive text-to-video datasets and the necessary computational resources for training, directly…

Computer Vision and Pattern Recognition · Computer Science 2023-05-10 Nisha Huang , Yuxin Zhang , Weiming Dong

3D content creation via text-driven stylization has played a fundamental challenge to multimedia and graphics community. Recent advances of cross-modal foundation models (e.g., CLIP) have made this problem feasible. Those approaches…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Haibo Yang , Yang Chen , Yingwei Pan , Ting Yao , Zhineng Chen , Tao Mei
‹ Prev 1 2 3 10 Next ›