Related papers: Control Color: Multimodal Diffusion-based Interact…

DiffColor: Toward High Fidelity Text-Guided Image Colorization with Diffusion Models

Recent data-driven image colorization methods have enabled automatic or reference-based colorization, while still suffering from unsatisfactory and inaccurate object-level color control. To address these issues, we propose a new method…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Jianxin Lin , Peng Xiao , Yijun Wang , Rongju Zhang , Xiangxiang Zeng

Diffusing Colors: Image Colorization with Text Guided Diffusion

The colorization of grayscale images is a complex and subjective task with significant challenges. Despite recent progress in employing large-scale datasets with deep neural networks, difficulties with controllability and visual quality…

Computer Vision and Pattern Recognition · Computer Science 2023-12-08 Nir Zabari , Aharon Azulay , Alexey Gorkor , Tavi Halperin , Ohad Fried

Language-based Image Colorization: A Benchmark and Beyond

Image colorization aims to bring colors back to grayscale images. Automatic image colorization methods, which requires no additional guidance, struggle to generate high-quality images due to color ambiguity, and provides limited user…

Computer Vision and Pattern Recognition · Computer Science 2025-03-20 Yifan Li , Shuai Yang , Jiaying Liu

Training-Free Text-Guided Color Editing with Multi-Modal Diffusion Transformer

Text-guided color editing in images and videos is a fundamental yet unsolved problem, requiring fine-grained manipulation of color attributes, including albedo, light source color, and ambient lighting, while preserving physical consistency…

Graphics · Computer Science 2026-02-04 Zixin Yin , Xili Dai , Ling-Hao Chen , Deyu Zhou , Jianan Wang , Duomin Wang , Gang Yu , Lionel M. Ni , Lei Zhang , Heung-Yeung Shum

ControlStyle: Text-Driven Stylized Image Generation Using Diffusion Priors

Recently, the multimedia community has witnessed the rise of diffusion models trained on large-scale multi-modal data for visual content creation, particularly in the field of text-to-image generation. In this paper, we propose a new task…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Jingwen Chen , Yingwei Pan , Ting Yao , Tao Mei

Controllable-Continuous Color Editing in Diffusion Model via Color Mapping

In recent years, text-driven image editing has made significant progress. However, due to the inherent ambiguity and discreteness of natural language, color editing still faces challenges such as insufficient precision and difficulty in…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Yuqi Yang , Dongliang Chang , Yuanchen Fang , Yi-Zhe SonG , Zhanyu Ma , Jun Guo

Local Conditional Controlling for Text-to-Image Diffusion Models

Diffusion models have exhibited impressive prowess in the text-to-image task. Recent methods add image-level structure controls, e.g., edge and depth maps, to manipulate the generation process together with text prompts to obtain desired…

Computer Vision and Pattern Recognition · Computer Science 2024-08-23 Yibo Zhao , Liang Peng , Yang Yang , Zekai Luo , Hengjia Li , Yao Chen , Zheng Yang , Xiaofei He , Wei Zhao , qinglin lu , Boxi Wu , Wei Liu

Interactive Deep Colorization With Simultaneous Global and Local Inputs

Colorization methods using deep neural networks have become a recent trend. However, most of them do not allow user inputs, or only allow limited user inputs (only global inputs or only local inputs), to control the output colorful images.…

Computer Vision and Pattern Recognition · Computer Science 2018-01-30 Yi Xiao , Peiyao Zhou , Yan Zheng

ControlCom: Controllable Image Composition using Diffusion Model

Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Bo Zhang , Yuxuan Duan , Jun Lan , Yan Hong , Huijia Zhu , Weiqiang Wang , Li Niu

Instance-aware Image Colorization with Controllable Textual Descriptions and Segmentation Masks

Recently, the application of deep learning in image colorization has received widespread attention. The maturation of diffusion models has further advanced the development of image colorization models. However, current mainstream image…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Yanru An , Ling Gui , Chunlei Cai , Tianxiao Ye , JIangchao Yao , Guangtao Zhai , Qiang Hu , Xiaoyun Zhang

Follow-Your-Color: Multi-Instance Sketch Colorization

We present Follow-Your-Color, a diffusion-based framework for multi-instance sketch colorization. The production of multi-instance 2D line art colorization adheres to an industry-standard workflow, which consists of three crucial stages:…

Computer Vision and Pattern Recognition · Computer Science 2025-08-08 Yinhan Zhang , Yue Ma , Bingyuan Wang , Qifeng Chen , Zeyu Wang

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Exploring Palette based Color Guidance in Diffusion Models

With the advent of diffusion models, Text-to-Image (T2I) generation has seen substantial advancements. Current T2I models allow users to specify object colors using linguistic color names, and some methods aim to personalize color-object…

Graphics · Computer Science 2025-08-13 Qianru Qiu , Jiafeng Mao , Xueting Wang

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Automatic black-and-white image sequence colorization while preserving character and object identity (ID) is a complex task with significant market demand, such as in cartoon or comic series colorization. Despite advancements in visual…

Computer Vision and Pattern Recognition · Computer Science 2025-05-06 Junhao Zhuang , Xuan Ju , Zhaoyang Zhang , Yong Liu , Shiyi Zhang , Chun Yuan , Ying Shan

Multi-party Collaborative Attention Control for Image Customization

The rapid advancement of diffusion models has increased the need for customized image generation. However, current customization methods face several limitations: 1) typically accept either image or text conditions alone; 2) customization…

Computer Vision and Pattern Recognition · Computer Science 2025-05-06 Han Yang , Chuanguang Yang , Qiuli Wang , Zhulin An , Weilun Feng , Libo Huang , Yongjun Xu

DDColor: Towards Photo-Realistic Image Colorization via Dual Decoders

Image colorization is a challenging problem due to multi-modal uncertainty and high ill-posedness. Directly training a deep neural network usually leads to incorrect semantic colors and low color richness. While transformer-based methods…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Xiaoyang Kang , Tao Yang , Wenqi Ouyang , Peiran Ren , Lingzhi Li , Xuansong Xie

ControlEdit: A MultiModal Local Clothing Image Editing Method

Multimodal clothing image editing refers to the precise adjustment and modification of clothing images using data such as textual descriptions and visual images as control conditions, which effectively improves the work efficiency of…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Di Cheng , YingJie Shi , ShiXin Sun , JiaFu Zhang , WeiJing Wang , Yu Liu

ColorizeDiffusion: Adjustable Sketch Colorization with Reference Image and Text

Diffusion models have recently demonstrated their effectiveness in generating extremely high-quality images and are now utilized in a wide range of applications, including automatic sketch colorization. Although many methods have been…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Dingkun Yan , Liang Yuan , Erwin Wu , Yuma Nishioka , Issei Fujishiro , Suguru Saito

UniColor: A Unified Framework for Multi-Modal Colorization with Transformer

We propose the first unified framework UniColor to support colorization in multiple modalities, including both unconditional and conditional ones, such as stroke, exemplar, text, and even a mix of them. Rather than learning a separate model…

Computer Vision and Pattern Recognition · Computer Science 2022-09-23 Zhitong Huang , Nanxuan Zhao , Jing Liao

ControlText: Unlocking Controllable Fonts in Multilingual Text Rendering without Font Annotations

This work demonstrates that diffusion models can achieve font-controllable multilingual text rendering using just raw images without font label annotations.Visual text rendering remains a significant challenge. While recent methods…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Bowen Jiang , Yuan Yuan , Xinyi Bai , Zhuoqun Hao , Alyson Yin , Yaojie Hu , Wenyu Liao , Lyle Ungar , Camillo J. Taylor