Related papers: Multi-turn Consistent Image Editing

Improving Editability in Image Generation with Layer-wise Memory

Most real-world image editing tasks require multiple sequential edits to achieve desired results. Current editing approaches, primarily designed for single-object modifications, struggle with sequential editing: especially with maintaining…

Computer Vision and Pattern Recognition · Computer Science 2025-05-05 Daneul Kim, Jaeah Lee, Jaesik Park

Consolidating Attention Features for Multi-view Image Editing

Large-scale text-to-image models enable a wide range of image editing techniques, using text prompts or even spatial controls. However, applying these editing methods to multi-view images depicting a single scene leads to 3D-inconsistent…

Computer Vision and Pattern Recognition · Computer Science 2024-02-23 Or Patashnik, Rinon Gal, Daniel Cohen-Or, Jun-Yan Zhu, Fernando De la Torre

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Leveraging the large generative prior of the flow transformer for tuning-free image editing requires authentic inversion to project the image into the model's domain and a flexible invariance control mechanism to preserve non-target…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Pengcheng Xu, Boyuan Jiang, Xiaobin Hu, Donghao Luo, Qingdong He, Jiangning Zhang, Chengjie Wang, Yunsheng Wu, Charles Ling, Boyu Wang

Interactive Image Restoration

Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while…

Human-Computer Interaction · Computer Science 2019-10-25 Zhiwei Han, Thomas Weber, Stefan Matthes, Yuanting Liu, Hao Shen

INRetouch: Context Aware Implicit Neural Representation for Photography Retouching

Professional photo editing remains challenging, requiring extensive knowledge of imaging pipelines and significant expertise. While recent deep learning approaches, particularly style transfer methods, have attempted to automate this…

Image and Video Processing · Electrical Eng. & Systems 2025-12-11 Omar Elezabi, Marcos V. Conde, Zongwei Wu, Radu Timofte

Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models

Recent advances in image editing with diffusion models have achieved impressive results, offering fine-grained control over the generation process. However, these methods are computationally intensive because of their iterative nature.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Ilia Beletskii, Andrey Kuznetsov, Aibek Alanov

Continuous Layout Editing of Single Images with Diffusion Models

Recent advancements in large-scale text-to-image diffusion models have enabled many applications in image editing. However, none of these methods have been able to edit the layout of single existing images. To address this gap, we propose…

Computer Vision and Pattern Recognition · Computer Science 2023-06-23 Zhiyuan Zhang, Zhitong Huang, Jing Liao

ReFlex: Text-Guided Editing of Real Images in Rectified Flow via Mid-Step Feature Extraction and Attention Adaptation

Rectified Flow text-to-image models surpass diffusion models in image quality and text alignment, but adapting ReFlow for real-image editing remains challenging. We propose a new real-image editing method for ReFlow by analyzing the…

Computer Vision and Pattern Recognition · Computer Science 2025-07-03 Jimyeong Kim, Jungwon Park, Yeji Song, Nojun Kwak, Wonjong Rhee

Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance

Existing text-to-image editing methods tend to excel either in rigid or non-rigid editing but encounter challenges when combining both, resulting in misaligned outputs with the provided text prompts. In addition, integrating reference…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Jiacheng Wang, Ping Liu, Wei Xu

Instilling Multi-round Thinking to Text-guided Image Generation

This paper delves into the text-guided image editing task, focusing on modifying a reference image according to user-specified textual feedback to embody specific attributes. Despite recent advancements, a persistent challenge remains that…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Lidong Zeng, Zhedong Zheng, Yinwei Wei, Tat-seng Chua

FreqEdit: Preserving High-Frequency Features for Robust Multi-Turn Image Editing

Instruction-based image editing through natural language has emerged as a powerful paradigm for intuitive visual manipulation. While recent models achieve impressive results on single edits, they suffer from severe quality degradation under…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Yucheng Liao, Jiajun Liang, Kaiqian Cui, Baoquan Zhao, Haoran Xie, Wei Liu, Qing Li, Xudong Mao

IMAGAgent: Orchestrating Multi-Turn Image Editing via Constraint-Aware Planning and Reflection

Existing multi-turn image editing paradigms are often confined to isolated single-step execution. Due to a lack of context-awareness and closed-loop feedback mechanisms, they are prone to error accumulation and semantic drift during…

Graphics · Computer Science 2026-04-01 Fei Shen, Chengyu Xie, Lihong Wang, Zhanyi Zhang, Xin Jiang, Xiaoyu Du, Jinhui Tang

Consistent Image Layout Editing with Diffusion Models

Despite the great success of large-scale text-to-image diffusion models in image generation and image editing, existing methods still struggle to edit the layout of real images. Although a few works have been proposed to tackle this…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Tao Xia, Yudi Zhang, Ting Liu, Lei Zhang

Attention-based Adaptive Selection of Operations for Image Restoration in the Presence of Unknown Combined Distortions

Many studies have been conducted so far on image restoration, the problem of restoring a clean image from its distorted version. There are many different types of distortion which affect image quality. Previous studies have focused on…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Masanori Suganuma, Xing Liu, Takayuki Okatani

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Image retouching aims to enhance the visual quality of photos. Considering the different aesthetic preferences of users, the target of retouching is subjective. However, current retouching methods mostly adopt deterministic models, which…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Zheng-Peng Duan, Jiawei Zhang, Zheng Lin, Xin Jin, Dongqing Zou, Chunle Guo, Chongyi Li

SteerFlow: Steering Rectified Flows for Faithful Inversion-Based Image Editing

Recent advances in flow-based generative models have enabled training-free, text-guided image editing by inverting an image into its latent noise and regenerating it under a new target conditional guidance. However, existing methods…

Computer Vision and Pattern Recognition · Computer Science 2026-04-03 Thinh Dao, Zhen Wang, Kien T. Pham, Long Chen

Exposure: A White-Box Photo Post-Processing Framework

Retouching can significantly elevate the visual appeal of photos, but many casual photographers lack the expertise to do this well. To address this problem, previous works have proposed automatic retouching systems based on supervised…

Graphics · Computer Science 2018-02-09 Yuanming Hu, Hao He, Chenxi Xu, Baoyuan Wang, Stephen Lin

Multi-Expert Adaptive Selection: Task-Balancing for All-in-One Image Restoration

The use of a single image restoration framework to achieve multi-task image restoration has garnered significant attention from researchers. However, several practical challenges remain, including meeting the specific and simultaneous…

Computer Vision and Pattern Recognition · Computer Science 2024-07-30 Xiaoyan Yu, Shen Zhou, Huafeng Li, Liehuang Zhu

Making Image Editing Easier via Adaptive Task Reformulation with Agentic Executions

Instruction guided image editing has advanced substantially with recent generative models, yet it still fails to produce reliable results across many seemingly simple cases. We observe that a large portion of these failures stem not from…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Bo Zhao, Kairui Guo, Runnan Du, Haiyang Sun, Pengshan Wang, Huan Yang, Kun Gai, Yixin Cao, Wei Ji

Content-Aware Depth-Adaptive Image Restoration

This work prioritizes building a modular pipeline that utilizes existing models to systematically restore images, rather than creating new restoration models from scratch. Restoration is carried out at an object-specific level, with each…

Computer Vision and Pattern Recognition · Computer Science 2025-01-10 Tom Richard Vargis, Siavash Ghiasvand