Related papers: Interactive Image Manipulation with Complex Text I…

Text as Neural Operator: Image Manipulation by Text Instruction

In recent years, text-guided image manipulation has gained increasing attention in the multimedia and computer vision community. The input to conditional image generation has evolved from image-only to multimodality. In this paper, we study…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Tianhao Zhang , Hung-Yu Tseng , Lu Jiang , Weilong Yang , Honglak Lee , Irfan Essa

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation

While language-guided image manipulation has made remarkable progress, the challenge of how to instruct the manipulation process faithfully reflecting human intentions persists. An accurate and comprehensive description of a manipulation…

Computer Vision and Pattern Recognition · Computer Science 2023-08-03 Yasheng Sun , Yifan Yang , Houwen Peng , Yifei Shen , Yuqing Yang , Han Hu , Lili Qiu , Hideki Koike

Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

Manipulating images of complex scenes to reconstruct, insert and/or remove specific object instances is a challenging task. Complex scenes contain multiple semantics and objects, which are frequently cluttered or ambiguous, thus hampering…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Pierfrancesco Ardino , Yahui Liu , Elisa Ricci , Bruno Lepri , Marco De Nadai

Action-based image editing guided by human instructions

Text-based image editing is typically approached as a static task that involves operations such as inserting, deleting, or modifying elements of an input image based on human instructions. Given the static nature of this task, in this…

Computer Vision and Pattern Recognition · Computer Science 2025-02-05 Maria Mihaela Trusca , Mingxiao Li , Marie-Francine Moens

Improving Image Restoration through Removing Degradations in Textual Representations

In this paper, we introduce a new perspective for improving image restoration by removing degradation in the textual representations of a given degraded image. Intuitively, restoration is much easier on text modality than image one. For…

Computer Vision and Pattern Recognition · Computer Science 2024-01-01 Jingbo Lin , Zhilu Zhang , Yuxiang Wei , Dongwei Ren , Dongsheng Jiang , Wangmeng Zuo

Open-Edit: Open-Domain Image Manipulation with Open-Vocabulary Instructions

We propose a novel algorithm, named Open-Edit, which is the first attempt on open-domain image manipulation with open-vocabulary instructions. It is a challenging task considering the large variation of image domains and the lack of…

Computer Vision and Pattern Recognition · Computer Science 2021-04-22 Xihui Liu , Zhe Lin , Jianming Zhang , Handong Zhao , Quan Tran , Xiaogang Wang , Hongsheng Li

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

Recent advances in diffusion models enable many powerful instruments for image editing. One of these instruments is text-driven image manipulations: editing semantic attributes of an image according to the provided text description. %…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Nikita Starodubcev , Dmitry Baranchuk , Valentin Khrulkov , Artem Babenko

Text-Guided Mask-free Local Image Retouching

In the realm of multi-modality, text-guided image retouching techniques emerged with the advent of deep learning. Most currently available text-guided methods, however, rely on object-level supervision to constrain the region that may be…

Computer Vision and Pattern Recognition · Computer Science 2023-02-27 Zerun Liu , Fan Zhang , Jingxuan He , Jin Wang , Zhangye Wang , Lechao Cheng

Prompt Augmentation for Self-supervised Text-guided Image Manipulation

Text-guided image editing finds applications in various creative and practical fields. While recent studies in image generation have advanced the field, they often struggle with the dual challenges of coherent image transformation and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Rumeysa Bodur , Binod Bhattarai , Tae-Kyun Kim

Point and Instruct: Enabling Precise Image Editing by Unifying Direct Manipulation and Text Instructions

Machine learning has enabled the development of powerful systems capable of editing images from natural language instructions. However, in many common scenarios it is difficult for users to specify precise image transformations with text…

Artificial Intelligence · Computer Science 2024-02-14 Alec Helbling , Seongmin Lee , Polo Chau

Optimisation-Based Multi-Modal Semantic Image Editing

Image editing affords increased control over the aesthetics and content of generated images. Pre-existing works focus predominantly on text-based instructions to achieve desired image modifications, which limit edit precision and accuracy.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-29 Bowen Li , Yongxin Yang , Steven McDonagh , Shifeng Zhang , Petru-Daniel Tudosiu , Sarah Parisot

Visual Instruction Inversion: Image Editing via Visual Prompting

Text-conditioned image editing has emerged as a powerful tool for editing images. However, in many situations, language can be ambiguous and ineffective in describing specific image edits. When faced with such challenges, visual prompts can…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Thao Nguyen , Yuheng Li , Utkarsh Ojha , Yong Jae Lee

Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models

With the rise of large, publicly-available text-to-image diffusion models, text-guided real image editing has garnered much research attention recently. Existing methods tend to either rely on some form of per-instance or per-task…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Adham Elarabawy , Harish Kamath , Samuel Denton

Interactive Image Inpainting Using Semantic Guidance

Image inpainting approaches have achieved significant progress with the help of deep neural networks. However, existing approaches mainly focus on leveraging the priori distribution learned by neural networks to produce a single inpainting…

Computer Vision and Pattern Recognition · Computer Science 2022-01-27 Wangbo Yu , Jinhao Du , Ruixin Liu , Yixuan Li , Yuesheng zhu

Neural Image Compression with Text-guided Encoding for both Pixel-level and Perceptual Fidelity

Recent advances in text-guided image compression have shown great potential to enhance the perceptual quality of reconstructed images. These methods, however, tend to have significantly degraded pixel-wise fidelity, limiting their…

Computer Vision and Pattern Recognition · Computer Science 2024-05-24 Hagyeong Lee , Minkyu Kim , Jun-Hyuk Kim , Seungeon Kim , Dokwan Oh , Jaeho Lee

Learning to Follow Object-Centric Image Editing Instructions Faithfully

Natural language instructions are a powerful interface for editing the outputs of text-to-image diffusion models. However, several challenges need to be addressed: 1) underspecification (the need to model the implicit meaning of…

Computation and Language · Computer Science 2023-10-31 Tuhin Chakrabarty , Kanishk Singh , Arkadiy Saakyan , Smaranda Muresan

Text-IF: Leveraging Semantic Text Guidance for Degradation-Aware and Interactive Image Fusion

Image fusion aims to combine information from different source images to create a comprehensively representative image. Existing fusion methods are typically helpless in dealing with degradations in low-quality source images and…

Computer Vision and Pattern Recognition · Computer Science 2024-03-26 Xunpeng Yi , Han Xu , Hao Zhang , Linfeng Tang , Jiayi Ma

Beyond Editing Pairs: Fine-Grained Instructional Image Editing via Multi-Scale Learnable Regions

Current text-driven image editing methods typically follow one of two directions: relying on large-scale, high-quality editing pair datasets to improve editing precision and diversity, or exploring alternative dataset-free techniques.…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Chenrui Ma , Xi Xiao , Tianyang Wang , Yanning Shen

Semantic Image Manipulation Using Scene Graphs

Image manipulation can be considered a special case of image generation where the image to be produced is a modification of an existing image. Image generation and manipulation have been, for the most part, tasks that operate on raw pixels.…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Helisa Dhamo , Azade Farshad , Iro Laina , Nassir Navab , Gregory D. Hager , Federico Tombari , Christian Rupprecht

Interactive Image Restoration

Machine learning and many of its applications are considered hard to approach due to their complexity and lack of transparency. One mission of human-centric machine learning is to improve algorithm transparency and user satisfaction while…

Human-Computer Interaction · Computer Science 2019-10-25 Zhiwei Han , Thomas Weber , Stefan Matthes , Yuanting Liu , Hao Shen