Related papers: Program-Guided Image Manipulators

Neurosymbolic Models for Computer Graphics

Procedural models (i.e. symbolic programs that output visual data) are a historically-popular method for representing graphics content: vegetation, buildings, textures, etc. They offer many advantages: interpretable design parameters,…

Graphics · Computer Science 2023-04-21 Daniel Ritchie , Paul Guerrero , R. Kenny Jones , Niloy J. Mitra , Adriana Schulz , Karl D. D. Willis , Jiajun Wu

Proactive Image Manipulation Detection

Image manipulation detection algorithms are often trained to discriminate between images manipulated with particular Generative Models (GMs) and genuine/real images, yet generalize poorly to images manipulated with GMs unseen in the…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Vishal Asnani , Xi Yin , Tal Hassner , Sijia Liu , Xiaoming Liu

Generative Memory-Guided Semantic Reasoning Model for Image Inpainting

Most existing methods for image inpainting focus on learning the intra-image priors from the known regions of the current input image to infer the content of the corrupted regions in the same image. While such methods perform well on images…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Xin Feng , Wenjie Pei , Fengjun Li , Fanglin Chen , David Zhang , Guangming Lu

Interactive Image Inpainting Using Semantic Guidance

Image inpainting approaches have achieved significant progress with the help of deep neural networks. However, existing approaches mainly focus on leveraging the priori distribution learned by neural networks to produce a single inpainting…

Computer Vision and Pattern Recognition · Computer Science 2022-01-27 Wangbo Yu , Jinhao Du , Ruixin Liu , Yixuan Li , Yuesheng Zhu

Hypernetwork functional image representation

Motivated by the human way of memorizing images we introduce their functional representation, where an image is represented by a neural network. For this purpose, we construct a hypernetwork which takes an image and returns weights to the…

Machine Learning · Computer Science 2019-11-26 Sylwester Klocek , Łukasz Maziarka , Maciej Wołczyk , Jacek Tabor , Jakub Nowak , Marek Śmieja

Image Inpainting Using AutoEncoder and Guided Selection of Predicted Pixels

Image inpainting is an effective method to enhance distorted digital images. Different inpainting methods use the information of neighboring pixels to predict the value of missing pixels. Recently deep neural networks have been used to…

Computer Vision and Pattern Recognition · Computer Science 2021-12-20 Mohammad H. Givkashi , Mahshid Hadipour , Arezoo PariZanganeh , Zahra Nabizadeh , Nader Karimi , Shadrokh Samavi

Fast-PGM: Fast Probabilistic Graphical Model Learning and Inference

Probabilistic graphical models (PGMs) serve as a powerful framework for modeling complex systems with uncertainty and extracting valuable insights from data. However, users face challenges when applying PGMs to their problems in terms of…

Machine Learning · Computer Science 2024-05-29 Jiantong Jiang , Zeyi Wen , Peiyu Yang , Atif Mansoor , Ajmal Mian

Text as Neural Operator: Image Manipulation by Text Instruction

In recent years, text-guided image manipulation has gained increasing attention in the multimedia and computer vision community. The input to conditional image generation has evolved from image-only to multimodality. In this paper, we study…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Tianhao Zhang , Hung-Yu Tseng , Lu Jiang , Weilong Yang , Honglak Lee , Irfan Essa

SG-MIM: Structured Knowledge Guided Efficient Pre-training for Dense Prediction

Masked Image Modeling (MIM) techniques have redefined the landscape of computer vision, enabling pre-trained models to achieve exceptional performance across a broad spectrum of tasks. Despite their success, the full potential of MIM-based…

Computer Vision and Pattern Recognition · Computer Science 2024-09-05 Sumin Son , Hyesong Choi , Dongbo Min

Masked Image Modeling as a Framework for Self-Supervised Learning across Eye Movements

To make sense of their surroundings, intelligent systems must transform complex sensory inputs to structured codes that are reduced to task-relevant information such as object category. Biological agents achieve this in a largely autonomous…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Robin Weiler , Matthias Brucklacher , Cyriel M. A. Pennartz , Sander M. Bohté

Generative Visual Manipulation on the Natural Image Manifold

Realistic image manipulation is challenging because it requires modifying the image appearance in a user-controlled way, while preserving the realism of the result. Unless the user has considerable artistic skill, it is easy to "fall off"…

Computer Vision and Pattern Recognition · Computer Science 2018-12-18 Jun-Yan Zhu , Philipp Krähenbühl , Eli Shechtman , Alexei A. Efros

Masked Image Modeling with Local Multi-Scale Reconstruction

Masked Image Modeling (MIM) achieves outstanding success in self-supervised representation learning. Unfortunately, MIM models typically have huge computational burden and slow learning process, which is an inevitable obstacle for their…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Haoqing Wang , Yehui Tang , Yunhe Wang , Jianyuan Guo , Zhi-Hong Deng , Kai Han

Remember What You have drawn: Semantic Image Manipulation with Memory

Image manipulation with natural language, which aims to manipulate images with the guidance of language descriptions, has been a challenging problem in the fields of computer vision and natural language processing (NLP). Currently, a number…

Computer Vision and Pattern Recognition · Computer Science 2021-07-28 Xiangxi Shi , Zhonghua Wu , Guosheng Lin , Jianfei Cai , Shafiq Joty

DPPMask: Masked Image Modeling with Determinantal Point Processes

Masked Image Modeling (MIM) has achieved impressive representative performance with the aim of reconstructing randomly masked images. Despite the empirical success, most previous works have neglected the important fact that it is…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Junde Xu , Zikai Lin , Donghao Zhou , Yaodong Yang , Xiangyun Liao , Bian Wu , Guangyong Chen , Pheng-Ann Heng

CIMGEN: Controlled Image Manipulation by Finetuning Pretrained Generative Models on Limited Data

Content creation and image editing can benefit from flexible user controls. A common intermediate representation for conditional image generation is a semantic map, that has information of objects present in the image. When compared to raw…

Artificial Intelligence · Computer Science 2024-01-25 Chandrakanth Gudavalli , Erik Rosten , Lakshmanan Nataraj , Shivkumar Chandrasekaran , B. S. Manjunath

Semantic Photo Manipulation with a Generative Image Prior

Despite the recent success of GANs in synthesizing images conditioned on inputs such as a user sketch, text, or semantic labels, manipulating the high-level attributes of an existing natural photograph with GANs is challenging for two…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 David Bau , Hendrik Strobelt , William Peebles , Jonas Wulff , Bolei Zhou , Jun-Yan Zhu , Antonio Torralba

Learning Neuro-symbolic Programs for Language Guided Robot Manipulation

Given a natural language instruction and an input scene, our goal is to train a model to output a manipulation program that can be executed by the robot. Prior approaches for this task possess one of the following limitations: (i) rely on…

Robotics · Computer Science 2024-04-03 Namasivayam Kalithasan , Himanshu Singh , Vishal Bindal , Arnav Tuli , Vishwajeet Agrawal , Rahul Jain , Parag Singla , Rohan Paul

Human-Aligned Image Models Improve Visual Decoding from the Brain

Decoding visual images from brain activity has significant potential for advancing brain-computer interaction and enhancing the understanding of human perception. Recent approaches align the representation spaces of images and brain…

Computer Vision and Pattern Recognition · Computer Science 2025-06-11 Nona Rajabi , Antônio H. Ribeiro , Miguel Vasco , Farzaneh Taleb , Mårten Björkman , Danica Kragic

Semantic-Guided Inpainting Network for Complex Urban Scenes Manipulation

Manipulating images of complex scenes to reconstruct, insert and/or remove specific object instances is a challenging task. Complex scenes contain multiple semantics and objects, which are frequently cluttered or ambiguous, thus hampering…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Pierfrancesco Ardino , Yahui Liu , Elisa Ricci , Bruno Lepri , Marco De Nadai

Image Shape Manipulation from a Single Augmented Training Sample

In this paper, we present DeepSIM, a generative model for conditional image manipulation based on a single image. We find that extensive augmentation is key for enabling single image training, and incorporate the use of thin-plate-spline…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Yael Vinker , Eliahu Horwitz , Nir Zabari , Yedid Hoshen