Related papers: Learning to Edit Visual Programs with Self-Supervi…

Boosting Few-Shot Visual Learning with Self-Supervision

Few-shot learning and self-supervised learning address different facets of the same problem: how to train a model with little or no labeled data. Few-shot learning aims for optimization methods and models that can learn efficiently to…

Computer Vision and Pattern Recognition · Computer Science 2019-06-13 Spyros Gidaris, Andrei Bursuc, Nikos Komodakis, Patrick Pérez, Matthieu Cord

End-to-End Visual Editing with a Generatively Pre-Trained Artist

We consider the targeted image editing problem: blending a region in a source image with a driver image that specifies the desired change. Differently from prior works, we solve this problem by learning a conditional probability…

Computer Vision and Pattern Recognition · Computer Science 2022-05-04 Andrew Brown, Cheng-Yang Fu, Omkar Parkhi, Tamara L. Berg, Andrea Vedaldi

TextTopicNet - Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces

The immense success of deep learning based methods in computer vision heavily relies on large scale training datasets. These richly annotated datasets help the network learn discriminative visual features. Collecting and annotating such…

Computer Vision and Pattern Recognition · Computer Science 2018-07-09 Yash Patel, Lluis Gomez, Raul Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

Learning to Infer Generative Template Programs for Visual Concepts

People grasp flexible visual concepts from a few examples. We explore a neurosymbolic system that learns how to infer programs that capture visual concepts in a domain-general fashion. We introduce Template Programs: programmatic…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 R. Kenny Jones, Siddhartha Chaudhuri, Daniel Ritchie

Customize Your Own Paired Data via Few-shot Way

Existing solutions to image editing tasks suffer from several issues. Though achieving remarkably satisfying generated results, some supervised methods require huge amounts of paired training data, which greatly limits their usages. The…

Computer Vision and Pattern Recognition · Computer Science 2024-05-22 Jinshu Chen, Bingchuan Li, Miao Hua, Panpan Xu, Qian He

Towards Data-Driven Automatic Video Editing

Automatic video editing involving at least the steps of selecting the most valuable footage from points of view of visual quality and the importance of action filmed; and cutting the footage into a brief and coherent visual story that would…

Computer Vision and Pattern Recognition · Computer Science 2019-07-18 Sergey Podlesnyy

Learning Where to Edit Vision Transformers

Model editing aims to data-efficiently correct predictive errors of large pre-trained models while ensuring generalization to neighboring failures and locality to minimize unintended effects on unrelated examples. While significant progress…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Yunqiao Yang, Long-Kai Huang, Shengzhuang Chen, Kede Ma, Ying Wei

Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization

When approaching a novel visual recognition problem in a specialized image domain, a common strategy is to start with a pre-trained deep neural network and fine-tune it to the specialized domain. If the target domain covers a smaller visual…

Computer Vision and Pattern Recognition · Computer Science 2017-07-31 Frederick Tung, Srikanth Muralidharan, Greg Mori

Matching Networks for One Shot Learning

Learning from a few examples remains a key challenge in machine learning. Despite recent advances in important domains such as vision and language, the standard supervised deep learning paradigm does not offer a satisfactory solution for…

Machine Learning · Computer Science 2018-01-01 Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Koray Kavukcuoglu, Daan Wierstra

Self-Supervised Viewpoint Learning From Image Collections

Training deep neural networks to estimate the viewpoint of objects requires large labeled training datasets. However, manually labeling viewpoints is notoriously hard, error-prone, and time-consuming. On the other hand, it is relatively…

Computer Vision and Pattern Recognition · Computer Science 2020-04-07 Siva Karthik Mustikovela, Varun Jampani, Shalini De Mello, Sifei Liu, Umar Iqbal, Carsten Rother, Jan Kautz

Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

Visual program synthesis is a promising approach to exploit the reasoning abilities of large language models for compositional computer vision tasks. Previous work has used few-shot prompting with frozen LLMs to synthesize visual programs.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Zaid Khan, Vijay Kumar BG, Samuel Schulter, Yun Fu, Manmohan Chandraker

FaceShop: Deep Sketch-based Face Image Editing

We present a novel system for sketch-based face image editing, enabling users to edit images intuitively by sketching a few strokes on a region of interest. Our interface features tools to express a desired image manipulation by providing…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Tiziano Portenier, Qiyang Hu, Attila Szabó, Siavash Arjomand Bigdeli, Paolo Favaro, Matthias Zwicker

ViewNet: Unsupervised Viewpoint Estimation from Conditional Generation

Understanding the 3D world without supervision is currently a major challenge in computer vision as the annotations required to supervise deep networks for tasks in this domain are expensive to obtain on a large scale. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Octave Mariotti, Oisin Mac Aodha, Hakan Bilen

Revisiting Fine-tuning for Few-shot Learning

Few-shot learning is the process of learning novel classes using only a few examples and it remains a challenging task in machine learning. Many sophisticated few-shot learning algorithms have been proposed based on the notion that networks…

Machine Learning · Computer Science 2019-10-04 Akihiro Nakamura, Tatsuya Harada

Prompt Tuning based Adapter for Vision-Language Model Adaption

Large pre-trained vision-language (VL) models have shown significant promise in adapting to various downstream tasks. However, fine-tuning the entire network is challenging due to the massive number of model parameters. To address this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Jingchen Sun, Jiayu Qin, Zihao Lin, Changyou Chen

Pre-Trained Vision-Language Models as Partial Annotators

Pre-trained vision-language models learn massive data to model unified representations of images and natural languages, which can be widely applied to downstream machine learning tasks. In addition to zero-shot inference, in order to better…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Qian-Wei Wang, Yuqiu Xie, Letian Zhang, Zimo Liu, Shu-Tao Xia

Self-Supervised training for blind multi-frame video denoising

We propose a self-supervised approach for training multi-frame video denoising networks. These networks predict frame t from a window of frames around t. Our self-supervised approach benefits from the video temporal consistency by…

Computer Vision and Pattern Recognition · Computer Science 2021-04-21 Valéry Dewil, Jérémy Anger, Axel Davy, Thibaud Ehret, Pablo Arias, Gabriele Facciolo

A Flexible Selection Scheme for Minimum-Effort Transfer Learning

Fine-tuning is a popular way of exploiting knowledge contained in a pre-trained convolutional network for a new visual recognition task. However, the orthogonal setting of transferring knowledge from a pretrained network to a visually…

Computer Vision and Pattern Recognition · Computer Science 2020-08-28 Amelie Royer, Christoph H. Lampert

Language-Guided Joint Audio-Visual Editing via One-Shot Adaptation

In this paper, we introduce a novel task called language-guided joint audio-visual editing. Given an audio and image pair of a sounding event, this task aims at generating new audio-visual content by editing the given sounding event…

Computer Vision and Pattern Recognition · Computer Science 2024-11-12 Susan Liang, Chao Huang, Yapeng Tian, Anurag Kumar, Chenliang Xu

Visual Prompting via Image Inpainting

How does one adapt a pre-trained visual model to novel downstream tasks without task-specific finetuning or any model modification? Inspired by prompting in NLP, this paper investigates visual prompting: given input-output image example(s)…

Computer Vision and Pattern Recognition · Computer Science 2022-09-02 Amir Bar, Yossi Gandelsman, Trevor Darrell, Amir Globerson, Alexei A. Efros