Related papers: Comparison Knowledge Translation for Generalizable…

Visual Boundary Knowledge Translation for Foreground Segmentation

When confronted with objects of unknown types in an image, humans can effortlessly and precisely tell their visual boundaries. This recognition mechanism and underlying generalization capability seem to contrast to state-of-the-art image…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-03 · Zunlei Feng, Lechao Cheng, Xinchao Wang, Xiang Wang, Yajie Liu, Xiangtong Du, Mingli Song

Learning an Adaptation Function to Assess Image Visual Similarities

Human perception is routinely assessing the similarity between images, both for decision making and creative thinking. But the underlying cognitive process is not really well understood yet, hence difficult to be mimicked by computer vision…

Computer Vision and Pattern Recognition · Computer Science · 2022-06-06 · Olivier Risser-Maroix, Amine Marzouki, Hala Djeghim, Camille Kurtz, Nicolas Lomenie

Joint Learning of Generative Translator and Classifier for Visually Similar Classes

In this paper, we propose a Generative Translation Classification Network (GTCN) for improving visual classification accuracy in settings where classes are visually similar and data is scarce. For this purpose, we propose joint learning…

Computer Vision and Pattern Recognition · Computer Science · 2020-12-03 · ByungIn Yoo, Tristan Sylvain, Yoshua Bengio, Junmo Kim

Comparator Networks

The objective of this work is set-based verification, e.g. to decide if two sets of images of a face are of the same person or not. The traditional approach to this problem is to learn to generate a feature vector per image, aggregate them…

Computer Vision and Pattern Recognition · Computer Science · 2018-07-31 · Weidi Xie, Li Shen, Andrew Zisserman

Convolutional Neural Networks Can (Meta-)Learn the Same-Different Relation

While convolutional neural networks (CNNs) have come to match and exceed human performance in many settings, the tasks these models optimize for are largely constrained to the level of individual objects, such as classification and…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-09 · Max Gupta, Sunayana Rane, R. Thomas McCoy, Thomas L. Griffiths

Semantic and Visual Similarities for Efficient Knowledge Transfer in CNN Training

In recent years, representation learning approaches have disrupted many multimedia computing tasks. Among those approaches, deep convolutional neural networks (CNNs) have notably reached human level expertise on some constrained image…

Computer Vision and Pattern Recognition · Computer Science · 2019-10-01 · Lucas Pascal, Xavier Bost, Benoît Huet

Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection

Deep CNN-based object detection systems have achieved remarkable success on several large-scale object detection benchmarks. However, training such detectors requires a large number of labeled bounding boxes, which are more difficult to…

Computer Vision and Pattern Recognition · Computer Science · 2018-03-14 · Yuxing Tang, Josiah Wang, Xiaofang Wang, Boyang Gao, Emmanuel Dellandrea, Robert Gaizauskas, Liming Chen

In-Context Translation: Towards Unifying Image Recognition, Processing, and Generation

We propose In-Context Translation (ICT), a general learning framework to unify visual recognition (e.g., semantic segmentation), low-level image processing (e.g., denoising), and conditional image generation (e.g., edge-to-image synthesis).…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-07 · Han Xue, Qianru Sun, Li Song, Wenjun Zhang, Zhiwu Huang

Deep Image Category Discovery using a Transferred Similarity Function

Automatically discovering image categories in unlabeled natural images is one of the important goals of unsupervised learning. However, the task is challenging and even human beings define visual categories based on a large amount of prior…

Computer Vision and Pattern Recognition · Computer Science · 2016-12-06 · Yen-Chang Hsu, Zhaoyang Lv, Zsolt Kira

Categorical Knowledge Fused Recognition: Fusing Hierarchical Knowledge with Image Classification through Aligning and Deep Metric Learning

Image classification is a fundamental computer vision task and an important baseline for deep metric learning. In decades efforts have been made on enhancing image classification accuracy by using deep learning models while less attention…

Computer Vision and Pattern Recognition · Computer Science · 2025-01-14 · Yunfeng Zhao, Huiyu Zhou, Fei Wu, Xifeng Wu

Semantic Clustering based Deduction Learning for Image Recognition and Classification

The paper proposes a semantic clustering based deduction learning by mimicking the learning and thinking process of human brains. Human beings can make judgments based on experience and cognition, and as a result, no one would recognize an…

Computer Vision and Pattern Recognition · Computer Science · 2021-12-28 · Wenchi Ma, Xuemin Tu, Bo Luo, Guanghui Wang

Memory-Modular Classification: Learning to Generalize with Memory Replacement

We propose a novel memory-modular learner for image classification that separates knowledge memorization from reasoning. Our model enables effective generalization to new classes by simply replacing the memory contents, without the need for…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-09 · Dahyun Kang, Ahmet Iscen, Eunchan Jo, Sua Choi, Minsu Cho, Cordelia Schmid

Show, Translate and Tell

Humans have an incredible ability to process and understand information from multiple sources such as images, video, text, and speech. Recent success of deep neural networks has enabled us to develop algorithms which give machines the…

Computer Vision and Pattern Recognition · Computer Science · 2019-03-18 · Dheeraj Peri, Shagan Sah, Raymond Ptucha

Seeing in Words: Learning to Classify through Language Bottlenecks

Neural networks for computer vision extract uninterpretable features despite achieving high accuracy on benchmarks. In contrast, humans can explain their predictions using succinct and intuitive descriptions. To incorporate explainability…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-04 · Khalid Saifullah, Yuxin Wen, Jonas Geiping, Micah Goldblum, Tom Goldstein

Deep Cross-media Knowledge Transfer

Cross-media retrieval is a research hotspot in multimedia area, which aims to perform retrieval across different media types such as image and text. The performance of existing methods usually relies on labeled data for model training.…

Multimedia · Computer Science · 2018-03-13 · Xin Huang, Yuxin Peng

Categorizing Semantic Representations for Neural Machine Translation

Modern neural machine translation (NMT) models have achieved competitive performance in standard benchmarks. However, they have recently been shown to suffer limitation in compositional generalization, failing to effectively learn the…

Computation and Language · Computer Science · 2022-10-14 · Yongjing Yin, Yafu Li, Fandong Meng, Jie Zhou, Yue Zhang

A Novel Word Sense Disambiguation Approach Using WordNet Knowledge Graph

Various applications in computational linguistics and artificial intelligence rely on high-performing word sense disambiguation techniques to solve challenging tasks such as information retrieval, machine translation, question answering,…

Computation and Language · Computer Science · 2021-01-11 · Mohannad AlMousa, Rachid Benlamri, Richard Khoury

The Role of Typicality in Object Classification: Improving The Generalization Capacity of Convolutional Neural Networks

Deep artificial neural networks have made remarkable progress in different tasks in the field of computer vision. However, the empirical analysis of these models and investigation of their failure cases has received attention recently. In…

Computer Vision and Pattern Recognition · Computer Science · 2016-02-10 · Babak Saleh, Ahmed Elgammal, Jacob Feldman

Image Classification with A Deep Network Model based on Compressive Sensing

To simplify the parameter of the deep learning network, a cascaded compressive sensing model "CSNet" is implemented for image classification. Firstly, we use cascaded compressive sensing network to learn feature from the data. Secondly,…

Computer Vision and Pattern Recognition · Computer Science · 2014-09-26 · Yufei Gan, Tong Zhuo, Chu He

Seeing eye-to-eye? A comparison of object recognition performance in humans and deep convolutional neural networks under image manipulation

For a considerable time, deep convolutional neural networks (DCNNs) have reached human benchmark performance in object recognition. On that account, computational neuroscience and the field of machine learning have started to attribute…

Computer Vision and Pattern Recognition · Computer Science · 2020-12-15 · Leonard E. van Dyck, Walter R. Gruber