Related papers: Visual-Semantic Embedding Model Informed by Struct…

Learning Structured Semantic Embeddings for Visual Recognition

Numerous embedding models have been recently explored to incorporate semantic knowledge into visual recognition. Existing methods typically focus on minimizing the distance between the corresponding images and texts in the embedding space…

Computer Vision and Pattern Recognition · Computer Science 2017-06-06 Dong Li , Hsin-Ying Lee , Jia-Bin Huang , Shengjin Wang , Ming-Hsuan Yang

VGSE: Visually-Grounded Semantic Embeddings for Zero-Shot Learning

Human-annotated attributes serve as powerful semantic embeddings in zero-shot learning. However, their annotation process is labor-intensive and needs expert supervision. Current unsupervised semantic embeddings, i.e., word embeddings,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-29 Wenjia Xu , Yongqin Xian , Jiuniu Wang , Bernt Schiele , Zeynep Akata

Visual-Semantic Decomposition and Partial Alignment for Document-based Zero-Shot Learning

Recent work shows that documents from encyclopedias serve as helpful auxiliary information for zero-shot learning. Existing methods align the entire semantics of a document with corresponding images to transfer knowledge. However, they…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Xiangyan Qu , Jing Yu , Keke Gai , Jiamin Zhuang , Yuanmin Tang , Gang Xiong , Gaopeng Gou , Qi Wu

Semantically Grounded Visual Embeddings for Zero-Shot Learning

Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks. This is a weakness of current zero-shot learning frameworks as…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Shah Nawaz , Jacopo Cavazza , Alessio Del Bue

Language Models as Zero-shot Visual Semantic Learners

Visual Semantic Embedding (VSE) models, which map images into a rich semantic embedding space, have been a milestone in object recognition and zero-shot learning. Current approaches to VSE heavily rely on static word em-bedding techniques.…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Yue Jiao , Jonathon Hare , Adam Prügel-Bennett

Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classification

This paper addresses the task of zero-shot image classification. The key contribution of the proposed approach is to control the semantic embedding of images -- one of the main ingredients of zero-shot learning -- by formulating it as a…

Computer Vision and Pattern Recognition · Computer Science 2016-07-28 Maxime Bucher , Stéphane Herbin , Frédéric Jurie

Learning Visually Consistent Label Embeddings for Zero-Shot Learning

In this work, we propose a zero-shot learning method to effectively model knowledge transfer between classes via jointly learning visually consistent word vectors and label embedding model in an end-to-end manner. The main idea is to…

Computer Vision and Pattern Recognition · Computer Science 2019-05-17 Berkan Demirel , Ramazan Gokberk Cinbis , Nazli Ikizler-Cinbis

Zero-Shot Learning by Convex Combination of Semantic Embeddings

Several recent publications have proposed methods for mapping images into continuous semantic embedding spaces. In some cases the embedding space is trained jointly with the image transformation. In other cases the semantic embedding space…

Machine Learning · Computer Science 2017-02-28 Mohammad Norouzi , Tomas Mikolov , Samy Bengio , Yoram Singer , Jonathon Shlens , Andrea Frome , Greg S. Corrado , Jeffrey Dean

Learning semantic Image attributes using Image recognition and knowledge graph embeddings

Extracting structured knowledge from texts has traditionally been used for knowledge base generation. However, other sources of information, such as images can be leveraged into this process to build more complete and richer knowledge…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 Ashutosh Tiwari , Sandeep Varma

Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval

Visual-semantic embedding is an interesting research topic because it is useful for various tasks, such as visual question answering (VQA), image-text retrieval, image captioning, and scene graph generation. In this paper, we focus on…

Computer Vision and Pattern Recognition · Computer Science 2021-09-29 Kazuya Ueki

Efficient Multi-Modal Embeddings from Structured Data

Multi-modal word semantics aims to enhance embeddings with perceptual input, assuming that human meaning representation is grounded in sensory experience. Most research focuses on evaluation involving direct visual input, however, visual…

Computation and Language · Computer Science 2021-10-07 Anita L. Verő , Ann Copestake

Zero-Shot Audio Classification using Image Embeddings

Supervised learning methods can solve the given problem in the presence of a large set of labeled data. However, the acquisition of a dataset covering all the target classes typically requires manual labeling which is expensive and…

Sound · Computer Science 2022-06-13 Duygu Dogan , Huang Xie , Toni Heittola , Tuomas Virtanen

Zero-Shot Recognition through Image-Guided Semantic Classification

We present a new embedding-based framework for zero-shot learning (ZSL). Most embedding-based methods aim to learn the correspondence between an image classifier (visual representation) and its class prototype (semantic representation) for…

Computer Vision and Pattern Recognition · Computer Science 2020-07-24 Mei-Chen Yeh , Fang Li

Learning semantic sentence representations from visually grounded language without lexical knowledge

Current approaches to learning semantic representations of sentences often use prior word-level knowledge. The current study aims to leverage visual information in order to capture sentence level semantics without the need for word…

Computation and Language · Computer Science 2019-09-25 Danny Merkx , Stefan Frank

Predicting Visual Exemplars of Unseen Classes for Zero-Shot Learning

Leveraging class semantic descriptions and examples of known objects, zero-shot learning makes it possible to train a recognition model for an object class whose examples are not available. In this paper, we propose a novel zero-shot…

Computer Vision and Pattern Recognition · Computer Science 2017-08-22 Soravit Changpinyo , Wei-Lun Chao , Fei Sha

Hierarchy-based Image Embeddings for Semantic Image Retrieval

Deep neural networks trained for classification have been found to learn powerful image representations, which are also often used for other tasks such as comparing images w.r.t. their visual similarity. However, visual similarity does not…

Computer Vision and Pattern Recognition · Computer Science 2019-07-24 Björn Barz , Joachim Denzler

Webly Supervised Semantic Embeddings for Large Scale Zero-Shot Learning

Zero-shot learning (ZSL) makes object recognition in images possible in absence of visual training data for a part of the classes from a dataset. When the number of classes is large, classes are usually represented by semantic class…

Computer Vision and Pattern Recognition · Computer Science 2020-08-10 Yannick Le Cacheux , Adrian Popescu , Hervé Le Borgne

Multi-Instance Visual-Semantic Embedding

Visual-semantic embedding models have been recently proposed and shown to be effective for image classification and zero-shot learning, by mapping images into a continuous semantic label space. Although several approaches have been proposed…

Computer Vision and Pattern Recognition · Computer Science 2015-12-23 Zhou Ren , Hailin Jin , Zhe Lin , Chen Fang , Alan Yuille

Not just a matter of semantics: the relationship between visual similarity and semantic similarity

Knowledge transfer, zero-shot learning and semantic image retrieval are methods that aim at improving accuracy by utilizing semantic information, e.g. from WordNet. It is assumed that this information can augment or replace missing visual…

Computer Vision and Pattern Recognition · Computer Science 2019-06-03 Clemens-Alexander Brust , Joachim Denzler

Beyond Word Embeddings: Learning Entity and Concept Representations from Large Scale Knowledge Bases

Text representations using neural word embeddings have proven effective in many NLP applications. Recent researches adapt the traditional word embedding models to learn vectors of multiword expressions (concepts/entities). However, these…

Computation and Language · Computer Science 2018-12-21 Walid Shalaby , Wlodek Zadrozny , Hongxia Jin