Related papers: Learning Robust Visual-Semantic Embeddings

Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval

Cross-modal retrieval between visual data and natural language description remains a long-standing challenge in multimedia. While recent image-text retrieval methods offer great promise by learning deep representations aligned across…

Multimedia · Computer Science 2018-08-24 Niluthpol Chowdhury Mithun , Rameswar Panda , Evangelos E. Papalexakis , Amit K. Roy-Chowdhury

Learning Deep Representations of Fine-grained Visual Descriptions

State-of-the-art methods for zero-shot visual recognition formulate learning as a joint embedding problem of images and side information. In these formulations the current best complement to visual features are attributes: manually encoded…

Computer Vision and Pattern Recognition · Computer Science 2016-05-19 Scott Reed , Zeynep Akata , Bernt Schiele , Honglak Lee

Towards Universal Representation Learning for Deep Face Recognition

Recognizing wild faces is extremely hard as they appear with all kinds of variations. Traditional methods either train with specifically annotated variation data from target domains, or by introducing unlabeled target variation data to…

Computer Vision and Pattern Recognition · Computer Science 2020-02-28 Yichun Shi , Xiang Yu , Kihyuk Sohn , Manmohan Chandraker , Anil K. Jain

Local Aggregation for Unsupervised Learning of Visual Embeddings

Unsupervised approaches to learning in neural networks are of substantial interest for furthering artificial intelligence, both because they would enable the training of networks without the need for large numbers of expensive annotations,…

Computer Vision and Pattern Recognition · Computer Science 2019-04-11 Chengxu Zhuang , Alex Lin Zhai , Daniel Yamins

Self-Supervised Learning from Web Data for Multimodal Retrieval

Self-Supervised learning from multimodal image and text data allows deep neural networks to learn powerful features with no need of human annotated data. Web and Social Media platforms provide a virtually unlimited amount of this multimodal…

Computer Vision and Pattern Recognition · Computer Science 2019-01-09 Raul Gomez , Lluis Gomez , Jaume Gibert , Dimosthenis Karatzas

Semantically Grounded Visual Embeddings for Zero-Shot Learning

Zero-shot learning methods rely on fixed visual and semantic embeddings, extracted from independent vision and language models, both pre-trained for other large-scale tasks. This is a weakness of current zero-shot learning frameworks as…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Shah Nawaz , Jacopo Cavazza , Alessio Del Bue

Evaluation of Output Embeddings for Fine-Grained Image Classification

Image classification has advanced significantly in recent years with the availability of large-scale image sets. However, fine-grained classification remains a major challenge due to the annotation cost of large numbers of fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2016-04-21 Zeynep Akata , Scott Reed , Daniel Walter , Honglak Lee , Bernt Schiele

Learning Structured Semantic Embeddings for Visual Recognition

Numerous embedding models have been recently explored to incorporate semantic knowledge into visual recognition. Existing methods typically focus on minimizing the distance between the corresponding images and texts in the embedding space…

Computer Vision and Pattern Recognition · Computer Science 2017-06-06 Dong Li , Hsin-Ying Lee , Jia-Bin Huang , Shengjin Wang , Ming-Hsuan Yang

A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings

Recent work has managed to learn cross-lingual word embeddings without parallel data by mapping monolingual embeddings to a shared space through adversarial training. However, their evaluation has focused on favorable conditions, using…

Computation and Language · Computer Science 2021-12-28 Mikel Artetxe , Gorka Labaka , Eneko Agirre

AugNet: End-to-End Unsupervised Visual Representation Learning with Image Augmentation

Most of the achievements in artificial intelligence so far were accomplished by supervised learning which requires numerous annotated training data and thus costs innumerable manpower for labeling. Unsupervised learning is one of the…

Computer Vision and Pattern Recognition · Computer Science 2021-06-14 Mingxiang Chen , Zhanguo Chang , Haonan Lu , Bitao Yang , Zhuang Li , Liufang Guo , Zhecheng Wang

Learning Deep Convolutional Embeddings for Face Representation Using Joint Sample- and Set-based Supervision

In this work, we investigate several methods and strategies to learn deep embeddings for face recognition, using joint sample- and set-based optimization. We explain our framework that expands traditional learning with set-based supervision…

Computer Vision and Pattern Recognition · Computer Science 2020-09-09 Baris Gecer , Vassileios Balntas , Tae-Kyun Kim

Unsupervised Image Classification for Deep Representation Learning

Deep clustering against self-supervised learning is a very important and promising direction for unsupervised visual representation learning since it requires little domain knowledge to design pretext tasks. However, the key component,…

Computer Vision and Pattern Recognition · Computer Science 2020-08-21 Weijie Chen , Shiliang Pu , Di Xie , Shicai Yang , Yilu Guo , Luojun Lin

Unsupervised Multimodal Language Representations using Convolutional Autoencoders

Multimodal Language Analysis is a demanding area of research, since it is associated with two requirements: combining different modalities and capturing temporal information. During the last years, several works have been proposed in the…

Computation and Language · Computer Science 2022-01-10 Panagiotis Koromilas , Theodoros Giannakopoulos

A Comprehensive Approach to Unsupervised Embedding Learning based on AND Algorithm

Unsupervised embedding learning aims to extract good representation from data without the need for any manual labels, which has been a critical challenge in many supervised learning tasks. This paper proposes a new unsupervised embedding…

Machine Learning · Computer Science 2020-02-28 Sungwon Han , Yizhan Xu , Sungwon Park , Meeyoung Cha , Cheng-Te Li

Points2Vec: Unsupervised Object-level Feature Learning from Point Clouds

Unsupervised representation learning techniques, such as learning word embeddings, have had a significant impact on the field of natural language processing. Similar representation learning techniques have not yet become commonplace in the…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Joël Bachmann , Kenneth Blomqvist , Julian Förster , Roland Siegwart

Learning to Learn from Web Data through Deep Semantic Embeddings

In this paper we propose to learn a multimodal image and text embedding from Web and Social Media data, aiming to leverage the semantic knowledge learnt in the text domain and transfer it to a visual model for semantic image retrieval. We…

Computer Vision and Pattern Recognition · Computer Science 2018-08-21 Raul Gomez , Lluis Gomez , Jaume Gibert , Dimosthenis Karatzas

Dual-Path Convolutional Image-Text Embeddings with Instance Loss

Matching images and sentences demands a fine understanding of both modalities. In this paper, we propose a new system to discriminatively embed the image and text to a shared visual-textual space. In this field, most existing works apply…

Computer Vision and Pattern Recognition · Computer Science 2021-07-28 Zhedong Zheng , Liang Zheng , Michael Garrett , Yi Yang , Mingliang Xu , Yi-Dong Shen

Unsupervised Embedding Learning via Invariant and Spreading Instance Feature

This paper studies the unsupervised embedding learning problem, which requires an effective similarity measurement between samples in low-dimensional embedding space. Motivated by the positive concentrated and negative separated properties…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Mang Ye , Xu Zhang , Pong C. Yuen , Shih-Fu Chang

Semi-supervised Visual Feature Integration for Pre-trained Language Models

Integrating visual features has been proved useful for natural language understanding tasks. Nevertheless, in most existing multimodal language models, the alignment of visual and textual data is expensive. In this paper, we propose a novel…

Computation and Language · Computer Science 2020-08-14 Lisai Zhang , Qingcai Chen , Dongfang Li , Buzhou Tang

On the Robustness of Unsupervised and Semi-supervised Cross-lingual Word Embedding Learning

Cross-lingual word embeddings are vector representations of words in different languages where words with similar meaning are represented by similar vectors, regardless of the language. Recent developments which construct these embeddings…

Computation and Language · Computer Science 2020-03-04 Yerai Doval , Jose Camacho-Collados , Luis Espinosa-Anke , Steven Schockaert