English
Related papers

Related papers: Cross-model convolutional neural network for multi…

200 papers

People can recognize scenes across many different modalities beyond natural images. In this paper, we investigate how to learn cross-modal scene representations that transfer across modalities. To study this problem, we introduce a new…

Computer Vision and Pattern Recognition · Computer Science 2016-07-26 Lluis Castrejon , Yusuf Aytar , Carl Vondrick , Hamed Pirsiavash , Antonio Torralba

In the problem of domain transfer learning, we learn a model for the predic-tion in a target domain from the data of both some source domains and the target domain, where the target domain is in lack of labels while the source domain has…

Computer Vision and Pattern Recognition · Computer Science 2018-05-21 Guohui Zhang , Gaoyuan Liang , Fang Su , Fanxin Qu , Jing-Yan Wang

Deep learning techniques have been successfully used in learning a common representation for multi-view data, wherein the different modalities are projected onto a common subspace. In a broader perspective, the techniques used to…

Computer Vision and Pattern Recognition · Computer Science 2017-11-02 Gaurav Bhatt , Piyush Jha , Balasubramanian Raman

Deep learning models face persistent challenges in training, particularly due to internal covariate shift and label shift. While single-mode normalization methods like Batch Normalization partially address these issues, they are constrained…

Machine Learning · Computer Science 2024-10-31 Bilal Faye , Hanane Azzag , Mustapha Lebbah , Djamel Bouchaffra

People can recognize scenes across many different modalities beyond natural images. In this paper, we investigate how to learn cross-modal scene representations that transfer across modalities. To study this problem, we introduce a new…

Computer Vision and Pattern Recognition · Computer Science 2016-10-31 Yusuf Aytar , Lluis Castrejon , Carl Vondrick , Hamed Pirsiavash , Antonio Torralba

With the prevalence of RGB-D cameras, multi-modal video data have become more available for human action recognition. One main challenge for this task lies in how to effectively leverage their complementary information. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2020-02-03 Sijie Song , Jiaying Liu , Yanghao Li , Zongming Guo

Cross-modality recognition has many important applications in science, law enforcement and entertainment. Popular methods to bridge the modality gap include reducing the distributional differences of representations of different modalities,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Xin Niu , Enyi Li , Jinchao Liu , Yan Wang , Margarita Osadchy , Yongchun Fang

In this paper, we study the problem of transfer learning with the attribute data. In the transfer learning problem, we want to leverage the data of the auxiliary and the target domains to build an effective model for the classification…

Machine Learning · Computer Science 2018-04-03 Fang Su , Jing-Yan Wang

Given a large unlabeled set of images, how to efficiently and effectively group them into clusters based on extracted visual representations remains a challenging problem. To address this problem, we propose a convolutional neural network…

Computer Vision and Pattern Recognition · Computer Science 2017-08-14 Chih-Chung Hsu , Chia-Wen Lin

The storage and computation requirements of Convolutional Neural Networks (CNNs) can be prohibitive for exploiting these models over low-power or embedded devices. This paper reduces the computational complexity of the CNNs by minimizing an…

Neural and Evolutionary Computing · Computer Science 2017-01-17 Farkhondeh Kiaee , Christian Gagné , Mahdieh Abbasi

Most existing neural networks for learning graphs address permutation invariance by conceiving of the network as a message passing scheme, where each node sums the feature vectors coming from its neighbors. We argue that this imposes a…

Machine Learning · Computer Science 2018-01-09 Risi Kondor , Hy Truong Son , Horace Pan , Brandon Anderson , Shubhendu Trivedi

DNN-based cross-modal retrieval is a research hotspot to retrieve across different modalities as image and text, but existing methods often face the challenge of insufficient cross-modal training data. In single-modal scenario, similar…

Multimedia · Computer Science 2017-06-27 Xin Huang , Yuxin Peng , Mingkuan Yuan

Recently, transfer subspace learning based approaches have shown to be a valid alternative to unsupervised subspace clustering and temporal data clustering for human motion segmentation (HMS). These approaches leverage prior knowledge from…

Computer Vision and Pattern Recognition · Computer Science 2021-08-30 Mariella Dimiccoli , Lluís Garrido , Guillem Rodriguez-Corominas , Herwig Wendt

Precisely-labeled data sets with sufficient amount of samples are very important for training deep convolutional neural networks (CNNs). However, many of the available real-world data sets contain erroneously labeled samples and those…

Computer Vision and Pattern Recognition · Computer Science 2016-03-03 Samaneh Azadi , Jiashi Feng , Stefanie Jegelka , Trevor Darrell

Multi-view representation learning has developed rapidly over the past decades and has been applied in many fields. However, most previous works assumed that each view is complete and aligned. This leads to an inevitable deterioration in…

Computer Vision and Pattern Recognition · Computer Science 2022-11-10 Yiming Wang , Dongxia Chang , Zhiqiang Fu , Jie Wen , Yao Zhao

Convolutional neural networks (CNNs) leverage the great power in representation learning on regular grid data such as image and video. Recently, increasing attention has been paid on generalizing CNNs to graph or network data which is…

Social and Information Networks · Computer Science 2018-08-21 Yao Ma , Suhang Wang , Charu C. Aggarwal , Dawei Yin , Jiliang Tang

We propose a novel visual tracking algorithm based on the representations from a discriminatively trained Convolutional Neural Network (CNN). Our algorithm pretrains a CNN using a large set of videos with tracking ground-truths to obtain a…

Computer Vision and Pattern Recognition · Computer Science 2016-01-07 Hyeonseob Nam , Bohyung Han

High dimensional data analysis for exploration and discovery includes three fundamental tasks: dimensionality reduction, clustering, and visualization. When the three associated tasks are done separately, as is often the case thus far,…

Machine Learning · Computer Science 2020-12-02 Stan Z. Li , Lirong Wu , Zelin Zang

We address the problem of anomaly detection, that is, detecting anomalous events in a video sequence. Anomaly detection methods based on convolutional neural networks (CNNs) typically leverage proxy tasks, such as reconstructing input video…

Computer Vision and Pattern Recognition · Computer Science 2020-03-31 Hyunjong Park , Jongyoun Noh , Bumsub Ham

Convolutional Neural Network (CNN) is a popular model in computer vision and has the advantage of making good use of the correlation information of data. However, CNN is challenging to learn efficiently if the given dimension of data or…

Quantum Physics · Physics 2020-09-22 Seunghyeok Oh , Jaeho Choi , Joongheon Kim
‹ Prev 1 2 3 10 Next ›