Related papers: Multimodal Subspace Support Vector Data Descriptio…

Subspace Support Vector Data Description

This paper proposes a novel method for solving one-class classification problems. The proposed approach, namely Subspace Support Vector Data Description, maps the data to a subspace that is optimized for one-class classification. In that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Fahad Sohrab , Jenni Raitoharju , Moncef Gabbouj , Alexandros Iosifidis

Ellipsoidal Subspace Support Vector Data Description

In this paper, we propose a novel method for transforming data into a low-dimensional space optimized for one-class classification. The proposed method iteratively transforms data into a new subspace optimized for ellipsoidal encapsulation…

Machine Learning · Computer Science 2020-09-15 Fahad Sohrab , Jenni Raitoharju , Alexandros Iosifidis , Moncef Gabbouj

Graph-Embedded Subspace Support Vector Data Description

In this paper, we propose a novel subspace learning framework for one-class classification. The proposed framework presents the problem in the form of graph embedding. It includes the previously proposed subspace one-class techniques as its…

Machine Learning · Computer Science 2023-08-29 Fahad Sohrab , Alexandros Iosifidis , Moncef Gabbouj , Jenni Raitoharju

Supervised cross-modal factor analysis for multiple modal data classification

In this paper we study the problem of learning from multiple modal data for purpose of document classification. In this problem, each document is composed two different modals of data, i.e., an image and a text. Cross-modal factor analysis…

Machine Learning · Computer Science 2015-08-19 Jingbin Wang , Yihua Zhou , Kanghong Duan , Jim Jing-Yan Wang , Halima Bensmail

Learning Shared Cross-modality Representation Using Multispectral-LiDAR and Hyperspectral Data

Due to the ever-growing diversity of the data source, multi-modality feature learning has attracted more and more attention. However, most of these methods are designed by jointly learning feature representation from multi-modalities that…

Computer Vision and Pattern Recognition · Computer Science 2020-06-09 Danfeng Hong , Jocelyn Chanussot , Naoto Yokoya , Jian Kang , Xiao Xiang Zhu

Learn to Combine Modalities in Multimodal Deep Learning

Combining complementary information from multiple modalities is intuitively appealing for improving the performance of learning-based approaches. However, it is challenging to fully leverage different modalities due to practical challenges…

Machine Learning · Statistics 2018-05-31 Kuan Liu , Yanen Li , Ning Xu , Prem Natarajan

We introduce an efficient computational framework for hashing data belonging to multiple modalities into a single representation space where they become mutually comparable. The proposed approach is based on a novel coupled siamese neural…

Computer Vision and Pattern Recognition · Computer Science 2012-07-09 Jonathan Masci , Michael M. Bronstein , Alexander A. Bronstein , Jürgen Schmidhuber

MMSFormer: Multimodal Transformer for Material and Semantic Segmentation

Leveraging information across diverse modalities is known to enhance performance on multimodal segmentation tasks. However, effectively fusing information from different modalities remains challenging due to the unique characteristics of…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Md Kaykobad Reza , Ashley Prater-Bennette , M. Salman Asif

Learning Multi-modal Similarity

In many applications involving multi-media data, the definition of similarity between items is integral to several key tasks, e.g., nearest-neighbor retrieval, classification, and recommendation. Data in such regimes typically exhibits…

Artificial Intelligence · Computer Science 2010-09-01 Brian McFee , Gert Lanckriet

Robust Semi-supervised Multimodal Medical Image Segmentation via Cross Modality Collaboration

Multimodal learning leverages complementary information derived from different modalities, thereby enhancing performance in medical image segmentation. However, prevailing multimodal learning methods heavily rely on extensive well-annotated…

Computer Vision and Pattern Recognition · Computer Science 2024-09-05 Xiaogen Zhou , Yiyou Sun , Min Deng , Winnie Chiu Wing Chu , Qi Dou

Adaptive Fusion Techniques for Multimodal Data

Effective fusion of data from multiple modalities, such as video, speech, and text, is challenging due to the heterogeneous nature of multimodal data. In this paper, we propose adaptive fusion techniques that aim to model context from…

Computation and Language · Computer Science 2021-01-27 Gaurav Sahu , Olga Vechtomova

MENTOR: Multi-level Self-supervised Learning for Multimodal Recommendation

With the increasing multimedia information, multimodal recommendation has received extensive attention. It utilizes multimodal information to alleviate the data sparsity problem in recommendation systems, thus improving recommendation…

Information Retrieval · Computer Science 2024-03-01 Jinfeng Xu , Zheyu Chen , Shuo Yang , Jinze Li , Hewei Wang , Edith C. -H. Ngai

Cross-Modal and Multimodal Data Analysis Based on Functional Mapping of Spectral Descriptors and Manifold Regularization

Multimodal manifold modeling methods extend the spectral geometry-aware data analysis to learning from several related and complementary modalities. Most of these methods work based on two major assumptions: 1) there are the same number of…

Machine Learning · Computer Science 2021-05-13 Maysam Behmanesh , Peyman Adibi , Jocelyn Chanussot , Sayyed Mohammad Saeed Ehsani

Towards Achieving Perfect Multimodal Alignment

Multimodal alignment constructs a joint latent vector space where modalities representing the same concept map to neighboring latent vectors. We formulate this as an inverse problem and show that, under certain conditions, paired data from…

Machine Learning · Computer Science 2025-06-10 Abhi Kamboj , Minh N. Do

Fast Multilevel Support Vector Machines

Solving different types of optimization models (including parameters fitting) for support vector machines on large-scale training data is often an expensive computational task. This paper proposes a multilevel algorithmic framework that…

Machine Learning · Statistics 2014-10-14 Talayeh Razzaghi , Ilya Safro

Enhancing multimodal cooperation via sample-level modality valuation

One primary topic of multimodal learning is to jointly incorporate heterogeneous information from different modalities. However most models often suffer from unsatisfactory multimodal cooperation which cannot jointly utilize all modalities…

Computer Vision and Pattern Recognition · Computer Science 2024-06-17 Yake Wei , Ruoxuan Feng , Zihe Wang , Di Hu

A Shared Encoder Approach to Multimodal Representation Learning

Multimodal representation learning has demonstrated remarkable potential in enabling models to process and integrate diverse data modalities, such as text and images, for improved understanding and performance. While the medical domain can…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Shuvendu Roy , Franklin Ogidi , Ali Etemad , Elham Dolatabadi , Arash Afkanpour

Recovering Hidden Components in Multimodal Data with Composite Diffusion Operators

Finding appropriate low dimensional representations of high-dimensional multi-modal data can be challenging, since each modality embodies unique deformations and interferences. In this paper, we address the problem using manifold learning,…

Signal Processing · Electrical Eng. & Systems 2018-08-23 Tal Shnitzer , Mirela Ben-Chen , Leonidas Guibas , Ronen Talmon , Hau-Tieng Wu

Continual Learning for Multiple Modalities

Continual learning aims to learn knowledge of tasks observed in sequential time steps while mitigating the forgetting of previously learned knowledge. Existing methods were designed to learn a single modality (e.g., image) over time, which…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Hyundong Jin , Eunwoo Kim

Adaptive Cross-Modal Few-Shot Learning

Metric-based meta-learning techniques have successfully been applied to few-shot classification problems. In this paper, we propose to leverage cross-modal information to enhance metric-based few-shot learning methods. Visual and semantic…

Machine Learning · Computer Science 2020-02-19 Chen Xing , Negar Rostamzadeh , Boris N. Oreshkin , Pedro O. Pinheiro