Related papers: Maximum Likelihood Estimation for Multimodal Learn…

200 papers

During multimodal model training and testing, certain data modalities may be absent due to sensor limitations, cost constraints, privacy concerns, or data loss, negatively affecting performance. Multimodal learning techniques designed to…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Renjie Wu, Hu Wang, Hsiang-Ting Chen, Gustavo Carneiro

Multimodal learning seeks to utilize data from multiple sources to improve the overall performance of downstream tasks. It is desirable for redundancies in the data to make multimodal systems robust to missing or corrupted observations in…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif

Multimodal learning typically relies on the assumption that all modalities are fully available during both the training and inference phases. However, in real-world scenarios, consistently acquiring complete multimodal data presents…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Donggeun Kim, Taesup Kim

A common assumption in multimodal learning is the completeness of training data, i.e., full modalities are available in all training examples. Although there have been research efforts to develop novel methods to tackle the incompleteness…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Mengmeng Ma, Jian Ren, Long Zhao, Sergey Tulyakov, Cathy Wu, Xi Peng

Multimodal networks have demonstrated remarkable performance improvements over their unimodal counterparts. Existing multimodal networks are designed in a multi-branch fashion that, due to the reliance on fusion strategies, exhibit…

Multimodal learning seeks to combine data from multiple input sources to enhance the performance of different downstream tasks. In real-world scenarios, performance can degrade substantially if some input modalities are missing. Existing…

Machine Learning · Computer Science 2024-10-10 Niki Nezakati, Md Kaykobad Reza, Ameya Patil, Mashhour Solh, M. Salman Asif

Multimodal learning leverages the integration of diverse data modalities to enhance performance in complex tasks. Yet, it frequently encounters incomplete or redundant modality data in real-world scenarios. This paper presents a…

Machine Learning · Computer Science 2026-05-05 Richeng Zhou, Xuelin Zhang, Liyuan Liu

In this paper, we propose SimMLM, a simple yet powerful framework for multimodal learning with missing modalities. Unlike existing approaches that rely on sophisticated network architectures or complex data imputation techniques, SimMLM…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Sijie Li, Chen Chen, Jungong Han

Multimodal data collected from the real world are often imperfect due to missing modalities. Therefore, multimodal models that are robust against modal-incomplete data are highly preferred. Recently, Transformer models have shown great…

Computer Vision and Pattern Recognition · Computer Science 2022-04-13 Mengmeng Ma, Jian Ren, Long Zhao, Davide Testuggine, Xi Peng

Multimodal learning, which integrates diverse data sources such as images, text, and structured data, has proven superior to unimodal counterparts in high-stakes decision-making. However, while performance gains remain the gold standard for…

Artificial Intelligence · Computer Science 2025-05-07 Kishore Sampath, Pratheesh, Ayaazuddin Mohammad, Resmi Ramachandranpillai

Multimodal learning integrates data from diverse sensors to effectively harness information from different modalities. However, recent studies reveal that joint learning often overfits certain modalities while neglecting others, leading to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Feng Yu, Xiangyu Wu, Yang Yang, Jianfeng Lu

Multimodal learning is of continued interest in artificial intelligence-based applications, motivated by the potential information gain from combining different data modalities. However, modalities observed in the source environment may…

Machine Learning · Computer Science 2026-03-03 Young Sang Choi, Vincent Jeanselme, Pierre Elias, Shalmali Joshi

Multimodal machine learning with missing modalities is an increasingly relevant challenge arising in various applications such as healthcare. This paper extends the current research into missing modalities to the low-data regime, i.e., a…

Machine Learning · Computer Science 2024-03-27 Zhuo Zhi, Ziquan Liu, Moe Elbadawi, Adam Daneshmend, Mine Orlu, Abdul Basit, Andreas Demosthenous, Miguel Rodrigues

Multimodal learning assumes all modality combinations of interest are available during training to learn cross-modal correspondences. In this paper, we challenge this modality-complete assumption for multimodal learning and instead strive…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Yunhua Zhang, Hazel Doughty, Cees G. M. Snoek

Multimodal learning has demonstrated remarkable performance improvements over unimodal architectures. However, multimodal learning methods often exhibit degraded performance if one or more modalities are missing. This may be attributed…

Existing multimodal tasks mostly target the complete input modality setting, i.e., each modality is either complete or completely missing in both training and test sets. However, the randomly missing situations have still been…

Computation and Language · Computer Science 2022-10-25 Wei Han, Hui Chen, Min-Yen Kan, Soujanya Poria

Multimodal deep learning systems, which employ multiple modalities such as text, image, audio, and video, show better performance than individual-modality (i.e., unimodal) systems. Multimodal machine learning involves…

Machine Learning · Computer Science 2022-01-19 Anil Rahate, Rahee Walambe, Sheela Ramanna, Ketan Kotecha

Multimodal sentiment analysis aims to identify the emotions expressed by individuals through visual, language, and acoustic cues. However, most existing research assumes that all modalities are available during both training and testing,…

Sound · Computer Science 2026-04-21 Weide Liu, Huijing Zhan

Missing modality issues are common in real-world applications, arising from factors such as equipment failures and privacy concerns. When fine-tuning pre-trained models on downstream datasets with missing modalities, performance can degrade…

Machine Learning · Computer Science 2025-03-04 Zirun Guo, Shulei Wang, Wang Lin, Weicai Yan, Yangyang Wu, Tao Jin

As a knowledge discovery task over heterogeneous data sources, current Multimodal Affective Computing (MAC) relies heavily on the completeness of multiple modalities to accurately understand human affective states. However, in real-world…

Artificial Intelligence · Computer Science 2026-02-03 Ronghao Lin, Honghao Lu, Ruixing Wu, Aolin Xiong, Qinggong Chu, Qiaolin He, Sijie Mai, Haifeng Hu