Related papers: Maximum Likelihood Estimation for Multimodal Learn…

Deep Multimodal Learning with Missing Modality: A Survey

During multimodal model training and testing, certain data modalities may be absent due to sensor limitations, cost constraints, privacy concerns, or data loss, negatively affecting performance. Multimodal learning techniques designed to…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Renjie Wu , Hu Wang , Hsiang-Ting Chen , Gustavo Carneiro

Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation

Multimodal learning seeks to utilize data from multiple sources to improve the overall performance of downstream tasks. It is desirable for redundancies in the data to make multimodal systems robust to missing or corrupted observations in…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Md Kaykobad Reza , Ashley Prater-Bennette , M. Salman Asif

Missing Modality Prediction for Unpaired Multimodal Learning via Joint Embedding of Unimodal Models

Multimodal learning typically relies on the assumption that all modalities are fully available during both the training and inference phases. However, in real-world scenarios, consistently acquiring complete multimodal data presents…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Donggeun Kim , Taesup Kim

SMIL: Multimodal Learning with Severely Missing Modality

A common assumption in multimodal learning is the completeness of training data, i.e., full modalities are available in all training examples. Although there exists research endeavor in developing novel methods to tackle the incompleteness…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Mengmeng Ma , Jian Ren , Long Zhao , Sergey Tulyakov , Cathy Wu , Xi Peng

Modality Invariant Multimodal Learning to Handle Missing Modalities: A Single-Branch Approach

Multimodal networks have demonstrated remarkable performance improvements over their unimodal counterparts. Existing multimodal networks are designed in a multi-branch fashion that, due to the reliance on fusion strategies, exhibit…

Computer Vision and Pattern Recognition · Computer Science 2024-08-15 Muhammad Saad Saeed , Shah Nawaz , Muhammad Zaigham Zaheer , Muhammad Haris Khan , Karthik Nandakumar , Muhammad Haroon Yousaf , Hassan Sajjad , Tom De Schepper , Markus Schedl

MMP: Towards Robust Multi-Modal Learning with Masked Modality Projection

Multimodal learning seeks to combine data from multiple input sources to enhance the performance of different downstream tasks. In real-world scenarios, performance can degrade substantially if some input modalities are missing. Existing…

Machine Learning · Computer Science 2024-10-10 Niki Nezakati , Md Kaykobad Reza , Ameya Patil , Mashhour Solh , M. Salman Asif

Quantifying Multimodal Capabilities: Formal Generalization Guarantees in Pairwise Metric Learning

Multimodal learning leverages the integration of diverse data modalities to enhance performance in complex tasks. Yet, it frequently encounters incomplete or redundant modality data in real-world scenarios. This paper presents a…

Machine Learning · Computer Science 2026-05-05 Richeng Zhou , Xuelin Zhang , Liyuan Liu

SimMLM: A Simple Framework for Multi-modal Learning with Missing Modality

In this paper, we propose SimMLM, a simple yet powerful framework for multimodal learning with missing modalities. Unlike existing approaches that rely on sophisticated network architectures or complex data imputation techniques, SimMLM…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Sijie Li , Chen Chen , Jungong Han

Are Multimodal Transformers Robust to Missing Modality?

Multimodal data collected from the real world are often imperfect due to missing modalities. Therefore multimodal models that are robust against modal-incomplete data are highly preferred. Recently, Transformer models have shown great…

Computer Vision and Pattern Recognition · Computer Science 2022-04-13 Mengmeng Ma , Jian Ren , Long Zhao , Davide Testuggine , Xi Peng

The Multimodal Paradox: How Added and Missing Modalities Shape Bias and Performance in Multimodal AI

Multimodal learning, which integrates diverse data sources such as images, text, and structured data, has proven superior to unimodal counterparts in high-stakes decision-making. However, while performance gains remain the gold standard for…

Artificial Intelligence · Computer Science 2025-05-07 Kishore Sampath , Pratheesh , Ayaazuddin Mohammad , Resmi Ramachandranpillai

Multimodal Classification via Total Correlation Maximization

Multimodal learning integrates data from diverse sensors to effectively harness information from different modalities. However, recent studies reveal that joint learning often overfits certain modalities while neglecting others, leading to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Feng Yu , Xiangyu Wu , Yang Yang , Jianfeng Lu

ICYM2I: The illusion of multimodal informativeness under missingness

Multimodal learning is of continued interest in artificial intelligence-based applications, motivated by the potential information gain from combining different data modalities. However, modalities observed in the source environment may…

Machine Learning · Computer Science 2026-03-03 Young Sang Choi , Vincent Jeanselme , Pierre Elias , Shalmali Joshi

Borrowing Treasures from Neighbors: In-Context Learning for Multimodal Learning with Missing Modalities and Data Scarcity

Multimodal machine learning with missing modalities is an increasingly relevant challenge arising in various applications such as healthcare. This paper extends the current research into missing modalities to the low-data regime, i.e., a…

Machine Learning · Computer Science 2024-03-27 Zhuo Zhi , Ziquan Liu , Moe Elbadawi , Adam Daneshmend , Mine Orlu , Abdul Basit , Andreas Demosthenous , Miguel Rodrigues

Learning Unseen Modality Interaction

Multimodal learning assumes all modality combinations of interest are available during training to learn cross-modal correspondences. In this paper, we challenge this modality-complete assumption for multimodal learning and instead strive…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Yunhua Zhang , Hazel Doughty , Cees G. M. Snoek

Chameleon: Images Are What You Need For Multimodal Learning Robust To Missing Modalities

Multimodal learning has demonstrated remarkable performance improvements over unimodal architectures. However, multimodal learning methods often exhibit deteriorated performances if one or more modalities are missing. This may be attributed…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Muhammad Irzam Liaqat , Shah Nawaz , Muhammad Zaigham Zaheer , Muhammad Saad Saeed , Hassan Sajjad , Tom De Schepper , Karthik Nandakumar , Muhammad Haris Khan , Markus Schedl

MM-Align: Learning Optimal Transport-based Alignment Dynamics for Fast and Accurate Inference on Missing Modality Sequences

Existing multimodal tasks mostly target at the complete input modality setting, i.e., each modality is either complete or completely missing in both training and test sets. However, the randomly missing situations have still been…

Computation and Language · Computer Science 2022-10-25 Wei Han , Hui Chen , Min-Yen Kan , Soujanya Poria

Multimodal Co-learning: Challenges, Applications with Datasets, Recent Advances and Future Directions

Multimodal deep learning systems which employ multiple modalities like text, image, audio, video, etc., are showing better performance in comparison with individual modalities (i.e., unimodal) systems. Multimodal machine learning involves…

Machine Learning · Computer Science 2022-01-19 Anil Rahate , Rahee Walambe , Sheela Ramanna , Ketan Kotecha

Multimodal Sentiment Analysis with Missing Modality: A Knowledge-Transfer Approach

Multimodal sentiment analysis aims to identify the emotions expressed by individuals through visual, language, and acoustic cues. However, most existing research assume that all modalities are available during both training and testing,…

Sound · Computer Science 2026-04-21 Weide Liu , Huijing Zhan

Efficient Prompting for Continual Adaptation to Missing Modalities

Missing modality issues are common in real-world applications, arising from factors such as equipment failures and privacy concerns. When fine-tuning pre-trained models on downstream datasets with missing modalities, performance can degrade…

Machine Learning · Computer Science 2025-03-04 Zirun Guo , Shulei Wang , Wang Lin , Weicai Yan , Yangyang Wu , Tao Jin

MissMAC-Bench: Building Solid Benchmark for Missing Modality Issue in Robust Multimodal Affective Computing

As a knowledge discovery task over heterogeneous data sources, current Multimodal Affective Computing (MAC) heavily rely on the completeness of multiple modalities to accurately understand human's affective state. However, in real-world…

Artificial Intelligence · Computer Science 2026-02-03 Ronghao Lin , Honghao Lu , Ruixing Wu , Aolin Xiong , Qinggong Chu , Qiaolin He , Sijie Mai , Haifeng Hu