English
Related papers

Related papers: SimMLM: A Simple Framework for Multi-modal Learnin…

200 papers

Multimodal networks have demonstrated remarkable performance improvements over their unimodal counterparts. Existing multimodal networks are designed in a multi-branch fashion that, due to the reliance on fusion strategies, exhibit…

A common assumption in multimodal learning is the completeness of training data, i.e., full modalities are available in all training examples. Although there exists research endeavor in developing novel methods to tackle the incompleteness…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Mengmeng Ma , Jian Ren , Long Zhao , Sergey Tulyakov , Cathy Wu , Xi Peng

During multimodal model training and testing, certain data modalities may be absent due to sensor limitations, cost constraints, privacy concerns, or data loss, negatively affecting performance. Multimodal learning techniques designed to…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Renjie Wu , Hu Wang , Hsiang-Ting Chen , Gustavo Carneiro

Combining multiple modalities carrying complementary information through multimodal learning (MML) has shown considerable benefits for diagnosing multiple pathologies. However, the robustness of multimodal models to missing modalities is…

Machine Learning · Computer Science 2024-07-31 Hava Chaptoukaev , Vincenzo Marcianó , Francesco Galati , Maria A. Zuluaga

Multimodal learning seeks to utilize data from multiple sources to improve the overall performance of downstream tasks. It is desirable for redundancies in the data to make multimodal systems robust to missing or corrupted observations in…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Md Kaykobad Reza , Ashley Prater-Bennette , M. Salman Asif

Multimodal learning has achieved great successes in many scenarios. Compared with unimodal learning, it can effectively combine the information from different modalities to improve the performance of learning tasks. In reality, the…

Machine Learning · Computer Science 2021-08-25 Fei Ma , Xiangxiang Xu , Shao-Lun Huang , Lin Zhang

Multimodal learning has shown promising performance in content-based recommendation due to the auxiliary user and item information of multiple modalities such as text and images. However, the problem of incomplete and missing modality is…

Information Retrieval · Computer Science 2018-08-31 Cheng Wang , Mathias Niepert , Hui Li

Multimodal remote sensing classification often suffers from missing modalities caused by sensor failures and environmental interference, leading to severe performance degradation. In this work, we rethink missing-modality learning from a…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Qinghao Gao , Jiahui Qu , Wenqian Dong

Existing multimodal tasks mostly target at the complete input modality setting, i.e., each modality is either complete or completely missing in both training and test sets. However, the randomly missing situations have still been…

Computation and Language · Computer Science 2022-10-25 Wei Han , Hui Chen , Min-Yen Kan , Soujanya Poria

Multimodal learning has demonstrated remarkable performance improvements over unimodal architectures. However, multimodal learning methods often exhibit deteriorated performances if one or more modalities are missing. This may be attributed…

Multimodal learning typically relies on the assumption that all modalities are fully available during both the training and inference phases. However, in real-world scenarios, consistently acquiring complete multimodal data presents…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Donggeun Kim , Taesup Kim

Existing multimodal sentiment analysis tasks are highly rely on the assumption that the training and test sets are complete multimodal data, while this assumption can be difficult to hold: the multimodal data are often incomplete in…

Computer Vision and Pattern Recognition · Computer Science 2024-01-26 Xianbing Zhao , Soujanya Poria , Xuejiao Li , Yixin Chen , Buzhou Tang

Missing or corrupted modalities are common in physiological signal-based medical applications owing to hardware constraints or motion artifacts. However, most existing methods assume the availability of all modalities, resulting in…

Machine Learning · Computer Science 2025-10-14 Cheol-Hui Lee , Hwa-Yeon Lee , Min-Kyung Jung , Dong-Joo Kim

Learning from multiple modalities often suffers from imbalance, where information-rich modalities dominate optimization while weaker or partially missing modalities contribute less. This imbalance becomes severe in realistic settings with…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Phuong-Anh Nguyen , Tien Anh Pham , Duc-Trong Le , Cam-Van Thi Nguyen

The Internet of Things (IoT) ecosystem generates vast amounts of multimodal data from heterogeneous sources such as sensors, cameras, and microphones. As edge intelligence continues to evolve, IoT devices have progressed from simple data…

Machine Learning · Computer Science 2025-05-23 Heqiang Wang , Xiang Liu , Xiaoxiong Zhong , Lixing Chen , Fangming Liu , Weizhe Zhang

Malignant brain tumors have become an aggressive and dangerous disease that leads to death worldwide.Multi-modal MRI data is crucial for accurate brain tumor segmentation, but missing modalities common in clinical practice can severely…

Methodology · Statistics 2025-07-11 Guoyan Liang , Qin Zhou , Jingyuan Chen , Bingcang Huang , Kai Chen , Lin Gu , Zhe Wang , Sai Wu , Chang Yao

Multimodal learning seeks to combine data from multiple input sources to enhance the performance of different downstream tasks. In real-world scenarios, performance can degrade substantially if some input modalities are missing. Existing…

Machine Learning · Computer Science 2024-10-10 Niki Nezakati , Md Kaykobad Reza , Ameya Patil , Mashhour Solh , M. Salman Asif

Multimodal representation learning harmonizes distinct modalities by aligning them into a unified latent space. Recent research generalizes traditional cross-modal alignment to produce enhanced multimodal synergy but requires all modalities…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Xiaohao Liu , Xiaobo Xia , Jiaheng Wei , Shuo Yang , Xiu Su , See-Kiong Ng , Tat-Seng Chua

Multimodal machine learning with missing modalities is an increasingly relevant challenge arising in various applications such as healthcare. This paper extends the current research into missing modalities to the low-data regime, i.e., a…

Machine Learning · Computer Science 2024-03-27 Zhuo Zhi , Ziquan Liu , Moe Elbadawi , Adam Daneshmend , Mine Orlu , Abdul Basit , Andreas Demosthenous , Miguel Rodrigues

Using multiple spatial modalities has been proven helpful in improving semantic segmentation performance. However, there are several real-world challenges that have yet to be addressed: (a) improving label efficiency and (b) enhancing…

Computer Vision and Pattern Recognition · Computer Science 2023-04-24 Harsh Maheshwari , Yen-Cheng Liu , Zsolt Kira
‹ Prev 1 2 3 10 Next ›