Related papers: Multi-modal Deep Learning

Application of Multimodal Fusion Deep Learning Model in Disease Recognition

This paper introduces an innovative multi-modal fusion deep learning approach to overcome the drawbacks of traditional single-modal recognition techniques. These drawbacks include incomplete information and limited diagnostic accuracy.…

Computer Vision and Pattern Recognition · Computer Science · 2024-06-28 · Xiaoyi Liu, Hongjie Qiu, Muqing Li, Zhou Yu, Yutian Yang, Yafeng Yan

Review of multimodal machine learning approaches in healthcare

Machine learning methods in healthcare have traditionally focused on using data from a single modality, limiting their ability to effectively replicate the clinical practice of integrating multiple sources of information for improved…

Machine Learning · Computer Science · 2024-02-13 · Felix Krones, Umar Marikkar, Guy Parsons, Adam Szmul, Adam Mahdi

Common Representation Learning Using Step-based Correlation Multi-Modal CNN

Deep learning techniques have been successfully used in learning a common representation for multi-view data, wherein the different modalities are projected onto a common subspace. In a broader perspective, the techniques used to…

Computer Vision and Pattern Recognition · Computer Science · 2017-11-02 · Gaurav Bhatt, Piyush Jha, Balasubramanian Raman

Integrating Medical Imaging and Clinical Reports Using Multimodal Deep Learning for Advanced Disease Analysis

In this paper, an innovative multi-modal deep learning model is proposed to deeply integrate heterogeneous information from medical images and clinical reports. First, for medical images, convolutional neural networks were used to extract…

Machine Learning · Computer Science · 2024-05-29 · Ziyan Yao, Fei Lin, Sheng Chai, Weijie He, Lu Dai, Xinghui Fei

CC-DCNet: Dynamic Convolutional Neural Network with Contrastive Constraints for Identifying Lung Cancer Subtypes on Multi-modality Images

The accurate diagnosis of pathological subtypes of lung cancer is of paramount importance for follow-up treatment and prognosis management. Assessment methods utilizing deep learning technologies have introduced novel approaches for…

Image and Video Processing · Electrical Eng. & Systems · 2024-07-19 · Yuan Jin, Gege Ma, Geng Chen, Tianling Lyu, Jan Egger, Junhui Lyu, Shaoting Zhang, Wentao Zhu

Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations

Clinical outcome or severity prediction from medical images has largely focused on learning representations from single-timepoint or snapshot scans. It has been shown that disease progression can be better characterized by temporal imaging.…

Image and Video Processing · Electrical Eng. & Systems · 2022-04-01 · Aishik Konwer, Xuan Xu, Joseph Bae, Chao Chen, Prateek Prasanna

A Multimodal Deep Learning Model for Cardiac Resynchronisation Therapy Response Prediction

We present a novel multimodal deep learning framework for cardiac resynchronisation therapy (CRT) response prediction from 2D echocardiography and cardiac magnetic resonance (CMR) data. The proposed method first uses the 'nnU-Net'…

Image and Video Processing · Electrical Eng. & Systems · 2021-07-23 · Esther Puyol-Antón, Baldeep S. Sidhu, Justin Gould, Bradley Porter, Mark K. Elliott, Vishal Mehta, Christopher A. Rinaldi, Andrew P. King

A review of deep learning-based information fusion techniques for multimodal medical image classification

Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep…

Computer Vision and Pattern Recognition · Computer Science · 2024-04-24 · Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boité, Ramin Tadayoni, Béatrice Cochener, Mathieu Lamard, Gwenolé Quellec

Continual Self-supervised Learning: Towards Universal Multi-modal Medical Data Representation Learning

Self-supervised learning is an efficient pre-training method for medical image analysis. However, current research is mostly confined to specific-modality data pre-training, consuming considerable time and resources without achieving…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-01 · Yiwen Ye, Yutong Xie, Jianpeng Zhang, Ziyang Chen, Qi Wu, Yong Xia

Cross-Modal Clustering-Guided Negative Sampling for Self-Supervised Joint Learning from Medical Images and Reports

Learning medical visual representations directly from paired images and reports through multimodal self-supervised learning has emerged as a novel and efficient approach to digital diagnosis in recent years. However, existing models suffer…

Computer Vision and Pattern Recognition · Computer Science · 2025-06-16 · Libin Lan, Hongxing Li, Zunhui Xia, Juan Zhou, Xiaofei Zhu, Yongmei Li, Yudong Zhang, Xin Luo

Medical Multimodal Classifiers Under Scarce Data Condition

Data is one of the essential ingredients to power deep learning research. Small datasets, especially those specific to medical institutes, bring challenges to the deep learning training stage. This work aims to develop a practical deep multimodal…

Machine Learning · Computer Science · 2019-02-26 · Faik Aydin, Maggie Zhang, Michelle Ananda-Rajah, Gholamreza Haffari

Deep Multi-modal Fusion of Image and Non-image Data in Disease Diagnosis and Prognosis: A Review

The rapid development of diagnostic technologies in healthcare is leading to higher requirements for physicians to handle and integrate the heterogeneous, yet complementary data that are produced during routine practice. For instance, the…

Machine Learning · Computer Science · 2023-01-30 · Can Cui, Haichun Yang, Yaohong Wang, Shilin Zhao, Zuhayr Asad, Lori A. Coburn, Keith T. Wilson, Bennett A. Landman, Yuankai Huo

A review: Deep learning for medical image segmentation using multi-modality fusion

Multi-modality is widely used in medical imaging, because it can provide multi-information about a target (tumor, organ or tissue). Segmentation using multi-modality consists of fusing multi-information to improve the segmentation. Recently,…

Image and Video Processing · Electrical Eng. & Systems · 2020-07-17 · Tongxue Zhou, Su Ruan, Stéphane Canu

A scoping review on multimodal deep learning in biomedical images and texts

Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and…

Computer Vision and Pattern Recognition · Computer Science · 2023-10-20 · Zhaoyi Sun, Mingquan Lin, Qingqing Zhu, Qianqian Xie, Fei Wang, Zhiyong Lu, Yifan Peng

What are You Looking at? Modality Contribution in Multimodal Medical Deep Learning

Purpose: High-dimensional, multimodal data can nowadays be analyzed by huge deep neural networks with little effort. Several fusion methods for bringing together different modalities have been developed. Given the prevalence of…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-03 · Christian Gapp, Elias Tappeiner, Martin Welk, Karl Fritscher, Elke Ruth Gizewski, Rainer Schubert

Learning Contrastive Multimodal Fusion with Improved Modality Dropout for Disease Detection and Prediction

As medical diagnoses increasingly leverage multimodal data, machine learning models are expected to effectively fuse heterogeneous information while remaining robust to missing modalities. In this work, we propose a novel multimodal…

Computer Vision and Pattern Recognition · Computer Science · 2025-09-24 · Yi Gu, Kuniaki Saito, Jiaxin Ma

Multimodal Data Integration for Oncology in the Era of Deep Neural Networks: A Review

Cancer has relational information residing at varying scales, modalities, and resolutions of the acquired data, such as radiology, pathology, genomics, proteomics, and clinical records. Integrating diverse data types can improve the…

Machine Learning · Computer Science · 2024-07-29 · Asim Waqas, Aakash Tripathi, Ravi P. Ramachandran, Paul Stewart, Ghulam Rasool

Enhancing Multimodal Medical Image Classification using Cross-Graph Modal Contrastive Learning

The classification of medical images is a pivotal aspect of disease diagnosis, often enhanced by deep learning techniques. However, traditional approaches typically focus on unimodal medical image data, neglecting the integration of diverse…

Image and Video Processing · Electrical Eng. & Systems · 2025-11-11 · Jun-En Ding, Chien-Chin Hsu, Chi-Hsiang Chu, Shuqiang Wang, Feng Liu

Context-driven Missing-Modality Learning for Robust Medical Diagnosis with Image-Tabular Data

While multimodal data integrating diverse imaging and clinical tabular records is crucial for accurate medical diagnosis, the arbitrary absence of specific modalities is prevalent in clinical practice, severely degrading the performance of…

Computer Vision and Pattern Recognition · Computer Science · 2026-05-26 · Tianling Liu, Lequan Yu, Tong Han, Liang Wan

A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data

Automatic radiology report generation can alleviate the workload for physicians and minimize regional disparities in medical resources, therefore becoming an important topic in the medical image analysis field. It is a challenging task, as…

Computer Vision and Pattern Recognition · Computer Science · 2025-03-07 · Xinyi Wang, Grazziela Figueredo, Ruizhe Li, Wei Emma Zhang, Weitong Chen, Xin Chen