English
Related papers

Related papers: Multi-modal Deep Learning

200 papers

This paper introduces an innovative multi-modal fusion deep learning approach to overcome the drawbacks of traditional single-modal recognition techniques. These drawbacks include incomplete information and limited diagnostic accuracy.…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Xiaoyi Liu , Hongjie Qiu , Muqing Li , Zhou Yu , Yutian Yang , Yafeng Yan

Machine learning methods in healthcare have traditionally focused on using data from a single modality, limiting their ability to effectively replicate the clinical practice of integrating multiple sources of information for improved…

Machine Learning · Computer Science 2024-02-13 Felix Krones , Umar Marikkar , Guy Parsons , Adam Szmul , Adam Mahdi

Deep learning techniques have been successfully used in learning a common representation for multi-view data, wherein the different modalities are projected onto a common subspace. In a broader perspective, the techniques used to…

Computer Vision and Pattern Recognition · Computer Science 2017-11-02 Gaurav Bhatt , Piyush Jha , Balasubramanian Raman

In this paper, an innovative multi-modal deep learning model is proposed to deeply integrate heterogeneous information from medical images and clinical reports. First, for medical images, convolutional neural networks were used to extract…

Machine Learning · Computer Science 2024-05-29 Ziyan Yao , Fei Lin , Sheng Chai , Weijie He , Lu Dai , Xinghui Fei

The accurate diagnosis of pathological subtypes of lung cancer is of paramount importance for follow-up treatments and prognosis managements. Assessment methods utilizing deep learning technologies have introduced novel approaches for…

Image and Video Processing · Electrical Eng. & Systems 2024-07-19 Yuan Jin , Gege Ma , Geng Chen , Tianling Lyu , Jan Egger , Junhui Lyu , Shaoting Zhang , Wentao Zhu

Clinical outcome or severity prediction from medical images has largely focused on learning representations from single-timepoint or snapshot scans. It has been shown that disease progression can be better characterized by temporal imaging.…

Image and Video Processing · Electrical Eng. & Systems 2022-04-01 Aishik Konwer , Xuan Xu , Joseph Bae , Chao Chen , Prateek Prasanna

We present a novel multimodal deep learning framework for cardiac resynchronisation therapy (CRT) response prediction from 2D echocardiography and cardiac magnetic resonance (CMR) data. The proposed method first uses the `nnU-Net'…

Image and Video Processing · Electrical Eng. & Systems 2021-07-23 Esther Puyol-Antón , Baldeep S. Sidhu , Justin Gould , Bradley Porter , Mark K. Elliott , Vishal Mehta , Christopher A. Rinaldi , Andrew P. King

Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Yihao Li , Mostafa El Habib Daho , Pierre-Henri Conze , Rachid Zeghlache , Hugo Le Boité , Ramin Tadayoni , Béatrice Cochener , Mathieu Lamard , Gwenolé Quellec

Self-supervised learning is an efficient pre-training method for medical image analysis. However, current research is mostly confined to specific-modality data pre-training, consuming considerable time and resources without achieving…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Yiwen Ye , Yutong Xie , Jianpeng Zhang , Ziyang Chen , Qi Wu , Yong Xia

Learning medical visual representations directly from paired images and reports through multimodal self-supervised learning has emerged as a novel and efficient approach to digital diagnosis in recent years. However, existing models suffer…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Libin Lan , Hongxing Li , Zunhui Xia , Juan Zhou , Xiaofei Zhu , Yongmei Li , Yudong Zhang , Xin Luo

Data is one of the essential ingredients to power deep learning research. Small datasets, especially specific to medical institutes, bring challenges to deep learning training stage. This work aims to develop a practical deep multimodal…

Machine Learning · Computer Science 2019-02-26 Faik Aydin , Maggie Zhang , Michelle Ananda-Rajah , Gholamreza Haffari

The rapid development of diagnostic technologies in healthcare is leading to higher requirements for physicians to handle and integrate the heterogeneous, yet complementary data that are produced during routine practice. For instance, the…

Machine Learning · Computer Science 2023-01-30 Can Cui , Haichun Yang , Yaohong Wang , Shilin Zhao , Zuhayr Asad , Lori A. Coburn , Keith T. Wilson , Bennett A. Landman , Yuankai Huo

Multi-modality is widely used in medical imaging, because it can provide multiinformation about a target (tumor, organ or tissue). Segmentation using multimodality consists of fusing multi-information to improve the segmentation. Recently,…

Image and Video Processing · Electrical Eng. & Systems 2020-07-17 Tongxue Zhou , Su Ruan , Stéphane Canu

Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Zhaoyi Sun , Mingquan Lin , Qingqing Zhu , Qianqian Xie , Fei Wang , Zhiyong Lu , Yifan Peng

Purpose High dimensional, multimodal data can nowadays be analyzed by huge deep neural networks with little effort. Several fusion methods for bringing together different modalities have been developed. Given the prevalence of…

Computer Vision and Pattern Recognition · Computer Science 2025-10-03 Christian Gapp , Elias Tappeiner , Martin Welk , Karl Fritscher , Elke Ruth Gizewski , Rainer Schubert

As medical diagnoses increasingly leverage multimodal data, machine learning models are expected to effectively fuse heterogeneous information while remaining robust to missing modalities. In this work, we propose a novel multimodal…

Computer Vision and Pattern Recognition · Computer Science 2025-09-24 Yi Gu , Kuniaki Saito , Jiaxin Ma

Cancer has relational information residing at varying scales, modalities, and resolutions of the acquired data, such as radiology, pathology, genomics, proteomics, and clinical records. Integrating diverse data types can improve the…

Machine Learning · Computer Science 2024-07-29 Asim Waqas , Aakash Tripathi , Ravi P. Ramachandran , Paul Stewart , Ghulam Rasool

The classification of medical images is a pivotal aspect of disease diagnosis, often enhanced by deep learning techniques. However, traditional approaches typically focus on unimodal medical image data, neglecting the integration of diverse…

Image and Video Processing · Electrical Eng. & Systems 2025-11-11 Jun-En Ding , Chien-Chin Hsu , Chi-Hsiang Chu , Shuqiang Wang , Feng Liu

While multimodal data integrating diverse imaging and clinical tabular records is crucial for accurate medical diagnosis, the arbitrary absence of specific modalities is prevalent in clinical practice, severely degrading the performance of…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Tianling Liu , Lequan Yu , Tong Han , Liang Wan

Automatic radiology report generation can alleviate the workload for physicians and minimize regional disparities in medical resources, therefore becoming an important topic in the medical image analysis field. It is a challenging task, as…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Xinyi Wang , Grazziela Figueredo , Ruizhe Li , Wei Emma Zhang , Weitong Chen , Xin Chen
‹ Prev 1 2 3 10 Next ›