Related papers: Prospective Study for Semantic Inter-Media Fusion …

SMFusion: Semantic-Preserving Fusion of Multimodal Medical Images for Enhanced Clinical Diagnosis

Multimodal medical image fusion plays a crucial role in medical diagnosis by integrating complementary information from different modalities to enhance image readability and clinical applicability. However, existing methods mainly follow…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Haozhe Xiang, Han Zhang, Yu Cheng, Xiongwen Quan, Wanwan Huang

A Semantic-based Medical Image Fusion Approach

It is necessary for clinicians to comprehensively analyze patient information from different sources. Medical image fusion is a promising approach to providing overall information from medical images of different modalities. However,…

Image and Video Processing · Electrical Eng. & Systems 2019-12-12 Fanda Fan, Yunyou Huang, Lei Wang, Xingwang Xiong, Zihan Jiang, Zhifei Zhang, Jianfeng Zhan

Medical Image Retrieval using Deep Convolutional Neural Network

With the widespread use of digital imaging data in hospitals, the size of medical image repositories is increasing rapidly. This makes these large databases difficult to manage and query, leading to the need for content-based medical…

Computer Vision and Pattern Recognition · Computer Science 2017-08-02 Adnan Qayyum, Syed Muhammad Anwar, Muhammad Awais, Muhammad Majid

Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study

While content-based image retrieval (CBIR) has been extensively studied in natural image retrieval, its application to medical images presents ongoing challenges, primarily due to the 3D nature of medical images. Recent studies have shown…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Farnaz Khun Jush, Steffen Vogler, Tuan Truong, Matthias Lenga

Semantic Image Fusion

Image fusion methods and metrics for their evaluation have conventionally used pixel-based or low-level features. However, for many applications, the aim of image fusion is to effectively combine the semantic content of the input images.…

Computer Vision and Pattern Recognition · Computer Science 2021-10-14 P. R. Hill, D. R. Bull

Cross-modal Image Retrieval with Deep Mutual Information Maximization

In this paper, we study cross-modal image retrieval, where the inputs contain a source image plus some text that describes certain modifications to this image and the desired image. Prior work usually uses a three-stage strategy to…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Chunbin Gu, Jiajun Bu, Xixi Zhou, Chengwei Yao, Dongfang Ma, Zhi Yu, Xifeng Yan

MiMIC: Mitigating Visual Modality Collapse in Universal Multimodal Retrieval While Avoiding Semantic Misalignment

Universal Multimodal Retrieval (UMR) aims to map different modalities (e.g., visual and textual) into a shared embedding space for multi-modal retrieval. Existing UMR methods can be broadly divided into two categories: early-fusion…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Juan Li, Chuanghao Ding, Xujie Zhang, Cam-Tu Nguyen

CBIDR: A novel method for information retrieval combining image and data by means of TOPSIS applied to medical diagnosis

Content-Based Image Retrieval (CBIR) has shown promising results in the field of medical diagnosis, where it aims to support medical professionals (doctors or pathologists). However, the ultimate decision regarding the diagnosis is…

Information Retrieval · Computer Science 2025-07-03 Humberto Giuri, Renato A. Krohling

A Text-Image Fusion Method with Data Augmentation Capabilities for Referring Medical Image Segmentation

Deep learning relies heavily on data augmentation to mitigate limited data, especially in medical imaging. Recent multimodal learning integrates text and images for segmentation, known as referring or text-guided image segmentation.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-15 Shurong Chai, Rahul Kumar JAIN, Rui Xu, Shaocong Mo, Ruibo Hou, Shiyu Teng, Jiaqing Liu, Lanfen Lin, Yen-Wei Chen

On the Combined Use of Extrinsic Semantic Resources for Medical Information Search

Semantic concepts and relations encoded in domain-specific ontologies and other medical semantic resources play a crucial role in deciphering terms in medical queries and documents. The exploitation of these resources for tackling the…

Information Retrieval · Computer Science 2020-05-19 Mohammed Maree, Israa Noor, Khaled Rabayah, Mohammed Belkhatir, Saadat M. Alhashmi

TMCIR: Token Merge Benefits Composed Image Retrieval

Composed Image Retrieval (CIR) retrieves target images using a multi-modal query that combines a reference image with text describing desired modifications. The primary challenge is effectively fusing this visual and textual information.…

Computer Vision and Pattern Recognition · Computer Science 2025-04-16 Chaoyang Wang, Zeyu Zhang, Long Teng, Zijun Li, Shichao Kan

SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval

Multi-modal information retrieval (MMIR) is a rapidly evolving field, where significant progress, particularly in image-text pairing, has been made through advanced representation learning and cross-modality alignment research. However,…

Information Retrieval · Computer Science 2024-06-12 Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin

WWFedCBMIR: World-Wide Federated Content-Based Medical Image Retrieval

The paper proposes a Federated Content-Based Medical Image Retrieval (FedCBMIR) platform that utilizes Federated Learning (FL) to address the challenges of acquiring a diverse medical data set for training CBMIR models. CBMIR assists…

Image and Video Processing · Electrical Eng. & Systems 2023-05-08 Zahra Tabatabaei, Yuandou Wang, Adrián Colomer, Javier Oliver Moll, Zhiming Zhao, Valery Naranjo

Coupled Feature Learning for Multimodal Medical Image Fusion

Multimodal image fusion aims to combine relevant information from images acquired with different sensors. In medical imaging, fused images play an essential role in both standard and automated diagnosis. In this paper, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2021-02-18 Farshad G. Veshki, Nora Ouzir, Sergiy A. Vorobyov, Esa Ollila

A review of deep learning-based information fusion techniques for multimodal medical image classification

Multimodal medical imaging plays a pivotal role in clinical diagnosis and research, as it combines information from various imaging modalities to provide a more comprehensive understanding of the underlying pathology. Recently, deep…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Yihao Li, Mostafa El Habib Daho, Pierre-Henri Conze, Rachid Zeghlache, Hugo Le Boité, Ramin Tadayoni, Béatrice Cochener, Mathieu Lamard, Gwenolé Quellec

Multimodal Medical Image Classification via Synergistic Learning Pre-training

Multimodal pathological images are usually used in clinical diagnosis, but computer vision-based multimodal image-assisted diagnosis faces challenges with modality fusion, especially in the absence of expert-annotated data. To achieve the…

Computer Vision and Pattern Recognition · Computer Science 2025-09-24 Qinghua Lin, Guang-Hai Liu, Zuoyong Li, Yang Li, Yuting Jiang, Xiang Wu

Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques

Content-based image retrieval (CBIR) systems have emerged as crucial tools in the field of computer vision, allowing for image search based on visual content rather than relying solely on metadata. This survey paper presents a comprehensive…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Hamed Qazanfari, Mohammad M. AlyanNezhadi, Zohreh Nozari Khoshdaregi

Content-based 3D Image Retrieval and a ColBERT-inspired Re-ranking for Tumor Flagging and Staging

The increasing volume of medical images poses challenges for radiologists in retrieving relevant cases. Content-based image retrieval (CBIR) systems offer potential for efficient access to similar cases, yet lack standardized evaluation and…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Farnaz Khun Jush, Steffen Vogler, Matthias Lenga

DocMMIR: A Framework for Document Multi-modal Information Retrieval

The rapid advancement of unsupervised representation learning and large-scale pre-trained vision-language models has significantly improved cross-modal retrieval tasks. However, existing multi-modal information retrieval (MMIR) studies lack…

Information Retrieval · Computer Science 2025-10-20 Zirui Li, Siwei Wu, Yizhi Li, Xingyu Wang, Yi Zhou, Chenghua Lin

MedFuse: Multi-modal fusion with clinical time-series data and chest X-ray images

Multi-modal fusion approaches aim to integrate information from different data sources. Unlike natural datasets, such as in audio-visual applications, where samples consist of "paired" modalities, data in healthcare is often collected…

Image and Video Processing · Electrical Eng. & Systems 2023-03-03 Nasir Hayat, Krzysztof J. Geras, Farah E. Shamout