Related papers: l0-Regularized Sparse Coding-based Interpretable N…

MMA-UNet: A Multi-Modal Asymmetric UNet Architecture for Infrared and Visible Image Fusion

Multi-modal image fusion (MMIF) maps useful information from various modalities into the same representation space, thereby producing an informative fused image. However, the existing fusion algorithms tend to symmetrically fuse the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Jingxue Huang, Xilai Li, Tianshu Tan, Xiaosong Li, Tao Ye

Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion

Multi-modal image fusion (MMIF) integrates valuable information from different modality images into a fused one. However, the fusion of multiple visible images with different focal regions and infrared images is an unprecedented challenge in…

Computer Vision and Pattern Recognition · Computer Science 2024-02-01 Xilai Li, Xiaosong Li, Tao Ye, Xiaoqi Cheng, Wuyang Liu, Haishu Tan

MCFNet: A Multimodal Collaborative Fusion Network for Fine-Grained Semantic Classification

Multimodal information processing has become increasingly important for enhancing image classification performance. However, the intricate and implicit dependencies across different modalities often hinder conventional methods from…

Computer Vision and Pattern Recognition · Computer Science 2025-05-30 Yang Qiao, Xiaoyu Zhong, Xiaofeng Gu, Zhiguo Yu

DM-FNet: Unified multimodal medical image fusion via diffusion process-trained encoder-decoder

Multimodal medical image fusion (MMIF) extracts the most meaningful information from multiple source images, enabling a more comprehensive and accurate diagnosis. Achieving high-quality fusion results requires a careful balance of…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Dan He, Weisheng Li, Guofen Wang, Yuping Huang, Shiqiang Liu

Rethinking Normalization Strategies and Convolutional Kernels for Multimodal Image Fusion

Multimodal image fusion (MMIF) integrates information from different modalities to obtain a comprehensive image, aiding downstream tasks. However, existing research focuses on complementary information fusion and training strategies,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-12 Dan He, Guofen Wang, Weisheng Li, Yucheng Shu, Wenbo Li, Lijian Yang, Yuping Huang, Feiyan Li

Modality-Aware Infrared and Visible Image Fusion with Target-Aware Supervision

Infrared and visible image fusion (IVIF) is a fundamental task in multi-modal perception that aims to integrate complementary structural and textural cues from different spectral domains. In this paper, we propose FusionNet, a novel…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Tianyao Sun, Dawei Xiang, Tianqi Ding, Xiang Fang, Yijiashun Qi, Zunduo Zhao

All-weather Multi-Modality Image Fusion: Unified Framework and 100k Benchmark

Multi-modality image fusion (MMIF) combines complementary information from different image modalities to provide a comprehensive and objective interpretation of scenes. However, existing fusion methods cannot resist different weather…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Xilai Li, Wuyang Liu, Xiaosong Li, Fuqiang Zhou, Huafeng Li, Feiping Nie

Deep Convolutional Neural Network for Multi-modal Image Restoration and Fusion

In this paper, we propose a novel deep convolutional neural network to solve the general multi-modal image restoration (MIR) and multi-modal image fusion (MIF) problems. Different from other methods based on deep learning, our network…

Computer Vision and Pattern Recognition · Computer Science 2019-10-10 Xin Deng, Pier Luigi Dragotti

A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion

Multi-modality image fusion aims at fusing modality-specific (complementarity) and modality-shared (correlation) information from multiple source images. To tackle the problem of the neglect of inter-feature relationships, high-frequency…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Xiaoli Zhang, Liying Wang, Libo Zhao, Xiongfei Li, Siwei Ma

TSJNet: A Multi-modality Target and Semantic Awareness Joint-driven Image Fusion Network

This study aims to address the problem of incomplete information in unimodal images for semantic segmentation and object detection tasks. Existing multimodal fusion methods suffer from limited capability in discriminative modeling of…

Computer Vision and Pattern Recognition · Computer Science 2026-02-10 Yuchan Jie, Yushen Xu, Xiaosong Li, Huafeng Li, Haishu Tan, Feiping Nie

MMIF-AMIN: Adaptive Loss-Driven Multi-Scale Invertible Dense Network for Multimodal Medical Image Fusion

Multimodal medical image fusion (MMIF) aims to integrate images from different modalities to produce a comprehensive image that enhances medical diagnosis by accurately depicting organ structures, tissue textures, and metabolic information.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Tao Luo, Weihua Xu

Learning a Graph Neural Network with Cross Modality Interaction for Image Fusion

Infrared and visible image fusion has gradually proved to be a vital fork in the field of multi-modality imaging technologies. In recent developments, researchers not only focus on the quality of fused images but also evaluate their…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Jiawei Li, Jiansheng Chen, Jinyuan Liu, Huimin Ma

Cross-Modality Fusion Transformer for Multispectral Object Detection

Multispectral image pairs can provide the combined information, making object detection applications more reliable and robust in the open world. To fully exploit the different modalities, we present a simple yet effective cross-modality…

Image and Video Processing · Electrical Eng. & Systems 2022-10-05 Fang Qingyun, Han Dapeng, Wang Zhaokui

LASFNet: A Lightweight Attention-Guided Self-Modulation Feature Fusion Network for Multimodal Object Detection

Effective deep feature extraction via feature-level fusion is crucial for multimodal object detection. However, previous studies often involve complex training processes that integrate modality-specific features by stacking multiple…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Lei Hao, Lina Xu, Chang Liu, Yanni Dong

MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching

Many keypoint detection and description methods have been proposed for image matching or registration. While these methods demonstrate promising performance for single-modality image matching, they often struggle with multimodal data…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yepeng Liu, Zhichao Sun, Baosheng Yu, Yitian Zhao, Bo Du, Yongchao Xu, Jun Cheng

LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing

Despite the rapid evolution of semantic segmentation for land cover classification in high-resolution remote sensing imagery, integrating multiple data modalities such as Digital Surface Model (DSM), RGB, and Near-infrared (NIR) remains a…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Tong Wang, Guanzhou Chen, Xiaodong Zhang, Chenxi Liu, Xiaoliang Tan, Jiaqi Wang, Chanjuan He, Wenlin Zhou

MSFNet-CPD: Multi-Scale Cross-Modal Fusion Network for Crop Pest Detection

Accurate identification of agricultural pests is essential for crop protection but remains challenging due to the large intra-class variance and fine-grained differences among pest species. While deep learning has advanced pest detection,…

Artificial Intelligence · Computer Science 2025-05-06 Jiaqi Zhang, Zhuodong Liu, Kejian Yu

CHITNet: A Complementary to Harmonious Information Transfer Network for Infrared and Visible Image Fusion

Current infrared and visible image fusion (IVIF) methods go to great lengths to excavate complementary features and design complex fusion strategies, which is extremely challenging. To this end, we rethink the IVIF outside the box,…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Keying Du, Huafeng Li, Yafei Zhang, Zhengtao Yu

Multi-scale Adaptive Fusion Network for Hyperspectral Image Denoising

Removing the noise and improving the visual quality of hyperspectral images (HSIs) is challenging in academia and industry. Great efforts have been made to leverage local, global or spectral context information for HSI denoising. However,…

Image and Video Processing · Electrical Eng. & Systems 2023-04-20 Haodong Pan, Feng Gao, Junyu Dong, Qian Du

Cross-Modal Object Tracking via Modality-Aware Fusion Network and A Large-Scale Dataset

Visual tracking often faces challenges such as invalid targets and decreased performance in low-light conditions when relying solely on RGB image sequences. While incorporating additional modalities like depth and infrared data has proven…

Computer Vision and Pattern Recognition · Computer Science 2023-12-25 Lei Liu, Mengya Zhang, Cheng Li, Chenglong Li, Jin Tang