Related papers: Memory based fusion for multi-modal deep learning

200 papers

Developing effective multimodal data fusion strategies has become increasingly essential for improving the predictive power of statistical machine learning methods across a wide range of applications, from autonomous driving to medical…

Machine Learning · Computer Science 2025-07-29 Ziyi Liang, Annie Qu, Babak Shahbaba

Deep learning-based image fusion approaches have obtained wide attention in recent years, achieving promising performance in terms of visual perception. However, the fusion module in the current deep learning-based methods suffers from two…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Dongyu Rao, Xiao-Jun Wu, Tianyang Xu, Guoyang Chen

This paper introduces an innovative multi-modal fusion deep learning approach to overcome the drawbacks of traditional single-modal recognition techniques. These drawbacks include incomplete information and limited diagnostic accuracy.…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Xiaoyi Liu, Hongjie Qiu, Muqing Li, Zhou Yu, Yutian Yang, Yafeng Yan

Traditional and deep learning-based fusion methods generated the intermediate decision map to obtain the fusion image through a series of post-processing procedures. However, the fusion results generated by these methods are easy to lose…

Computer Vision and Pattern Recognition · Computer Science 2021-04-21 Yongsheng Zang, Dongming Zhou, Changcheng Wang, Rencan Nie, Yanbu Guo

Multi-modal learning has been intensified in recent years, especially for applications in facial analysis and action unit detection whilst there still exist two main challenges in terms of 1) relevant feature learning for representation and…

Computer Vision and Pattern Recognition · Computer Science 2022-03-23 Xiang Zhang, Lijun Yin

Multiple modalities can provide more valuable information than a single one by describing the same contents in various ways. Hence, it is highly expected to learn effective joint representation by fusing the features of different modalities.…

Computer Vision and Pattern Recognition · Computer Science 2018-10-09 Di Hu, Feiping Nie, Xuelong Li

Multimodal medical image fusion is a crucial task that combines complementary information from different imaging modalities into a unified representation, thereby enhancing diagnostic accuracy and treatment planning. While deep learning…

Image and Video Processing · Electrical Eng. & Systems 2024-11-19 Meng Zhou, Yuxuan Zhang, Xiaolan Xu, Jiayi Wang, Farzad Khalvati

The goal of multi-modal learning is to use complementary information on the relevant task provided by the multiple modalities to achieve reliable and robust performance. Recently, deep learning has led to significant improvement in multi-modal…

Computer Vision and Pattern Recognition · Computer Science 2018-11-05 Jaekyum Kim, Junho Koh, Yecheol Kim, Jaehyung Choi, Youngbae Hwang, Jun Won Choi

Feature fusion, the combination of features from different layers or branches, is an omnipresent part of modern network architectures. It is often implemented via simple operations, such as summation or concatenation, but this might not be…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Yimian Dai, Fabian Gieseke, Stefan Oehmcke, Yiquan Wu, Kobus Barnard

Multimodal learning enhances the perceptual capabilities of cognitive systems by integrating information from different sensory modalities. However, existing multimodal fusion research typically assumes static integration, not fully…

Neural and Evolutionary Computing · Computer Science 2025-05-16 Xiang He, Dongcheng Zhao, Yang Li, Qingqun Kong, Xin Yang, Yi Zeng

We propose a compact and effective framework to fuse multimodal features at multiple layers in a single network. The framework consists of two innovative fusion schemes. Firstly, unlike existing multimodal methods that necessitate…

Computer Vision and Pattern Recognition · Computer Science 2021-08-12 Yikai Wang, Fuchun Sun, Ming Lu, Anbang Yao

Effective deep feature extraction via feature-level fusion is crucial for multimodal object detection. However, previous studies often involve complex training processes that integrate modality-specific features by stacking multiple…

Computer Vision and Pattern Recognition · Computer Science 2025-06-27 Lei Hao, Lina Xu, Chang Liu, Yanni Dong

Gaining insights into the structural and functional mechanisms of the brain has been a longstanding focus in neuroscience research, particularly in the context of understanding and treating neuropsychiatric disorders such as Schizophrenia…

Computer Vision and Pattern Recognition · Computer Science 2026-01-27 Badhan Mazumder, Lei Wu, Vince D. Calhoun, Dong Hye Ye

Multi-modal fusion is of great significance in neuroscience which integrates information from different modalities and can achieve better performance than uni-modal methods in downstream tasks. Current multi-modal fusion methods in brain…

Artificial Intelligence · Computer Science 2026-04-03 Rui Dong, Xiaotong Zhang, Jiaxing Li, Yueying Li, Jiayin Wei, Youyong Kong

Integration of multimodal information from various sources has been shown to boost the performance of machine learning models and thus has received increased attention in recent years. Often such models use deep modality-specific networks…

Machine Learning · Computer Science 2022-11-22 Shiv Shankar, Laure Thompson, Madalina Fiterau

Point clouds and images could provide complementary information when representing 3D objects. Fusing the two kinds of data usually helps to improve the detection results. However, it is challenging to fuse the two data modalities, due to…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Xun Tan, Xingyu Chen, Guowei Zhang, Jishiyu Ding, Xuguang Lan

Multimodal learning has gained much success in recent years. However, current multimodal fusion methods adopt the attention mechanism of Transformers to implicitly learn the underlying correlation of multimodal features. As a result, the…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Thanh-Dat Truong, Christophe Bobda, Nitin Agarwal, Khoa Luu

The main idea of multimodal recommendation is the rational utilization of the item's multimodal information to improve the recommendation performance. Previous works directly integrate item multimodal features with item ID embeddings,…

Information Retrieval · Computer Science 2023-04-25 Yan Zhou, Jie Guo, Hao Sun, Bin Song, Fei Richard Yu

In this paper, we present an Adaptive Ensemble Learning framework that aims to boost the performance of deep neural networks by intelligently fusing features through ensemble learning techniques. The proposed framework integrates ensemble…

Artificial Intelligence · Computer Science 2023-04-07 Neelesh Mungoli

Deep learning-based methods have achieved encouraging performances in the field of magnetic resonance (MR) image reconstruction. Nevertheless, to properly learn a powerful and robust model, these methods generally require large quantities…

Image and Video Processing · Electrical Eng. & Systems 2023-04-18 Ruoyou Wu, Cheng Li, Juan Zou, Qiegen Liu, Hairong Zheng, Shanshan Wang