Deep Mamba Multi-modal Learning

Jian Zhu; Xin Zou; Yu Cui; Zhangmin Huang; Chenshu Hu; Bo Lyu

Multimedia 2024-06-27 v1

Abstract

Inspired by the strong performance of Mamba networks, we propose Deep Mamba Multi-modal Learning (DMML), a novel method for fusing multi-modal features. Applying DMML to multimedia retrieval, we propose Deep Mamba Multi-modal Hashing (DMMH), a method that combines high accuracy with fast inference. We validate DMMH on three public datasets, where it achieves state-of-the-art results.
