Related papers: Progressively Modality Freezing for Multi-Modal En…

MCSFF: Multi-modal Consistency and Specificity Fusion Framework for Entity Alignment

Multi-modal entity alignment (MMEA) is essential for enhancing knowledge graphs and improving information retrieval and question-answering systems. Existing methods often focus on integrating modalities through their complementarity but…

Artificial Intelligence · Computer Science 2024-10-21 Wei Ai , Wen Deng , Hongyi Chen , Jiayi Du , Tao Meng , Yuntao Shou

Universal Multi-modal Entity Alignment via Iteratively Fusing Modality Similarity Paths

The objective of Entity Alignment (EA) is to identify equivalent entity pairs from multiple Knowledge Graphs (KGs) and create a more comprehensive and unified KG. The majority of EA methods have primarily focused on the structural modality…

Computation and Language · Computer Science 2023-10-16 Bolin Zhu , Xiaoze Liu , Xin Mao , Zhuo Chen , Lingbing Guo , Tao Gui , Qi Zhang

MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid

Multi-modal entity alignment (MMEA) aims to discover identical entities across different knowledge graphs (KGs) whose entities are associated with relevant images. However, current MMEA algorithms rely on KG-level modality fusion strategies…

Artificial Intelligence · Computer Science 2023-08-01 Zhuo Chen , Jiaoyan Chen , Wen Zhang , Lingbing Guo , Yin Fang , Yufeng Huang , Yichi Zhang , Yuxia Geng , Jeff Z. Pan , Wenting Song , Huajun Chen

Multi-modal Contrastive Representation Learning for Entity Alignment

Multi-modal entity alignment aims to identify equivalent entities between two different multi-modal knowledge graphs, which consist of structural triples and images associated with entities. Most previous works focus on how to utilize and…

Computation and Language · Computer Science 2022-09-05 Zhenxi Lin , Ziheng Zhang , Meng Wang , Yinghui Shi , Xian Wu , Yefeng Zheng

IBMEA: Exploring Variational Information Bottleneck for Multi-modal Entity Alignment

Multi-modal entity alignment (MMEA) aims to identify equivalent entities between multi-modal knowledge graphs (MMKGs), where the entities can be associated with related images. Most existing studies integrate multi-modal information heavily…

Computation and Language · Computer Science 2024-07-30 Taoyu Su , Jiawei Sheng , Shicheng Wang , Xinghua Zhang , Hongbo Xu , Tingwen Liu

Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information. However, existing MMEA approaches…

Artificial Intelligence · Computer Science 2023-08-02 Zhuo Chen , Lingbing Guo , Yin Fang , Yichi Zhang , Jiaoyan Chen , Jeff Z. Pan , Yangning Li , Huajun Chen , Wen Zhang

Pseudo-Label Calibration Semi-supervised Multi-Modal Entity Alignment

Multi-modal entity alignment (MMEA) aims to identify equivalent entities between two multi-modal knowledge graphs for integration. Unfortunately, prior arts have attempted to improve the interaction and fusion of multi-modal information,…

Machine Learning · Computer Science 2024-03-05 Luyao Wang , Pengnian Qi , Xigang Bao , Chunlai Zhou , Biao Qin

Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality

Federated learning (FL) has obtained tremendous progress in providing collaborative training solutions for distributed data silos with privacy guarantees. However, few existing works explore a more realistic scenario where the clients hold…

Machine Learning · Computer Science 2024-06-18 Liwei Che , Jiaqi Wang , Xinyue Liu , Fenglong Ma

Progressive Representation Learning for Multimodal Sentiment Analysis with Incomplete Modalities

Multimodal Sentiment Analysis (MSA) seeks to infer human emotions by integrating textual, acoustic, and visual cues. However, existing approaches often rely on all modalities are completeness, whereas real-world applications frequently…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Jindi Bao , Jianjun Qian , Mengkai Yan , Jian Yang

Meta Fusion: A Unified Framework For Multimodality Fusion with Mutual Learning

Developing effective multimodal data fusion strategies has become increasingly essential for improving the predictive power of statistical machine learning methods across a wide range of applications, from autonomous driving to medical…

Machine Learning · Computer Science 2025-07-29 Ziyi Liang , Annie Qu , Babak Shahbaba

Mitigating Modality Bias in Multi-modal Entity Alignment from a Causal Perspective

Multi-Modal Entity Alignment (MMEA) aims to retrieve equivalent entities from different Multi-Modal Knowledge Graphs (MMKGs), a critical information retrieval task. Existing studies have explored various fusion paradigms and consistency…

Multimedia · Computer Science 2025-05-16 Taoyu Su , Jiawei Sheng , Duohe Ma , Xiaodong Li , Juwei Yue , Mengxiao Song , Yingkai Tang , Tingwen Liu

Zoom and Shift are All You Need

Feature alignment serves as the primary mechanism for fusing multimodal data. We put forth a feature alignment approach that achieves full integration of multimodal information. This is accomplished via an alternating process of shifting…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Jiahao Qin

Multi-Modal Knowledge Graph Transformer Framework for Multi-Modal Entity Alignment

Multi-Modal Entity Alignment (MMEA) is a critical task that aims to identify equivalent entity pairs across multi-modal knowledge graphs (MMKGs). However, this task faces challenges due to the presence of different types of information,…

Computation and Language · Computer Science 2023-10-11 Qian Li , Cheng Ji , Shu Guo , Zhaoji Liang , Lihong Wang , Jianxin Li

Robust Dynamic Multi-Modal Data Fusion: A Model Uncertainty Perspective

This paper is concerned with multi-modal data fusion (MMDF) under unexpected modality failures in nonlinear non-Gaussian dynamic processes. An efficient framework to tackle this problem is proposed. In particular, a notion termed modality…

Machine Learning · Computer Science 2021-11-24 Bin Liu

Leveraging Intra-modal and Inter-modal Interaction for Multi-Modal Entity Alignment

Multi-modal entity alignment (MMEA) aims to identify equivalent entity pairs across different multi-modal knowledge graphs (MMKGs). Existing approaches focus on how to better encode and aggregate information from different modalities.…

Information Retrieval · Computer Science 2024-04-30 Zhiwei Hu , Víctor Gutiérrez-Basulto , Zhiliang Xiang , Ru Li , Jeff Z. Pan

MyGram: Modality-aware Graph Transformer with Global Distribution for Multi-modal Entity Alignment

Multi-modal entity alignment aims to identify equivalent entities between two multi-modal Knowledge graphs by integrating multi-modal data, such as images and text, to enrich the semantic representations of entities. However, existing…

Artificial Intelligence · Computer Science 2026-01-21 Zhifei Li , Ziyue Qin , Xiangyu Luo , Xiaoju Hou , Yue Zhao , Miao Zhang , Zhifang Huang , Kui Xiao , Bing Yang

MDE: Modality Discrimination Enhancement for Multi-modal Recommendation

Multi-modal recommendation systems aim to enhance performance by integrating an item's content features across various modalities with user behavior data. Effective utilization of features from different modalities requires addressing two…

Information Retrieval · Computer Science 2025-02-27 Hang Zhou , Yucheng Wang , Huijing Zhan

Exploring Cross-Modal Flows for Few-Shot Learning

Aligning features from different modalities, is one of the most fundamental challenges for cross-modal tasks. Although pre-trained vision-language models can achieve a general alignment between image and text, they often require…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Ziqi Jiang , Yanghao Wang , Long Chen

A Closer Look at Multimodal Representation Collapse

We aim to develop a fundamental understanding of modality collapse, a recently observed empirical phenomenon wherein models trained for multimodal fusion tend to rely only on a subset of the modalities, ignoring the rest. We show that…

Machine Learning · Computer Science 2025-08-18 Abhra Chaudhuri , Anjan Dutta , Tu Bui , Serban Georgescu

PMR: Prototypical Modal Rebalance for Multimodal Learning

Multimodal learning (MML) aims to jointly exploit the common priors of different modalities to compensate for their inherent limitations. However, existing MML methods often optimize a uniform objective for different modalities, leading to…

Machine Learning · Computer Science 2022-11-15 Yunfeng Fan , Wenchao Xu , Haozhao Wang , Junxiao Wang , Song Guo