Related papers: Hypergraph based Understanding for Document Semant…

Enhancing Visually-Rich Document Understanding via Layout Structure Modeling

In recent years, the use of multi-modal pre-trained Transformers has led to significant advancements in visually-rich document understanding. However, existing models have mainly focused on features such as text and vision while neglecting…

Computation and Language · Computer Science 2023-08-16 Qiwei Li , Zuchao Li , Xiantao Cai , Bo Du , Hai Zhao

Hierarchical Attention Graph for Scientific Document Summarization in Global and Local Level

Scientific document summarization has been a challenging task due to the long structure of the input text. The long input hinders the simultaneous effective modeling of both global high-order relations between sentences and local…

Computation and Language · Computer Science 2024-05-17 Chenlong Zhao , Xiwen Zhou , Xiaopeng Xie , Yong Zhang

MGSA: Multi-Granularity Graph Structure Attention for Knowledge Graph-to-Text Generation

The Knowledge Graph-to-Text Generation task aims to convert structured knowledge graphs into coherent and human-readable natural language text. Recent efforts in this field have focused on enhancing pre-trained language models (PLMs) by…

Computation and Language · Computer Science 2024-09-24 Shanshan Wang , Chun Zhang , Ning Zhang

End-to-End Hierarchical Relation Extraction for Generic Form Understanding

Form understanding is a challenging problem which aims to recognize semantic entities from the input document and their hierarchical relations. Previous approaches face significant difficulty dealing with the complexity of the task, thus…

Artificial Intelligence · Computer Science 2021-06-03 Tuan-Anh Nguyen Dang , Duc-Thanh Hoang , Quang-Bach Tran , Chih-Wei Pan , Thanh-Dat Nguyen

Structural-Aware Key Node Identification in Hypergraphs via Representation Learning and Fine-Tuning

Evaluating node importance is a critical aspect of analyzing complex systems, with broad applications in digital marketing, rumor suppression, and disease control. However, existing methods typically rely on conventional network structures…

Social and Information Networks · Computer Science 2025-07-29 Xiaonan Ni , Guangyuan Mei , Su-Su Zhang , Yang Chen , Xin Xu , Chuang Liu , Xiu-Xiu Zhan

HeGTa: Leveraging Heterogeneous Graph-enhanced Large Language Models for Few-shot Complex Table Understanding

Table understanding (TU) has achieved promising advancements, but it faces the challenges of the scarcity of manually labeled tables and the presence of complex table structures.To address these challenges, we propose HGT, a framework with…

Computation and Language · Computer Science 2024-12-17 Rihui Jin , Yu Li , Guilin Qi , Nan Hu , Yuan-Fang Li , Jiaoyan Chen , Jianan Wang , Yongrui Chen , Dehai Min , Sheng Bi

Towards Efficient LLM-aware Heterogeneous Graph Learning

Heterogeneous graphs are widely present in real-world complex networks, where the diversity of node and relation types leads to complex and rich semantics. Efforts for modeling complex relation semantics in heterogeneous graphs are…

Computation and Language · Computer Science 2025-11-25 Wenda Li , Tongya Zheng , Shunyu Liu , Yu Wang , Kaixuan Chen , Hanyang Yuan , Bingde Hu , Zujie Ren , Mingli Song , Gang Chen

OS-HGAdapter: Open Semantic Hypergraph Adapter for Large Language Models Assisted Entropy-Enhanced Image-Text Alignment

Text-image alignment constitutes a foundational challenge in multimedia content understanding, where effective modeling of cross-modal semantic correspondences critically enhances retrieval system performance through joint embedding space…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Rongjun Chen , Chengsi Yao , Jinchang Ren , Xianxian Zeng , Peixian Wang , Jun Yuan , Jiawen Li , Huimin Zhao , Xu Lu

Learning to Recognize Discontiguous Entities

This paper focuses on the study of recognizing discontiguous entities. Motivated by a previous work, we propose to use a novel hypergraph representation to jointly encode discontiguous entities of unbounded length, which can overlap with…

Computation and Language · Computer Science 2020-05-28 Aldrian Obaja Muis , Wei Lu

Cross-Granularity Hypergraph Retrieval-Augmented Generation for Multi-hop Question Answering

Multi-hop question answering (MHQA) requires integrating knowledge scattered across multiple passages to derive the correct answer. Traditional retrieval-augmented generation (RAG) methods primarily focus on coarse-grained textual semantic…

Computation and Language · Computer Science 2025-08-18 Changjian Wang , Weihong Deng , Weili Guan , Quan Lu , Ning Jiang

Deep Semantic Graph Learning via LLM based Node Enhancement

Graph learning has attracted significant attention due to its widespread real-world applications. Current mainstream approaches rely on text node features and obtain initial node embeddings through shallow embedding learning using GNNs,…

Artificial Intelligence · Computer Science 2025-02-13 Chuanqi Shi , Yiyi Tao , Hang Zhang , Lun Wang , Shaoshuai Du , Yixian Shen , Yanxin Shen

Graph-Convolutional Networks: Named Entity Recognition and Large Language Model Embedding in Document Clustering

Recent advances in machine learning, particularly Large Language Models (LLMs) such as BERT and GPT, provide rich contextual embeddings that improve text representation. However, current document clustering approaches often ignore the…

Computation and Language · Computer Science 2024-12-20 Imed Keraghel , Mohamed Nadif

Boosting Entity-aware Image Captioning with Multi-modal Knowledge Graph

Entity-aware image captioning aims to describe named entities and events related to the image by utilizing the background knowledge in the associated article. This task remains challenging as it is difficult to learn the association between…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Wentian Zhao , Yao Hu , Heda Wang , Xinxiao Wu , Jiebo Luo

Investigating Graph Structure Information for Entity Alignment with Dangling Cases

Entity alignment (EA) aims to discover the equivalent entities in different knowledge graphs (KGs), which play an important role in knowledge engineering. Recently, EA with dangling entities has been proposed as a more realistic setting,…

Computation and Language · Computer Science 2023-04-11 Jin Xu , Yangning Li , Xiangjin Xie , Yinghui Li , Niu Hu , Haitao Zheng , Yong Jiang

Hierarchical Attentional Hybrid Neural Networks for Document Classification

Document classification is a challenging task with important applications. The deep learning approaches to the problem have gained much attention recently. Despite the progress, the proposed models do not incorporate the knowledge of the…

Computation and Language · Computer Science 2019-10-15 Jader Abreu , Luis Fred , David Macêdo , Cleber Zanchettin

RAGA: Relation-aware Graph Attention Networks for Global Entity Alignment

Entity alignment (EA) is the task to discover entities referring to the same real-world object from different knowledge graphs (KGs), which is the most crucial step in integrating multi-source KGs. The majority of the existing…

Computation and Language · Computer Science 2021-03-02 Renbo Zhu , Meng Ma , Ping Wang

Text Classification Based on Knowledge Graphs and Improved Attention Mechanism

To resolve the semantic ambiguity in texts, we propose a model, which innovatively combines a knowledge graph with an improved attention mechanism. An existing knowledge base is utilized to enrich the text with relevant contextual concepts.…

Computation and Language · Computer Science 2024-01-30 Siyu Li , Lu Chen , Chenwei Song , Xinyi Liu

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning

Extracting meaningful entities belonging to predefined categories from Visually-rich Form-like Documents (VFDs) is a challenging task. Visual and layout features such as font, background, color, and bounding box location and size provide…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Hao Wang , Xiahua Chen , Rui Wang , Chenhui Chu

MLaGA: Multimodal Large Language and Graph Assistant

Large Language Models (LLMs) have demonstrated substantial efficacy in advancing graph-structured data analysis. Prevailing LLM-based graph methods excel in adapting LLMs to text-rich graphs, wherein node attributes are text descriptions.…

Artificial Intelligence · Computer Science 2025-06-04 Dongzhe Fan , Yi Fang , Jiajin Liu , Djellel Difallah , Qiaoyu Tan

Pretraining Language Models with Text-Attributed Heterogeneous Graphs

In many real-world scenarios (e.g., academic networks, social platforms), different types of entities are not only associated with texts but also connected by various relationships, which can be abstracted as Text-Attributed Heterogeneous…

Computation and Language · Computer Science 2023-10-24 Tao Zou , Le Yu , Yifei Huang , Leilei Sun , Bowen Du