Related papers: Code-Craft: Hierarchical Graph-Based Code Summariz…

HierarchyNet: Learning to Summarize Source Code with Heterogeneous Representations

We propose a novel method for code summarization utilizing Heterogeneous Code Representations (HCRs) and our specially designed HierarchyNet. HCRs effectively capture essential code features at lexical, syntactic, and semantic levels by…

Software Engineering · Computer Science 2023-05-10 Minh Huynh Nguyen, Nghi D. Q. Bui, Truong Son Hy, Long Tran-Thanh, Tien N. Nguyen

HIGGS: HIerarchy-Guided Graph Stream Summarization

Graph stream summarization refers to the process of processing a continuous stream of edges that form a rapidly evolving graph. The primary challenges in handling graph streams include the impracticality of fully storing the ever-growing…

Databases · Computer Science 2024-12-23 Xuan Zhao, Xike Xie, Christian S. Jensen

Hierarchical Repository-Level Code Summarization for Business Applications Using Local LLMs

In large-scale software development, understanding the functionality and intent behind complex codebases is critical for effective development and maintenance. While code summarization has been widely studied, existing methods primarily…

Software Engineering · Computer Science 2025-01-15 Nilesh Dhulshette, Sapan Shah, Vinay Kulkarni

GraphSkill: Documentation-Guided Hierarchical Retrieval-Augmented Coding for Complex Graph Reasoning

The growing demand for automated graph algorithm reasoning has attracted increasing attention in the large language model (LLM) community. Recent LLM-based graph reasoning methods typically decouple task descriptions from graph data,…

Software Engineering · Computer Science 2026-03-10 Fali Wang, Chenglin Weng, Xianren Zhang, Siyuan Hong, Hui Liu, Suhang Wang

Knowledge Graph Based Repository-Level Code Generation

Recent advancements in Large Language Models (LLMs) have transformed code generation from natural language queries. However, despite their extensive knowledge and ability to produce high-quality code, LLMs often struggle with contextual…

Artificial Intelligence · Computer Science 2025-07-17 Mihir Athale, Vishal Vaddina

GRACE: Graph-Guided Repository-Aware Code Completion through Hierarchical Code Fusion

LLMs excel in localized code completion but struggle with repository-level tasks due to limited context windows and complex semantic and structural dependencies across codebases. While Retrieval-Augmented Generation (RAG) mitigates context…

Software Engineering · Computer Science 2025-09-09 Xingliang Wang, Baoyi Wang, Chen Zhi, Junxiao Han, Xinkui Zhao, Jianwei Yin, Shuiguang Deng

HiGraph: A Large-Scale Hierarchical Graph Dataset for Malware Analysis

The advancement of graph-based malware analysis is critically limited by the absence of large-scale datasets that capture the inherent hierarchical structure of software. Existing methods often oversimplify programs into single level…

Machine Learning · Computer Science 2026-05-26 Han Chen, Hanchen Wang, Hongmei Chen, Ying Zhang, Lu Qin, Wenjie Zhang

Deep Graph Matching and Searching for Semantic Code Retrieval

Code retrieval aims to find, in a large corpus of source code repositories, the code snippet that best matches a natural language query. Recent work mainly uses natural language processing techniques to process both query…

Artificial Intelligence · Computer Science 2021-06-23 Xiang Ling, Lingfei Wu, Saizhuo Wang, Gaoning Pan, Tengfei Ma, Fangli Xu, Alex X. Liu, Chunming Wu, Shouling Ji

GraphLSS: Integrating Lexical, Structural, and Semantic Features for Long Document Extractive Summarization

Heterogeneous graph neural networks have recently gained attention for long document summarization, modeling the extraction as a node classification task. Although effective, these models often require external tools or additional machine…

Computation and Language · Computer Science 2024-10-30 Margarita Bugueño, Hazem Abou Hamdan, Gerard de Melo

Retrieval-Augmented Generation for Code Summarization via Hybrid GNN

Source code summarization aims to generate natural language summaries from structured code snippets for better understanding code functionalities. However, automatic code summarization is challenging due to the complexity of the source code…

Machine Learning · Computer Science 2021-05-14 Shangqing Liu, Yu Chen, Xiaofei Xie, Jingkai Siow, Yang Liu

CodexGraph: Bridging Large Language Models and Code Repositories via Code Graph Databases

Large Language Models (LLMs) excel in stand-alone code tasks like HumanEval and MBPP, but struggle with handling entire code repositories. This challenge has prompted research on enhancing LLM-codebase interaction at a repository scale.…

Software Engineering · Computer Science 2024-08-13 Xiangyan Liu, Bo Lan, Zhiyuan Hu, Yang Liu, Zhicheng Zhang, Fei Wang, Michael Shieh, Wenmeng Zhou

Hierarchical Lexical Graph for Enhanced Multi-Hop Retrieval

Retrieval-Augmented Generation (RAG) grounds large language models in external evidence, yet it still falters when answers must be pieced together across semantically distant documents. We close this gap with the Hierarchical Lexical Graph…

Information Retrieval · Computer Science 2025-06-11 Abdellah Ghassel, Ian Robinson, Gabriel Tanase, Hal Cooper, Bryan Thompson, Zhen Han, Vassilis N. Ioannidis, Soji Adeshina, Huzefa Rangwala

Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning

Cross-modal retrieval between videos and texts has attracted growing attention due to the rapid emergence of videos on the web. The current dominant approach for this problem is to learn a joint embedding space to measure cross-modal…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Shizhe Chen, Yida Zhao, Qin Jin, Qi Wu

Code Search based on Context-aware Code Translation

Code search is a widely used technique by developers during software development. It provides semantically similar implementations from a large code corpus to developers based on their queries. Existing techniques leverage deep learning…

Software Engineering · Computer Science 2022-02-17 Weisong Sun, Chunrong Fang, Yuchen Chen, Guanhong Tao, Tingxu Han, Quanjun Zhang

Heterogeneous Directed Hypergraph Neural Network over abstract syntax tree (AST) for Code Classification

Code classification is a difficult issue in program understanding and automatic coding. Due to the elusive syntax and complicated semantics in programs, most existing studies use techniques based on abstract syntax tree (AST) and graph…

Software Engineering · Computer Science 2025-09-25 Guang Yang, Tiancheng Jin, Liang Dou

HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics

Semantic parsing of long documents remains challenging due to quadratic growth in pairwise composition and memory requirements. We introduce Hierarchical Segment-Graph Memory (HSGM), a novel framework that decomposes an input of…

Artificial Intelligence · Computer Science 2025-09-24 Dong Liu, Yanxuan Yu

HGCLIP: Exploring Vision-Language Models with Graph Representations for Hierarchical Understanding

Object categories are typically organized into a multi-granularity taxonomic hierarchy. When classifying categories at different hierarchy levels, traditional uni-modal approaches focus primarily on image features, revealing limitations in…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Peng Xia, Xingtong Yu, Ming Hu, Lie Ju, Zhiyong Wang, Peibo Duan, Zongyuan Ge

CoCoSum: Contextual Code Summarization with Multi-Relational Graph Neural Network

Source code summaries are short natural language descriptions of code snippets that help developers better understand and maintain source code. There has been a surge of work on automatic code summarization to reduce the burden of writing…

Software Engineering · Computer Science 2021-07-06 Yanlin Wang, Ensheng Shi, Lun Du, Xiaodi Yang, Yuxuan Hu, Shi Han, Hongyu Zhang, Dongmei Zhang

CodeGraph: Enhancing Graph Reasoning of LLMs with Code

With the increasing popularity of large language models (LLMs), reasoning on basic graph algorithm problems is an essential intermediate step in assessing their abilities to process and infer complex graph reasoning tasks. Existing methods…

Computation and Language · Computer Science 2024-08-27 Qiaolong Cai, Zhaowei Wang, Shizhe Diao, James Kwok, Yangqiu Song

Code Definition Analysis for Call Graph Generation

Enterprise level software is implemented using multi-layer architecture. These layers are often implemented using de-coupled solutions with millions of lines of code. Programmers often have to track and debug a function call from user…

Software Engineering · Computer Science 2016-10-17 Anne Veenendaal, Elliot Daly, Eddie Jones, Zhao Gang, Sumalini Vartak, Rahul S Patwardhan