English
Related papers

Related papers: Code-Craft: Hierarchical Graph-Based Code Summariz…

200 papers

We propose a novel method for code summarization utilizing Heterogeneous Code Representations (HCRs) and our specially designed HierarchyNet. HCRs effectively capture essential code features at lexical, syntactic, and semantic levels by…

Software Engineering · Computer Science 2023-05-10 Minh Huynh Nguyen , Nghi D. Q. Bui , Truong Son Hy , Long Tran-Thanh , Tien N. Nguyen

Graph stream summarization refers to the process of processing a continuous stream of edges that form a rapidly evolving graph. The primary challenges in handling graph streams include the impracticality of fully storing the ever-growing…

Databases · Computer Science 2024-12-23 Xuan Zhao , Xike Xie , Christian S. Jensen

In large-scale software development, understanding the functionality and intent behind complex codebases is critical for effective development and maintenance. While code summarization has been widely studied, existing methods primarily…

Software Engineering · Computer Science 2025-01-15 Nilesh Dhulshette , Sapan Shah , Vinay Kulkarni

The growing demand for automated graph algorithm reasoning has attracted increasing attention in the large language model (LLM) community. Recent LLM-based graph reasoning methods typically decouple task descriptions from graph data,…

Software Engineering · Computer Science 2026-03-10 Fali Wang , Chenglin Weng , Xianren Zhang , Siyuan Hong , Hui Liu , Suhang Wang

Recent advancements in Large Language Models (LLMs) have transformed code generation from natural language queries. However, despite their extensive knowledge and ability to produce high-quality code, LLMs often struggle with contextual…

Artificial Intelligence · Computer Science 2025-07-17 Mihir Athale , Vishal Vaddina

LLMs excel in localized code completion but struggle with repository-level tasks due to limited context windows and complex semantic and structural dependencies across codebases. While Retrieval-Augmented Generation (RAG) mitigates context…

Software Engineering · Computer Science 2025-09-09 Xingliang Wang , Baoyi Wang , Chen Zhi , Junxiao Han , Xinkui Zhao , Jianwei Yin , Shuiguang Deng

The advancement of graph-based malware analysis is critically limited by the absence of large-scale datasets that capture the inherent hierarchical structure of software. Existing methods often oversimplify programs into single level…

Machine Learning · Computer Science 2026-05-26 Han Chen , Hanchen Wang , Hongmei Chen , Ying Zhang , Lu Qin , Wenjie Zhang

Code retrieval is to find the code snippet from a large corpus of source code repositories that highly matches the query of natural language description. Recent work mainly uses natural language processing techniques to process both query…

Artificial Intelligence · Computer Science 2021-06-23 Xiang Ling , Lingfei Wu , Saizhuo Wang , Gaoning Pan , Tengfei Ma , Fangli Xu , Alex X. Liu , Chunming Wu , Shouling Ji

Heterogeneous graph neural networks have recently gained attention for long document summarization, modeling the extraction as a node classification task. Although effective, these models often require external tools or additional machine…

Computation and Language · Computer Science 2024-10-30 Margarita Bugueño , Hazem Abou Hamdan , Gerard de Melo

Source code summarization aims to generate natural language summaries from structured code snippets for better understanding code functionalities. However, automatic code summarization is challenging due to the complexity of the source code…

Machine Learning · Computer Science 2021-05-14 Shangqing Liu , Yu Chen , Xiaofei Xie , Jingkai Siow , Yang Liu

Large Language Models (LLMs) excel in stand-alone code tasks like HumanEval and MBPP, but struggle with handling entire code repositories. This challenge has prompted research on enhancing LLM-codebase interaction at a repository scale.…

Software Engineering · Computer Science 2024-08-13 Xiangyan Liu , Bo Lan , Zhiyuan Hu , Yang Liu , Zhicheng Zhang , Fei Wang , Michael Shieh , Wenmeng Zhou

Retrieval-Augmented Generation (RAG) grounds large language models in external evidence, yet it still falters when answers must be pieced together across semantically distant documents. We close this gap with the Hierarchical Lexical Graph…

Cross-modal retrieval between videos and texts has attracted growing attentions due to the rapid emergence of videos on the web. The current dominant approach for this problem is to learn a joint embedding space to measure cross-modal…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Shizhe Chen , Yida Zhao , Qin Jin , Qi Wu

Code search is a widely used technique by developers during software development. It provides semantically similar implementations from a large code corpus to developers based on their queries. Existing techniques leverage deep learning…

Software Engineering · Computer Science 2022-02-17 Weisong Sun , Chunrong Fang , Yuchen Chen , Guanhong Tao , Tingxu Han , Quanjun Zhang

Code classification is a difficult issue in program understanding and automatic coding. Due to the elusive syntax and complicated semantics in programs, most existing studies use techniques based on abstract syntax tree (AST) and graph…

Software Engineering · Computer Science 2025-09-25 Guang Yang , Tiancheng Jin , Liang Dou

Semantic parsing of long documents remains challenging due to quadratic growth in pairwise composition and memory requirements. We introduce \textbf{Hierarchical Segment-Graph Memory (HSGM)}, a novel framework that decomposes an input of…

Artificial Intelligence · Computer Science 2025-09-24 Dong Liu , Yanxuan Yu

Object categories are typically organized into a multi-granularity taxonomic hierarchy. When classifying categories at different hierarchy levels, traditional uni-modal approaches focus primarily on image features, revealing limitations in…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Peng Xia , Xingtong Yu , Ming Hu , Lie Ju , Zhiyong Wang , Peibo Duan , Zongyuan Ge

Source code summaries are short natural language descriptions of code snippets that help developers better understand and maintain source code. There has been a surge of work on automatic code summarization to reduce the burden of writing…

Software Engineering · Computer Science 2021-07-06 Yanlin Wang , Ensheng Shi , Lun Du , Xiaodi Yang , Yuxuan Hu , Shi Han , Hongyu Zhang , Dongmei Zhang

With the increasing popularity of large language models (LLMs), reasoning on basic graph algorithm problems is an essential intermediate step in assessing their abilities to process and infer complex graph reasoning tasks. Existing methods…

Computation and Language · Computer Science 2024-08-27 Qiaolong Cai , Zhaowei Wang , Shizhe Diao , James Kwok , Yangqiu Song

Enterprise level software is implemented using multi-layer architecture. These layers are often implemented using de-coupled solutions with millions of lines of code. Programmers often have to track and debug a function call from user…

Software Engineering · Computer Science 2016-10-17 Anne Veenendaal , Elliot Daly , Eddie Jones , Zhao Gang , Sumalini Vartak , Rahul S Patwardhan
‹ Prev 1 2 3 10 Next ›