Related papers: CodexGraph: Bridging Large Language Models and Cod…

Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks

Recent advances in Large Language Models (LLMs) have shown promise in function-level code generation, yet repository-level software engineering tasks remain challenging. Current solutions predominantly rely on proprietary LLM agents, which…

Software Engineering · Computer Science 2025-06-25 Hongyuan Tao , Ying Zhang , Zhenhao Tang , Hongen Peng , Xukun Zhu , Bingchang Liu , Yingguang Yang , Ziyin Zhang , Zhaogui Xu , Haipeng Zhang , Linchao Zhu , Rui Wang , Hang Yu , Jianguo Li , Peng Di

Knowledge Graph Based Repository-Level Code Generation

Recent advancements in Large Language Models (LLMs) have transformed code generation from natural language queries. However, despite their extensive knowledge and ability to produce high-quality code, LLMs often struggle with contextual…

Artificial Intelligence · Computer Science 2025-07-17 Mihir Athale , Vishal Vaddina

GraphCodeAgent: Dual Graph-Guided LLM Agent for Retrieval-Augmented Repo-Level Code Generation

Writing code requires significant time and effort in software development. To automate this process, researchers have made substantial progress for code generation. Recently, large language models (LLMs) have demonstrated remarkable…

Software Engineering · Computer Science 2025-11-19 Jia Li , Xianjie Shi , Kechi Zhang , Ge Li , Zhi Jin , Lei Li , Huangzhao Zhang , Jia Li , Fang Liu , Yuwei Zhang , Zhengwei Tao , Yihong Dong , Yuqi Zhu , Chongyang Tao

CodeGraph: Enhancing Graph Reasoning of LLMs with Code

With the increasing popularity of large language models (LLMs), reasoning on basic graph algorithm problems is an essential intermediate step in assessing their abilities to process and infer complex graph reasoning tasks. Existing methods…

Computation and Language · Computer Science 2024-08-27 Qiaolong Cai , Zhaowei Wang , Shizhe Diao , James Kwok , Yangqiu Song

CodeGRAG: Bridging the Gap between Natural Language and Programming Language via Graphical Retrieval Augmented Generation

Utilizing large language models to generate codes has shown promising meaning in software development revolution. Despite the intelligence shown by the large language models, their specificity in code generation can still be improved due to…

Software Engineering · Computer Science 2025-05-20 Kounianhua Du , Jizheng Chen , Renting Rui , Huacan Chai , Lingyue Fu , Wei Xia , Yasheng Wang , Ruiming Tang , Yong Yu , Weinan Zhang

HDLxGraph: Bridging Large Language Models and HDL Repositories via HDL Graph Databases

Retrieval Augmented Generation (RAG) is an essential agent for Large Language Model (LLM) aided Description Language (HDL) tasks, addressing the challenges of limited training data and prohibitively long prompts. However, its performance in…

Hardware Architecture · Computer Science 2026-03-10 Pingqing Zheng , Jiayin Qin , Fuqi Zhang , Niraj Chitla , Zishen Wan , Shang Wu , Yu Cao , Caiwen Ding , Yang , Zhao

Can Large Language Models Analyze Graphs like Professionals? A Benchmark, Datasets and Models

The need to analyze graphs is ubiquitous across various fields, from social networks to biological research and recommendation systems. Therefore, enabling the ability of large language models (LLMs) to process graphs is an important step…

Computation and Language · Computer Science 2025-11-04 Xin Li , Weize Chen , Qizhi Chu , Haopeng Li , Zhaojun Sun , Ran Li , Chen Qian , Yiwei Wei , Zhiyuan Liu , Chuan Shi , Maosong Sun , Cheng Yang

Understanding Codebase like a Professional! Human-AI Collaboration for Code Comprehension

Understanding an unfamiliar codebase is an essential task for developers in various scenarios, such as during the onboarding process. Especially when the codebase is large and time is limited, achieving a decent level of comprehension…

Human-Computer Interaction · Computer Science 2026-02-16 Jie Gao , Yue Xue , Xiaofei Xie , SoeMin Thant , Erika Lee , Bowen Xu

GraphCogent: Mitigating LLMs' Working Memory Constraints via Multi-Agent Collaboration in Complex Graph Understanding

Large language models (LLMs) show promising performance on small-scale graph reasoning tasks but fail when handling real-world graphs with complex queries. This phenomenon arises from LLMs' working memory constraints, which result in their…

Artificial Intelligence · Computer Science 2025-10-01 Rongzheng Wang , Shuang Liang , Qizhi Chen , Yihong Huang , Muquan Li , Yizhuo Ma , Dongyang Zhang , Ke Qin , Man-Fai Leung

Bridging Code Property Graphs and Language Models for Program Analysis

Large Language Models (LLMs) face critical challenges when analyzing security vulnerabilities in real world codebases: token limits prevent loading entire repositories, code embeddings fail to capture inter procedural data flows, and LLMs…

Cryptography and Security · Computer Science 2026-03-27 Ahmed Lekssays

Can Language Models Solve Graph Problems in Natural Language?

Large language models (LLMs) are increasingly adopted for a variety of tasks with implicit graphical structures, such as planning in robotics, multi-hop question answering or knowledge probing, structured commonsense reasoning, and more.…

Computation and Language · Computer Science 2024-01-09 Heng Wang , Shangbin Feng , Tianxing He , Zhaoxuan Tan , Xiaochuang Han , Yulia Tsvetkov

RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph

Large Language Models (LLMs) excel in code generation yet struggle with modern AI software engineering tasks. Unlike traditional function-level or file-level coding tasks, AI software engineering requires not only basic coding proficiency…

Software Engineering · Computer Science 2025-03-20 Siru Ouyang , Wenhao Yu , Kaixin Ma , Zilin Xiao , Zhihan Zhang , Mengzhao Jia , Jiawei Han , Hongming Zhang , Dong Yu

CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models

Pre-trained on massive amounts of code and text data, large language models (LLMs) have demonstrated remarkable achievements in performing code generation tasks. With additional execution-based feedback, these models can act as agents with…

Computation and Language · Computer Science 2024-11-14 Jierui Li , Hung Le , Yingbo Zhou , Caiming Xiong , Silvio Savarese , Doyen Sahoo

LocAgent: Graph-Guided LLM Agents for Code Localization

Code localization--identifying precisely where in a codebase changes need to be made--is a fundamental yet challenging task in software maintenance. Existing approaches struggle to efficiently navigate complex codebases when identifying…

Software Engineering · Computer Science 2025-04-30 Zhaoling Chen , Xiangru Tang , Gangda Deng , Fang Wu , Jialong Wu , Zhiwei Jiang , Viktor Prasanna , Arman Cohan , Xingyao Wang

Colorful Talks with Graphs: Human-Interpretable Graph Encodings for Large Language Models

Graph problems are fundamentally challenging for large language models (LLMs). While LLMs excel at processing unstructured text, graph tasks require reasoning over explicit structure, permutation invariance, and computationally complex…

Machine Learning · Computer Science 2026-04-23 Angelo Zangari , Peyman Baghershahi , Sourav Medya

CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-level Coding Challenges

Large Language Models (LLMs) have shown promise in automated code generation but typically excel only in simpler tasks such as generating standalone code units. Real-world software development, however, often involves complex code…

Software Engineering · Computer Science 2024-08-12 Kechi Zhang , Jia Li , Ge Li , Xianjie Shi , Zhi Jin

Leveraging Large Language Models for Automated Reproduction of Networking Research Results

Code reproduction is a cornerstone of scientific validity, yet it remains a formidable challenge in computer networking research due to the scarcity of open-source implementations and the complexity of heterogeneous system architectures.…

Networking and Internet Architecture · Computer Science 2026-02-17 Yining Jiang , Yunxin Xu , Wenyun Xu , Yufan Zhu , Tangtang He , Haiying Huang , Letian Zhu , Qingyu Song , Qiang Su , Lizhao You , Lu Tang , Wanjin Feng , Yuchao Zhang , Linghe Kong , Qiao Xiang , Jiwu Shu

CodeTaste: Can LLMs Generate Human-Level Code Refactorings?

Large language model (LLM) coding agents can generate working code, but their solutions often accumulate complexity, duplication, and architectural debt. Human developers address such issues through refactoring: behavior-preserving program…

Software Engineering · Computer Science 2026-03-05 Alex Thillen , Niels Mündler , Veselin Raychev , Martin Vechev

LARGER: Lexically Anchored Repository Graph Exploration and Retrieval

Repository-level coding agents must first localize the files and symbols relevant to a task; failures at this stage can cascade across downstream objectives ranging from patch generation to test writing and codebase question answering.…

Information Retrieval · Computer Science 2026-05-19 Yuntong Hu , Tongli Su , Liang Zhao , Bowen Zhu , Hasibul Haque

RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code Completion

Code Large Language Models (CodeLLMs) have demonstrated impressive proficiency in code completion tasks. However, they often fall short of fully understanding the extensive context of a project repository, such as the intricacies of…

Software Engineering · Computer Science 2024-08-15 Huy N. Phan , Hoang N. Phan , Tien N. Nguyen , Nghi D. Q. Bui