Related papers: A Toolkit for Generating Code Knowledge Graphs

CodeKGC: Code Language Model for Generative Knowledge Graph Construction

Current generative knowledge graph construction approaches usually fail to capture structural knowledge by simply flattening natural language into serialized texts or a specification language. However, large generative language model…

Computation and Language · Computer Science · 2024-01-19 · Zhen Bi, Jing Chen, Yinuo Jiang, Feiyu Xiong, Wei Guo, Huajun Chen, Ningyu Zhang

Generating Knowledge Graphs by Employing Natural Language Processing and Machine Learning Techniques within the Scholarly Domain

The continuous growth of scientific literature brings innovations and, at the same time, raises new challenges. One of them is related to the fact that its analysis has become difficult due to the high volume of published papers for which…

Computation and Language · Computer Science · 2020-11-06 · Danilo Dessì, Francesco Osborne, Diego Reforgiato Recupero, Davide Buscaldi, Enrico Motta

WikiCoder: Learning to Write Knowledge-Powered Code

We tackle the problem of automatic generation of computer programs from a few pairs of input-output examples. The starting point of this work is the observation that in many applications a solution program must use external knowledge not…

Machine Learning · Computer Science · 2023-03-16 · Théo Matricon, Nathanaël Fijalkow, Gaëtan Margueritte

GraphSearchNet: Enhancing GNNs via Capturing Global Dependencies for Semantic Code Search

Code search aims to retrieve accurate code snippets based on a natural language query to improve software productivity and quality. With the massive amount of available programs such as (on GitHub or Stack Overflow), identifying and…

Software Engineering · Computer Science · 2023-02-14 · Shangqing Liu, Xiaofei Xie, Jingkai Siow, Lei Ma, Guozhu Meng, Yang Liu

LogicLens: Leveraging Semantic Code Graph to explore Multi Repository large systems

Understanding large software systems is a challenging task, especially when code is distributed across multiple repositories and microservices. Developers often need to reason not only about the structure of the code, but also about its…

Software Engineering · Computer Science · 2026-01-19 · Niko Usai, Dario Montagnini, Kristian Ilianov Iliev, Raffaele Camanzo

A Framework for Large Scale Synthetic Graph Dataset Generation

Recently there has been increasing interest in developing and deploying deep graph learning algorithms for many tasks, such as fraud detection and recommender systems. Albeit, there is a limited number of publicly available graph-structured…

Machine Learning · Computer Science · 2023-10-06 · Sajad Darabi, Piotr Bigaj, Dawid Majchrowski, Artur Kasymov, Pawel Morkisz, Alex Fit-Florea

Semantic Code Graph -- an information model to facilitate software comprehension

Software comprehension can be extremely time-consuming due to the ever-growing size of codebases. Consequently, there is an increasing need to accelerate the code comprehension process to facilitate maintenance and reduce associated costs.…

Software Engineering · Computer Science · 2024-01-15 · Krzysztof Borowski, Bartosz Baliś, Tomasz Orzechowski

Generative Code Modeling with Graphs

Generative models for source code are an interesting structured prediction problem, requiring to reason about both hard syntactic and semantic constraints as well as about natural, likely programs. We present a novel model for this problem…

Machine Learning · Computer Science · 2019-04-18 · Marc Brockschmidt, Miltiadis Allamanis, Alexander L. Gaunt, Oleksandr Polozov

Methods for Knowledge Graph Construction from Text Collections: Development and Applications

Virtually every sector of society is experiencing a dramatic growth in the volume of unstructured textual data that is generated and published, from news and social media online interactions, through open access scholarly communications and…

Computation and Language · Computer Science · 2026-03-30 · Vanni Zavarella

Unsupervised Construction of Knowledge Graphs From Text and Code

The scientific literature is a rich source of information for data mining with conceptual knowledge graphs; the open science movement has enriched this literature with complementary source code that implements scientific models. To exploit…

Machine Learning · Computer Science · 2019-08-27 · Kun Cao, James Fairbanks

SCoGen: Scenario-Centric Graph-Based Synthesis of Real-World Code Problems

Significant advancements have been made in the capabilities of code large language models, leading to their rapid adoption and application across a wide range of domains. However, their further advancements are often constrained by the…

Software Engineering · Computer Science · 2025-09-19 · Xifeng Yao, Dongyu Lang, Wu Zhang, Xintong Guo, Huarui Xie, Yinhao Ni, Ping Liu, Guang Shen, Yi Bai, Dandan Tu, Changzheng Zhang

ChartGen: Scaling Chart Understanding Via Code-Guided Synthetic Chart Generation

Chart-to-code reconstruction -- the task of recovering executable plotting scripts from chart images -- provides important insights into a model's ability to ground data visualizations in precise, machine-readable form. Yet many existing…

Human-Computer Interaction · Computer Science · 2025-07-29 · Jovana Kondic, Pengyuan Li, Dhiraj Joshi, Zexue He, Shafiq Abedin, Jennifer Sun, Ben Wiesel, Eli Schwartz, Ahmed Nassar, Bo Wu, Assaf Arbelle, Aude Oliva, Dan Gutfreund, Leonid Karlinsky, Rogerio Feris

FabKG: A Knowledge graph of Manufacturing Science domain utilizing structured and unconventional unstructured knowledge source

As the demands for large-scale information processing have grown, knowledge graph-based approaches have gained prominence for representing general and domain knowledge. The development of such general representations is essential,…

Computation and Language · Computer Science · 2022-06-22 · Aman Kumar, Akshay G Bharadwaj, Binil Starly, Collin Lynch

VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph

The availability of vast amounts of visual data with heterogeneous features is a key factor for developing, testing, and benchmarking of new computer vision (CV) algorithms and architectures. Most visual datasets are created and curated for…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-29 · Jicheng Yuan, Anh Le-Tuan, Manh Nguyen-Duc, Trung-Kien Tran, Manfred Hauswirth, Danh Le-Phuoc

AI-Driven Code Refactoring: Using Graph Neural Networks to Enhance Software Maintainability

This study explores Graph Neural Networks (GNNs) as a transformative tool for code refactoring, using abstract syntax trees (ASTs) to boost software maintainability. It analyzes a dataset of 2 million snippets from CodeSearchNet and a…

Artificial Intelligence · Computer Science · 2025-04-15 · Gopichand Bandarupalli

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

Fine-tuning for large language models (LLMs) typically requires substantial amounts of high-quality supervised data, which is both costly and labor-intensive to acquire. While synthetic data generation has emerged as a promising solution,…

Computation and Language · Computer Science · 2025-05-28 · Zihong Chen, Wanli Jiang, Jinzhe Li, Zhonghang Yuan, Huanjun Kong, Wanli Ouyang, Nanqing Dong

Graphcode: Learning from multiparameter persistent homology using graph neural networks

We introduce graphcodes, a novel multi-scale summary of the topological properties of a dataset that is based on the well-established theory of persistent homology. Graphcodes handle datasets that are filtered along two real-valued scale…

Algebraic Topology · Mathematics · 2024-05-24 · Michael Kerber, Florian Russold

Code Definition Analysis for Call Graph Generation

Enterprise level software is implemented using multi-layer architecture. These layers are often implemented using de-coupled solutions with millions of lines of code. Programmers often have to track and debug a function call from user…

Software Engineering · Computer Science · 2016-10-17 · Anne Veenendaal, Elliot Daly, Eddie Jones, Zhao Gang, Sumalini Vartak, Rahul S Patwardhan

Knowledge Graphs: The Future of Data Integration and Insightful Discovery

Knowledge graphs are an efficient method for representing and connecting information across various concepts, useful in reasoning, question answering, and knowledge base completion tasks. They organize data by linking points, enabling…

Artificial Intelligence · Computer Science · 2025-02-25 · Saher Mohamed, Kirollos Farah, Abdelrahman Lotfy, Kareem Rizk, Abdelrahman Saeed, Shahenda Mohamed, Ghada Khouriba, Tamer Arafa

Program Classification Using Gated Graph Attention Neural Network for Online Programming Service

Online programming services, such as GitHub, TopCoder, and EduCoder, have promoted a lot of social interactions among the service users. However, the existing social interactions are rather limited and inefficient due to the rapid…

Artificial Intelligence · Computer Science · 2019-03-12 · Mingming Lu, Dingwu Tan, Naixue Xiong, Zailiang Chen, Haifeng Li