Related papers: Learning to Represent Programs with Graphs

Generative Code Modeling with Graphs

Generative models for source code are an interesting structured prediction problem, requiring to reason about both hard syntactic and semantic constraints as well as about natural, likely programs. We present a novel model for this problem…

Machine Learning · Computer Science 2019-04-18 Marc Brockschmidt , Miltiadis Allamanis , Alexander L. Gaunt , Oleksandr Polozov

Learning to Represent Programs with Heterogeneous Graphs

Program source code contains complex structure information, which can be represented in structured data forms like trees or graphs. To acquire the structural information in source code, most existing researches use abstract syntax trees…

Software Engineering · Computer Science 2022-04-13 Kechi Zhang , Wenhan Wang , Huangzhao Zhang , Ge Li , Zhi Jin

Open Vocabulary Learning on Source Code with a Graph-Structured Cache

Machine learning models that take computer program source code as input typically use Natural Language Processing (NLP) techniques. However, a major challenge is that code is written using an open, rapidly changing vocabulary due to, e.g.,…

Machine Learning · Computer Science 2019-05-21 Milan Cvitkovic , Badal Singh , Anima Anandkumar

Modeling Semantics with Gated Graph Neural Networks for Knowledge Base Question Answering

The most approaches to Knowledge Base Question Answering are based on semantic parsing. In this paper, we address the problem of learning vector representations for complex semantic parses that consist of multiple entities and relations.…

Computation and Language · Computer Science 2018-08-14 Daniil Sorokin , Iryna Gurevych

Comparison of Syntactic and Semantic Representations of Programs in Neural Embeddings

Neural approaches to program synthesis and understanding have proliferated widely in the last few years; at the same time graph based neural networks have become a promising new tool. This work aims to be the first empirical study comparing…

Software Engineering · Computer Science 2020-01-28 Austin P. Wright , Herbert Wiklicky

Universal Representation for Code

Learning from source code usually requires a large amount of labeled data. Despite the possible scarcity of labeled data, the trained model is highly task-specific and lacks transferability to different tasks. In this work, we present…

Machine Learning · Computer Science 2021-03-05 Linfeng Liu , Hoan Nguyen , George Karypis , Srinivasan Sengamedu

Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning

Large Language Models (LLMs) have achieved remarkable success across various domains. However, they still face significant challenges, including high computational costs for training and limitations in solving complex reasoning problems.…

Machine Learning · Computer Science 2025-05-20 Hang Gao , Chenhao Zhang , Tie Wang , Junsuo Zhao , Fengge Wu , Changwen Zheng , Huaping Liu

Graphs for deep learning representations

In recent years, Deep Learning methods have achieved state of the art performance in a vast range of machine learning tasks, including image classification and multilingual automatic text translation. These architectures are trained to…

Machine Learning · Computer Science 2020-12-15 Carlos Lassance

Software Vulnerability Detection via Deep Learning over Disaggregated Code Graph Representation

Identifying vulnerable code is a precautionary measure to counter software security breaches. Tedious expert effort has been spent to build static analyzers, yet insecure patterns are barely fully enumerated. This work explores a deep…

Artificial Intelligence · Computer Science 2021-09-09 Yufan Zhuang , Sahil Suneja , Veronika Thost , Giacomo Domeniconi , Alessandro Morari , Jim Laredo

Learning to Extend Program Graphs to Work-in-Progress Code

Source code spends most of its time in a broken or incomplete state during software development. This presents a challenge to machine learning for code, since high-performing models typically rely on graph structured representations of…

Machine Learning · Computer Science 2021-06-01 Xuechen Li , Chris J. Maddison , Daniel Tarlow

Learning Program Semantics with Code Representations: An Empirical Study

Program semantics learning is the core and fundamental for various code intelligent tasks e.g., vulnerability detection, clone detection. A considerable amount of existing works propose diverse approaches to learn the program semantics for…

Software Engineering · Computer Science 2022-03-23 Jing Kai Siow , Shangqing Liu , Xiaofei Xie , Guozhu Meng , Yang Liu

Building Graph Representations of Deep Vector Embeddings

Patterns stored within pre-trained deep neural networks compose large and powerful descriptive languages that can be used for many different purposes. Typically, deep network representations are implemented within vector embedding spaces,…

Neural and Evolutionary Computing · Computer Science 2017-08-10 Dario Garcia-Gasulla , Armand Vilalta , Ferran Parés , Jonatan Moreno , Eduard Ayguadé , Jesus Labarta , Ulises Cortés , Toyotaro Suzumura

Program Classification Using Gated Graph Attention Neural Network for Online Programming Service

The online programing services, such as Github,TopCoder, and EduCoder, have promoted a lot of social interactions among the service users. However, the existing social interactions is rather limited and inefficient due to the rapid…

Artificial Intelligence · Computer Science 2019-03-12 Mingming Lu , Dingwu Tan , Naixue Xiong , Zailiang Chen , Haifeng Li

Learning to map source code to software vulnerability using code-as-a-graph

We explore the applicability of Graph Neural Networks in learning the nuances of source code from a security perspective. Specifically, whether signatures of vulnerabilities in source code can be learned from its graph representation, in…

Software Engineering · Computer Science 2020-06-17 Sahil Suneja , Yunhui Zheng , Yufan Zhuang , Jim Laredo , Alessandro Morari

Learning Deep Generative Models of Graphs

Graphs are fundamental data structures which concisely capture the relational structure in many important real-world domains, such as knowledge graphs, physical and social interactions, language, and chemistry. Here we introduce a powerful…

Machine Learning · Computer Science 2018-03-12 Yujia Li , Oriol Vinyals , Chris Dyer , Razvan Pascanu , Peter Battaglia

Representation Learning on Graphs: Methods and Applications

Machine learning on graphs is an important and ubiquitous task with applications ranging from drug design to friendship recommendation in social networks. The primary challenge in this domain is finding a way to represent, or encode, graph…

Social and Information Networks · Computer Science 2018-04-11 William L. Hamilton , Rex Ying , Jure Leskovec

Towards Versatile Graph Learning Approach: from the Perspective of Large Language Models

Graph-structured data are the commonly used and have wide application scenarios in the real world. For these diverse applications, the vast variety of learning tasks, graph domains, and complex graph learning procedures present challenges…

Machine Learning · Computer Science 2024-02-26 Lanning Wei , Jun Gao , Huan Zhao , Quanming Yao

Multi-View Graph Representation for Programming Language Processing: An Investigation into Algorithm Detection

Program representation, which aims at converting program source code into vectors with automatically extracted features, is a fundamental problem in programming language processing (PLP). Recent work tries to represent programs with neural…

Machine Learning · Computer Science 2022-02-28 Ting Long , Yutong Xie , Xianyu Chen , Weinan Zhang , Qinxiang Cao , Yong Yu

A General Path-Based Representation for Predicting Program Properties

Predicting program properties such as names or expression types has a wide range of applications. It can ease the task of programming and increase programmer productivity. A major challenge when learning from programs is $\textit{how to…

Programming Languages · Computer Science 2018-04-24 Uri Alon , Meital Zilberstein , Omer Levy , Eran Yahav

Let Your Graph Do the Talking: Encoding Structured Data for LLMs

How can we best encode structured data into sequential form for use in large language models (LLMs)? In this work, we introduce a parameter-efficient method to explicitly represent structured data for LLMs. Our method, GraphToken, learns an…

Machine Learning · Computer Science 2024-02-09 Bryan Perozzi , Bahare Fatemi , Dustin Zelle , Anton Tsitsulin , Mehran Kazemi , Rami Al-Rfou , Jonathan Halcrow