Related papers: WikiCoder: Learning to Write Knowledge-Powered Cod…

WikiGraphs: A Wikipedia Text - Knowledge Graph Paired Dataset

We present a new dataset of Wikipedia articles each paired with a knowledge graph, to facilitate the research in conditional text generation, graph generation and graph representation learning. Existing graph-text paired datasets typically…

Computation and Language · Computer Science 2021-07-21 Luyu Wang , Yujia Li , Ozlem Aslan , Oriol Vinyals

DreamCoder: Growing generalizable, interpretable knowledge with wake-sleep Bayesian program learning

Expert problem-solving is driven by powerful languages for thinking about problems and their solutions. Acquiring expertise means learning these languages -- systems of concepts, alongside the skills to use them. We present DreamCoder, a…

Artificial Intelligence · Computer Science 2020-06-16 Kevin Ellis , Catherine Wong , Maxwell Nye , Mathias Sable-Meyer , Luc Cary , Lucas Morales , Luke Hewitt , Armando Solar-Lezama , Joshua B. Tenenbaum

Assisted Knowledge Graph Authoring: Human-Supervised Knowledge Graph Construction from Natural Language

Encyclopedic knowledge graphs, such as Wikidata, host an extensive repository of millions of knowledge statements. However, domain-specific knowledge from fields such as history, physics, or medicine is significantly underrepresented in…

Computation and Language · Computer Science 2024-01-17 Marcel Gohsen , Benno Stein

An Approach for Automatic Construction of an Algorithmic Knowledge Graph from Textual Resources

There is enormous growth in various fields of research. This development is accompanied by new problems. To solve these problems efficiently and in an optimized manner, algorithms are created and described by researchers in the scientific…

Artificial Intelligence · Computer Science 2022-05-27 Jyotima Patel , Biswanath Dutta

A Toolkit for Generating Code Knowledge Graphs

Knowledge graphs have been proven extremely useful in powering diverse applications in semantic search and natural language understanding. In this paper, we present GraphGen4Code, a toolkit to build code knowledge graphs that can similarly…

Databases · Computer Science 2021-09-29 Ibrahim Abdelaziz , Julian Dolby , Jamie McCusker , Kavitha Srinivas

Automatic Program Synthesis of Long Programs with a Learned Garbage Collector

We consider the problem of generating automatic code given sample input-output pairs. We train a neural network to map from the current state and the outputs to the program's next statement. The neural network optimizes multiple tasks…

Machine Learning · Computer Science 2019-01-23 Amit Zohar , Lior Wolf

Text Generation from Knowledge Graphs with Graph Transformers

Generating texts which express complex ideas spanning multiple sentences requires a structured representation of their content (document plan), but these representations are prohibitively expensive to manually produce. In this work, we…

Computation and Language · Computer Science 2022-03-28 Rik Koncel-Kedziorski , Dhanush Bekal , Yi Luan , Mirella Lapata , Hannaneh Hajishirzi

Building Knowledge Bases for the Generation of Software Documentation

Automated text generation requires a underlying knowledge base from which to generate, which is often difficult to produce. Software documentation is one domain in which parts of this knowledge base may be derived automatically. In this…

cmp-lg · Computer Science 2008-02-03 Cecile Paris , Keith Vander Linden

Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training

Prior work on Data-To-Text Generation, the task of converting knowledge graph (KG) triples into natural text, focused on domain-specific benchmark datasets. In this paper, however, we verbalize the entire English Wikidata KG, and discuss…

Computation and Language · Computer Science 2021-03-16 Oshin Agarwal , Heming Ge , Siamak Shakeri , Rami Al-Rfou

Unsupervised Construction of Knowledge Graphs From Text and Code

The scientific literature is a rich source of information for data mining with conceptual knowledge graphs; the open science movement has enriched this literature with complementary source code that implements scientific models. To exploit…

Machine Learning · Computer Science 2019-08-27 Kun Cao , James Fairbanks

AutoKG: Constructing Virtual Knowledge Graphs from Unstructured Documents for Question Answering

Knowledge graphs (KGs) have the advantage of providing fine-grained detail for question-answering systems. Unfortunately, building a reliable KG is time-consuming and expensive as it requires human intervention. To overcome this issue, we…

Computation and Language · Computer Science 2021-03-12 Seunghak Yu , Tianxing He , James Glass

Towards information-rich, logical text generation with knowledge-enhanced neural models

Text generation system has made massive promising progress contributed by deep learning techniques and has been widely applied in our life. However, existing end-to-end neural models suffer from the problem of tending to generate…

Artificial Intelligence · Computer Science 2020-03-03 Hao Wang , Bin Guo , Wei Wu , Zhiwen Yu

Natural Language-Guided Programming

In today's software world with its cornucopia of reusable software libraries, when a programmer is faced with a programming task that they suspect can be completed through the use of a library, they often look for code examples using a…

Software Engineering · Computer Science 2021-10-08 Geert Heyman , Rafael Huysegems , Pascal Justen , Tom Van Cutsem

WikiDataSets: Standardized sub-graphs from Wikidata

Developing new ideas and algorithms in the fields of graph processing and relational learning requires public datasets. While Wikidata is the largest open source knowledge graph, involving more than fifty million entities, it is larger than…

Machine Learning · Computer Science 2019-10-07 Armand Boschin , Thomas Bonald

Ontology Design Facilitating Wikibase Integration -- and a Worked Example for Historical Data

Wikibase -- which is the software underlying Wikidata -- is a powerful platform for knowledge graph creation and management. However, it has been developed with a crowd-sourced knowledge graph creation scenario in mind, which in particular…

Artificial Intelligence · Computer Science 2022-05-30 Cogan Shimizu , Andrew Eells , Seila Gonzalez , Lu Zhou , Pascal Hitzler , Alicia Sheill , Catherine Foley , Dean Rehberger

Wikibook-Bot - Automatic Generation of a Wikipedia Book

A Wikipedia book (known as Wikibook) is a collection of Wikipedia articles on a particular theme that is organized as a book. We propose Wikibook-Bot, a machine-learning based technique for automatically generating high quality Wikibooks…

Digital Libraries · Computer Science 2018-12-31 Shahar Admati , Lior Rokach , Bracha Shapira

Scholarly Knowledge Graph Construction from Published Software Packages

The value of structured scholarly knowledge for research and society at large is well understood, but producing scholarly knowledge (i.e., knowledge traditionally published in articles) in structured form remains a challenge. We propose an…

Digital Libraries · Computer Science 2023-12-06 Muhammad Haris , Sören Auer , Markus Stocker

WikiAutoGen: Towards Multi-Modal Wikipedia-Style Article Generation

Knowledge discovery and collection are intelligence-intensive tasks that traditionally require significant human effort to ensure high-quality outputs. Recent research has explored multi-agent frameworks for automating Wikipedia-style…

Computer Vision and Pattern Recognition · Computer Science 2025-09-08 Zhongyu Yang , Jun Chen , Dannong Xu , Junjie Fei , Xiaoqian Shen , Liangbing Zhao , Chun-Mei Feng , Mohamed Elhoseiny

Wikidata as a seed for Web Extraction

Wikidata has grown to a knowledge graph with an impressive size. To date, it contains more than 17 billion triples collecting information about people, places, films, stars, publications, proteins, and many more. On the other side, most of…

Computation and Language · Computer Science 2024-01-17 Kunpeng Guo , Dennis Diefenbach , Antoine Gourru , Christophe Gravier

InCoder: A Generative Model for Code Infilling and Synthesis

Code is seldom written in a single left-to-right pass and is instead repeatedly edited and refined. We introduce InCoder, a unified generative model that can perform program synthesis (via left-to-right generation) as well as editing (via…

Software Engineering · Computer Science 2023-04-11 Daniel Fried , Armen Aghajanyan , Jessy Lin , Sida Wang , Eric Wallace , Freda Shi , Ruiqi Zhong , Wen-tau Yih , Luke Zettlemoyer , Mike Lewis