CodexDB: Generating Code for Processing SQL Queries using GPT-3 Codex

Databases 2022-04-20 v1 Computation and Language Machine Learning

Abstract

CodexDB is an SQL processing engine whose internals can be customized via natural language instructions. It is built as a framework on top of OpenAI's GPT-3 Codex model, which translates text into code. CodexDB decomposes complex SQL queries into a series of simple processing steps, each described in natural language. These steps are enriched with user-provided instructions and descriptions of database properties, and Codex translates the resulting text into query processing code. An early prototype of CodexDB generates correct code for a majority of queries in the WikiSQL benchmark and can be customized in various ways.
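To make the pipeline in the abstract more concrete, the following is a minimal sketch of the prompt-construction stage only: a simple query is decomposed into natural language steps, enriched with user instructions and a description of the database, and assembled into text that a code-generating model could complete. It is an illustration under stated assumptions, not CodexDB's implementation. The names `SimpleQuery`, `describe_steps`, and `build_prompt`, the prompt layout, and the wording of each step are all hypothetical, and the call to the model is deliberately left out.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class SimpleQuery:
    """Hypothetical stand-in for a parsed single-table SQL query."""
    table: str
    columns: List[str]
    predicate: Optional[str] = None  # e.g. "population > 1000000"


def describe_steps(query: SimpleQuery) -> List[str]:
    """Decompose the query into simple steps phrased in natural language."""
    steps = [f"Load data for table {query.table}."]
    if query.predicate:
        steps.append(f"Filter rows where {query.predicate}.")
    steps.append(f"Keep only the columns {', '.join(query.columns)}.")
    steps.append("Write the result to 'result.csv'.")
    return steps


def build_prompt(
    query: SimpleQuery,
    table_columns: List[str],
    data_path: str,
    instructions: List[str],
) -> str:
    """Assemble a docstring-style prompt (assumed layout, not CodexDB's).

    The prompt holds a description of the database, followed by the
    processing steps and the user-provided instructions as a numbered list.
    """
    steps = describe_steps(query) + list(instructions)
    lines = [
        '"""',
        f"Table {query.table} with columns {', '.join(table_columns)}, "
        f"stored in '{data_path}'.",
        "Processing steps:",
    ]
    lines += [f"{i}. {step}" for i, step in enumerate(steps, start=1)]
    lines.append('"""')
    return "\n".join(lines)


if __name__ == "__main__":
    # Roughly: SELECT name FROM cities WHERE population > 1000000
    query = SimpleQuery("cities", ["name"], "population > 1000000")
    print(build_prompt(
        query,
        table_columns=["name", "population"],
        data_path="cities.csv",
        instructions=["Use the pandas library."],
    ))
```

In this sketch, customization amounts to changing the `instructions` list, for example asking for a different library or for extra logging. This mirrors the idea of steering the generated code through natural language rather than by modifying the engine itself.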

Cite

@article{arxiv.2204.08941,
  title  = {CodexDB: Generating Code for Processing SQL Queries using GPT-3 Codex},
  author = {Immanuel Trummer},
  journal= {arXiv preprint arXiv:2204.08941},
  year   = {2022}
}