English
Related papers

Related papers: GAP-Gen: Guided Automatic Python Code Generation

200 papers

We consider the problem of parsing natural language descriptions into source code written in a general-purpose programming language like Python. Existing data-driven methods treat this problem as a language generation task without…

Computation and Language · Computer Science 2017-04-07 Pengcheng Yin , Graham Neubig

Automatic code generation is to generate the program code according to the given natural language description. The current mainstream approach uses neural networks to encode natural language descriptions, and output abstract syntax trees…

Software Engineering · Computer Science 2022-02-16 Maosheng Zhong , Gen Liu , Hongwei Li , Jiangling Kuang , Jinshan Zeng , Mingwen Wang

Recent advancements in natural language processing \cite{gpt2} \cite{BERT} have led to near-human performance in multiple natural language tasks. In this paper, we seek to understand whether similar techniques can be applied to a highly…

Computation and Language · Computer Science 2021-02-23 Luis Perez , Lizi Ottens , Sudharshan Viswanathan

In this article we show how the problem of neural text generation can be constructively reformulated in terms of transitions between the states of a finite-state machine. This framework leads to an efficient approach to guiding text…

Computation and Language · Computer Science 2023-08-22 Brandon T. Willard , Rémi Louf

In recent years, data has emerged as the new gold, serving as a powerful tool for creating intelligent systems. However, procuring high-quality data remains challenging, especially for code. To address this, we developed TinyPy Generator, a…

Programming Languages · Computer Science 2024-03-12 Kamel Yamani , Marwa Naïr , Riyadh Baghdadi

Due to the development of pre-trained language models, automated code generation techniques have shown great promise in recent years. However, the generated code is difficult to meet the syntactic constraints of the target language,…

Software Engineering · Computer Science 2023-08-01 Guang Yang , Yu Zhou , Xiang Chen , Xiangyu Zhang , Yiran Xu , Tingting Han , Taolue Chen

Language models have shown remarkable proficiency in code generation; nevertheless, ensuring type correctness remains a challenge. Although traditional methods, such as constrained decoding, alleviate this problem by externally rejecting…

Programming Languages · Computer Science 2026-02-09 Zhechong Huang , Zhao Zhang , Ruyi Ji , Tingxuan Xia , Qihao Zhu , Qinxiang Cao , Zeyu Sun , Wiggin Zhou , Yingfei Xiong

Syntactically controlled paraphrase generation requires language models to generate paraphrases for sentences according to specific syntactic structures. Existing fine-tuning methods for this task are costly as all the parameters of the…

Computation and Language · Computer Science 2023-05-29 Yixin Wan , Kuan-Hao Huang , Kai-Wei Chang

Automated documentation of programming source code and automated code generation from natural language are challenging tasks of both practical and scientific interest. Progress in these areas has been limited by the low availability of…

Computation and Language · Computer Science 2017-07-10 Antonio Valerio Miceli Barone , Rico Sennrich

Code generation from text requires understanding the user's intent from a natural language description and generating an executable code snippet that satisfies this intent. While recent pretrained language models demonstrate remarkable…

Computation and Language · Computer Science 2023-05-29 Haau-Sing Li , Mohsen Mesgar , André F. T. Martins , Iryna Gurevych

As Pre-trained Language Models (PLMs), a popular approach for code intelligence, continue to grow in size, the computational cost of their usage has become prohibitively expensive. Prompt learning, a recent development in the field of…

Software Engineering · Computer Science 2024-03-21 Chengzhe Feng , Yanan Sun , Ke Li , Pan Zhou , Jiancheng Lv , Aojun Lu

To harness the power of large language models in safety-critical domains, we need to ensure the explainability of their predictions. However, despite the significant attention to model interpretability, there remains an unexplored domain in…

Computation and Language · Computer Science 2024-06-04 Kenza Amara , Rita Sevastjanova , Mennatallah El-Assady

Mathematical programming is widely employed across various sectors - such as logistics, energy, and workforce planning - to model and solve industrial optimisation problems, but its use requires substantial domain expertise. Large language…

Programming Languages · Computer Science 2026-05-29 Roberto Rossi , Steven D. Prestwich

Most existing text generation models follow the sequence-to-sequence paradigm. Generative Grammar suggests that humans generate natural language texts by learning language grammar. We propose a syntax-guided generation schema, which…

Computation and Language · Computer Science 2023-06-27 Yafu Li , Leyang Cui , Jianhao Yan , Yongjing Yin , Wei Bi , Shuming Shi , Yue Zhang

Given a sentence (e.g., "I like mangoes") and a constraint (e.g., sentiment flip), the goal of controlled text generation is to produce a sentence that adapts the input sentence to meet the requirements of the constraint (e.g., "I hate…

Computation and Language · Computer Science 2020-05-19 Ashutosh Kumar , Kabir Ahuja , Raghuram Vadapalli , Partha Talukdar

We consider the task of text generation in language models with constraints specified in natural language. To this end, we first create a challenging benchmark Cognac that provides as input to the model a topic with example text, along with…

Computation and Language · Computer Science 2022-12-21 Howard Chen , Huihan Li , Danqi Chen , Karthik Narasimhan

As the need for large-scale data processing grows, distributed programming frameworks like PySpark have become increasingly popular. However, the task of converting traditional, sequential code to distributed code remains a significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-11 Arun Sanjel , Bikram Khanal , Greg Speegle , Pablo Rivas

Paraphrase generation plays an essential role in natural language process (NLP), and it has many downstream applications. However, training supervised paraphrase models requires many annotated paraphrase pairs, which are usually costly to…

Computation and Language · Computer Science 2021-01-27 Kuan-Hao Huang , Kai-Wei Chang

Source code documentation is an important artifact for efficient software development. Code documentation could greatly benefit from automation since manual documentation is often labouring, resource and time-intensive. In this paper, we…

Software Engineering · Computer Science 2022-09-07 Junaed Younus Khan , Gias Uddin

Automated unit test generation is an established research field that has so far focused on statically-typed programming languages. The lack of type information in dynamically-typed programming languages, such as Python, inhibits test…

Software Engineering · Computer Science 2025-07-03 Lukas Krodinger , Stephan Lukasczyk , Gordon Fraser
‹ Prev 1 2 3 10 Next ›