Related papers: Deep Reinforcement Learning for Programming Langua…

Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

Recent work has described neural-network-based agents that are trained with reinforcement learning (RL) to execute language-like commands in simulated worlds, as a step towards an intelligent agent or robot that can be instructed by human…

Computation and Language · Computer Science 2020-05-20 Felix Hill , Sona Mokra , Nathaniel Wong , Tim Harley

Language-Conditioned Reinforcement Learning to Solve Misunderstandings with Action Corrections

Human-to-human conversation is not just talking and listening. It is an incremental process where participants continually establish a common understanding to rule out misunderstandings. Current language understanding methods for…

Machine Learning · Computer Science 2022-11-21 Frank Röder , Manfred Eppe

Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Program synthesis is the task of automatically generating a program consistent with a specification. Recent years have seen proposal of a number of neural approaches for program synthesis, many of which adopt a sequence generation paradigm…

Machine Learning · Computer Science 2018-05-23 Rudy Bunel , Matthew Hausknecht , Jacob Devlin , Rishabh Singh , Pushmeet Kohli

Repair Is Nearly Generation: Multilingual Program Repair with LLMs

Most programmers make mistakes when writing code. Some of these mistakes are small and require few edits to the original program -- a class of errors recently termed last mile mistakes. These errors break the flow for experienced developers…

Software Engineering · Computer Science 2022-12-06 Harshit Joshi , José Cambronero , Sumit Gulwani , Vu Le , Ivan Radicek , Gust Verbruggen

SYNFIX: Automatically Fixing Syntax Errors using Compiler Diagnostics

Beginning programmers struggle with the complex grammar of modern programming languages like Java, and make lot of syntax errors. The diagnostic syntax error messages from compilers and IDEs are sometimes useful, but often the messages are…

Software Engineering · Computer Science 2022-10-14 Toufique Ahmed , Noah Rose Ledesma , Premkumar Devanbu

Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces

Online question-and-answer (Q\&A) systems based on the Large Language Model (LLM) have progressively diverged from recreational to professional use. This paper proposed a Multi-Agent framework with environmentally reinforcement learning…

Software Engineering · Computer Science 2024-09-05 Jiapeng Yu , Yuqian Wu , Yajing Zhan , Wenhao Guo , Zhou Xu , Raymond Lee

Learning to solve arithmetic problems with a virtual abacus

Acquiring mathematical skills is considered a key challenge for modern Artificial Intelligence systems. Inspired by the way humans discover numerical knowledge, here we introduce a deep reinforcement learning framework that allows to…

Machine Learning · Computer Science 2023-01-18 Flavio Petruzzellis , Ling Xuan Chen , Alberto Testolin

Automated Correction for Syntax Errors in Programming Assignments using Recurrent Neural Networks

We present a method for automatically generating repair feedback for syntax errors for introductory programming problems. Syntax errors constitute one of the largest classes of errors (34%) in our dataset of student submissions obtained…

Programming Languages · Computer Science 2016-03-22 Sahil Bhatia , Rishabh Singh

Unveiling the Decision-Making Process in Reinforcement Learning with Genetic Programming

Despite tremendous progress, machine learning and deep learning still suffer from incomprehensible predictions. Incomprehensibility, however, is not an option for the use of (deep) reinforcement learning in the real world, as unpredictable…

Artificial Intelligence · Computer Science 2024-07-23 Manuel Eberhardinger , Florian Rupp , Johannes Maucher , Setareh Maghsudi

Teaching LLMs Human-Like Editing of Inappropriate Argumentation via Reinforcement Learning

Editing human-written text has become a standard use case of large language models (LLMs), for example, to make one's arguments more appropriate for a discussion. Comparing human to LLM-generated edits, however, we observe a mismatch in…

Computation and Language · Computer Science 2026-04-15 Timon Ziegenbein , Maja Stahl , Henning Wachsmuth

Deep Reinforcement Learning, a textbook

Deep reinforcement learning has gathered much attention recently. Impressive results were achieved in activities as diverse as autonomous driving, game playing, molecular recombination, and robotics. In all these fields, computer programs…

Artificial Intelligence · Computer Science 2023-04-25 Aske Plaat

Guiding Policies with Language via Meta-Learning

Behavioral skills or policies for autonomous agents are conventionally learned from reward functions, via reinforcement learning, or from demonstrations, via imitation learning. However, both modes of task specification have their…

Machine Learning · Computer Science 2019-01-30 John D. Co-Reyes , Abhishek Gupta , Suvansh Sanjeev , Nick Altieri , Jacob Andreas , John DeNero , Pieter Abbeel , Sergey Levine

Reflexion: Language Agents with Verbal Reinforcement Learning

Large language models (LLMs) have been increasingly used to interact with external environments (e.g., games, compilers, APIs) as goal-driven agents. However, it remains challenging for these language agents to quickly and efficiently learn…

Artificial Intelligence · Computer Science 2023-10-11 Noah Shinn , Federico Cassano , Edward Berman , Ashwin Gopinath , Karthik Narasimhan , Shunyu Yao

Meta-Reinforcement Learning via Language Instructions

Although deep reinforcement learning has recently been very successful at learning complex behaviors, it requires a tremendous amount of data to learn a task. One of the fundamental reasons causing this limitation lies in the nature of the…

Robotics · Computer Science 2022-09-19 Zhenshan Bing , Alexander Koch , Xiangtong Yao , Kai Huang , Alois Knoll

Teaching LLMs for Step-Level Automatic Math Correction via Reinforcement Learning

Automatic math correction aims to check students' solutions to mathematical problems via artificial intelligence technologies. Most existing studies focus on judging the final answer at the problem level, while they ignore detailed feedback…

Computation and Language · Computer Science 2025-03-25 Junsong Li , Jie Zhou , Yutao Yang , Bihao Zhan , Qianjun Pan , Yuyang Ding , Qin Chen , Jiang Bo , Xin Lin , Liang He

Mapping Language to Programs using Multiple Reward Components with Inverse Reinforcement Learning

Mapping natural language instructions to programs that computers can process is a fundamental challenge. Existing approaches focus on likelihood-based training or using reinforcement learning to fine-tune models based on a single reward. In…

Computation and Language · Computer Science 2021-10-05 Sayan Ghosh , Shashank Srivastava

SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning

Recent advancements in large language models (LLMs) have shown very impressive capabilities in code generation across many programming languages. However, even state-of-the-art LLMs generate programs that contains syntactic errors and fail…

Software Engineering · Computer Science 2025-11-25 David Jiahao Fu , Aryan Gupta , Aaron Councilman , David Grove , Yu-Xiong Wang , Vikram Adve

Optimizing Quantum Error Correction Codes with Reinforcement Learning

Quantum error correction is widely thought to be the key to fault-tolerant quantum computation. However, determining the most suited encoding for unknown error channels or specific laboratory setups is highly challenging. Here, we present a…

Quantum Physics · Physics 2020-04-10 Hendrik Poulsen Nautrup , Nicolas Delfosse , Vedran Dunjko , Hans J. Briegel , Nicolai Friis

RTLFixer: Automatically Fixing RTL Syntax Errors with Large Language Models

This paper presents RTLFixer, a novel framework enabling automatic syntax errors fixing for Verilog code with Large Language Models (LLMs). Despite LLM's promising capabilities, our analysis indicates that approximately 55% of errors in…

Hardware Architecture · Computer Science 2024-05-22 Yun-Da Tsai , Mingjie Liu , Haoxing Ren

Reinforcement Learning for Automated Cybersecurity Penetration Testing

This paper aims to provide an innovative machine learning-based solution to automate security testing tasks for web applications, ensuring the correct functioning of all components while reducing project maintenance costs. Reinforcement…

Cryptography and Security · Computer Science 2025-07-08 Daniel López-Montero , José L. Álvarez-Aldana , Alicia Morales-Martínez , Marta Gil-López , Juan M. Auñón García