Related papers: Generating Sequences by Learning to Self-Correct

Retrieve and Refine: Improved Sequence Generation Models For Dialogue

Sequence generation models for dialogue are known to have several problems: they tend to produce short, generic sentences that are uninformative and unengaging. Retrieval models on the other hand can surface interesting responses, but are…

Computation and Language · Computer Science 2018-09-07 Jason Weston , Emily Dinan , Alexander H. Miller

In Tree Structure Should Sentence Be Generated

Generative models reliant on sequential autoregression have been at the forefront of language generation for an extensive period, particularly following the introduction of widely acclaimed transformers. Despite its excellent performance,…

Computation and Language · Computer Science 2024-06-21 Yaguang Li , Xin Chen

The Unbearable Weight of Generating Artificial Errors for Grammatical Error Correction

In recent years, sequence-to-sequence models have been very effective for end-to-end grammatical error correction (GEC). As creating human-annotated parallel corpus for GEC is expensive and time-consuming, there has been work on artificial…

Computation and Language · Computer Science 2019-07-23 Phu Mon Htut , Joel Tetreault

In-sample Curriculum Learning by Sequence Completion for Natural Language Generation

Curriculum learning has shown promising improvements in multiple domains by training machine learning models from easy samples to hard ones. Previous works which either design rules or train models for scoring the difficulty highly rely on…

Computation and Language · Computer Science 2023-05-24 Qi Jia , Yizhu Liu , Haifeng Tang , Kenny Q. Zhu

Self-Correcting Large Language Models: Generation vs. Multiple Choice

Large language models have recently demonstrated remarkable abilities to self-correct their responses through iterative refinement, often referred to as self-consistency or self-reflection. However, the dynamics of this self-correction…

Computation and Language · Computer Science 2025-11-13 Hossein A. Rahmani , Satyapriya Krishna , Xi Wang , Mohammadmehdi Naghiaei , Emine Yilmaz

Sequence-to-Action: Grammatical Error Correction with Action Guided Sequence Generation

The task of Grammatical Error Correction (GEC) has received remarkable attention with wide applications in Natural Language Processing (NLP) in recent years. While one of the key principles of GEC is to keep the correct parts unchanged and…

Computation and Language · Computer Science 2022-05-24 Jiquan Li , Junliang Guo , Yongxin Zhu , Xin Sheng , Deqiang Jiang , Bo Ren , Linli Xu

Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings

We present a natural language generator based on the sequence-to-sequence approach that can be trained to produce natural language strings as well as deep syntax dependency trees from input dialogue acts, and we use it to directly compare…

Computation and Language · Computer Science 2017-09-18 Ondřej Dušek , Filip Jurčíček

Small Language Models Need Strong Verifiers to Self-Correct Reasoning

Self-correction has emerged as a promising solution to boost the reasoning performance of large language models (LLMs), where LLMs refine their solutions using self-generated critiques that pinpoint the errors. This work explores whether…

Computation and Language · Computer Science 2024-06-07 Yunxiang Zhang , Muhammad Khalifa , Lajanugen Logeswaran , Jaekyeom Kim , Moontae Lee , Honglak Lee , Lu Wang

Self-Correcting Code Generation Using Small Language Models

Self-correction has demonstrated potential in code generation by allowing language models to revise and improve their outputs through successive refinement. Recent studies have explored prompting-based strategies that incorporate…

Computation and Language · Computer Science 2025-08-26 Jeonghun Cho , Deokhyung Kang , Hyounghun Kim , Gary Geunbae Lee

Self-Correction Makes LLMs Better Parsers

Large language models (LLMs) have achieved remarkable success across various natural language processing (NLP) tasks. However, recent studies suggest that they still face challenges in performing fundamental NLP tasks essential for deep…

Computation and Language · Computer Science 2025-04-22 Ziyan Zhang , Yang Hou , Chen Gong , Zhenghua Li

Language Models can Self-Lengthen to Generate Long Texts

Recent advancements in Large Language Models (LLMs) have significantly enhanced their ability to process long contexts, yet a notable gap remains in generating long, aligned outputs. This limitation stems from a training gap where…

Computation and Language · Computer Science 2024-11-01 Shanghaoran Quan , Tianyi Tang , Bowen Yu , An Yang , Dayiheng Liu , Bofei Gao , Jianhong Tu , Yichang Zhang , Jingren Zhou , Junyang Lin

Synthetic Data Generation for Grammatical Error Correction with Tagged Corruption Models

Synthetic data generation is widely known to boost the accuracy of neural grammatical error correction (GEC) systems, but existing methods often lack diversity or are too simplistic to generate the broad range of grammatical errors made by…

Computation and Language · Computer Science 2021-05-28 Felix Stahlberg , Shankar Kumar

Large Language Models Cannot Self-Correct Reasoning Yet

Large Language Models (LLMs) have emerged as a groundbreaking technology with their unparalleled text generation capabilities across various applications. Nevertheless, concerns persist regarding the accuracy and appropriateness of their…

Computation and Language · Computer Science 2024-03-15 Jie Huang , Xinyun Chen , Swaroop Mishra , Huaixiu Steven Zheng , Adams Wei Yu , Xinying Song , Denny Zhou

Grammatical Error Correction as GAN-like Sequence Labeling

In Grammatical Error Correction (GEC), sequence labeling models enjoy fast inference compared to sequence-to-sequence models; however, inference in sequence labeling GEC models is an iterative process, as sentences are passed to the model…

Computation and Language · Computer Science 2021-06-01 Kevin Parnow , Zuchao Li , Hai Zhao

Training-Free Self-Correction for Multimodal Masked Diffusion Models

Masked diffusion models have emerged as a powerful framework for text and multimodal generation. However, their sampling procedure updates multiple tokens simultaneously and treats generated tokens as immutable, which may lead to error…

Machine Learning · Statistics 2026-02-04 Yidong Ouyang , Panwen Hu , Zhengyan Wan , Zhe Wang , Liyan Xie , Dmitriy Bespalov , Ying Nian Wu , Guang Cheng , Hongyuan Zha , Qiang Sun

Breaking Writer's Block: Low-cost Fine-tuning of Natural Language Generation Models

It is standard procedure these days to solve Information Extraction task by fine-tuning large pre-trained language models. This is not the case for generation task, which relies on a variety of techniques for controlled language generation.…

Computation and Language · Computer Science 2021-03-03 Alexandre Duval , Thomas Lamson , Gael de Leseleuc de Kerouara , Matthias Gallé

Leveraging Grammar and Reinforcement Learning for Neural Program Synthesis

Program synthesis is the task of automatically generating a program consistent with a specification. Recent years have seen proposal of a number of neural approaches for program synthesis, many of which adopt a sequence generation paradigm…

Machine Learning · Computer Science 2018-05-23 Rudy Bunel , Matthew Hausknecht , Jacob Devlin , Rishabh Singh , Pushmeet Kohli

Self-Evaluation Improves Selective Generation in Large Language Models

Safe deployment of large language models (LLMs) may benefit from a reliable method for assessing their generated content to determine when to abstain or to selectively generate. While likelihood-based metrics such as perplexity are widely…

Computation and Language · Computer Science 2023-12-18 Jie Ren , Yao Zhao , Tu Vu , Peter J. Liu , Balaji Lakshminarayanan

Grammatical Error Generation Based on Translated Fragments

We perform neural machine translation of sentence fragments in order to create large amounts of training data for English grammatical error correction. Our method aims at simulating mistakes made by second language learners, and produces a…

Computation and Language · Computer Science 2021-04-21 Eetu Sjöblom , Mathias Creutz , Teemu Vahtola

SERC: LDPC-Inspired Semantic Error Correction for Retrieval-Augmented Generation

While Large Language Models (LLMs) have demonstrated remarkable capabilities, their reliability is significantly compromised by hallucinations. Existing intrinsic self-correction methods attempt to address this, but often fail due to…

Computation and Language · Computer Science 2026-05-29 Gyumin Kim , Juhwan Park , Jaeha Kim , Seunggyun Han , Kyungrak Son , Ikbeom Jang