Related papers: Stronger Baselines for Grammatical Error Correctio…

Comparison of Grammatical Error Correction Using Back-Translation Models

Grammatical error correction (GEC) suffers from a lack of sufficient parallel data. Therefore, GEC studies have developed various methods to generate pseudo data, which comprise pairs of grammatical and artificially produced ungrammatical…

Computation and Language · Computer Science 2021-04-19 Aomi Koyama , Kengo Hotate , Masahiro Kaneko , Mamoru Komachi

Encoder-Decoder Models Can Benefit from Pre-trained Masked Language Models in Grammatical Error Correction

This paper investigates how to effectively incorporate a pre-trained masked language model (MLM), such as BERT, into an encoder-decoder (EncDec) model for grammatical error correction (GEC). The answer to this question is not as…

Computation and Language · Computer Science 2020-06-02 Masahiro Kaneko , Masato Mita , Shun Kiyono , Jun Suzuki , Kentaro Inui

Towards Minimal Supervision BERT-based Grammar Error Correction

Current grammatical error correction (GEC) models typically consider the task as sequence generation, which requires large amounts of annotated data and limit the applications in data-limited settings. We try to incorporate contextual…

Computation and Language · Computer Science 2020-01-13 Yiyuan Li , Antonios Anastasopoulos , Alan W Black

Type-Driven Multi-Turn Corrections for Grammatical Error Correction

Grammatical Error Correction (GEC) aims to automatically detect and correct grammatical errors. In this aspect, dominant models are trained by one-iteration learning while performing multiple iterations of corrections during inference.…

Computation and Language · Computer Science 2022-03-18 Shaopeng Lai , Qingyu Zhou , Jiali Zeng , Zhongli Li , Chao Li , Yunbo Cao , Jinsong Su

Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection

Pre-training a transformer-based model for the language modeling task in a large dataset and then fine-tuning it for downstream tasks has been found very useful in recent years. One major advantage of such pre-trained language models is…

Computation and Language · Computer Science 2020-11-17 Md Tahmid Rahman Laskar , Enamul Hoque , Jimmy Xiangji Huang

Efficient Grammatical Error Correction Via Multi-Task Training and Optimized Training Schedule

Progress in neural grammatical error correction (GEC) is hindered by the lack of annotated training data. Sufficient amounts of high-quality manually annotated data are not available, so recent research has relied on generating synthetic…

Computation and Language · Computer Science 2023-11-21 Andrey Bout , Alexander Podolskiy , Sergey Nikolenko , Irina Piontkovskaya

BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension

We present BART, a denoising autoencoder for pretraining sequence-to-sequence models. BART is trained by (1) corrupting text with an arbitrary noising function, and (2) learning a model to reconstruct the original text. It uses a standard…

Computation and Language · Computer Science 2019-10-31 Mike Lewis , Yinhan Liu , Naman Goyal , Marjan Ghazvininejad , Abdelrahman Mohamed , Omer Levy , Ves Stoyanov , Luke Zettlemoyer

Revisiting Grammatical Error Correction Evaluation and Beyond

Pretraining-based (PT-based) automatic evaluation metrics (e.g., BERTScore and BARTScore) have been widely used in several sentence generation tasks (e.g., machine translation and text summarization) due to their better correlation with…

Computation and Language · Computer Science 2022-11-04 Peiyuan Gong , Xuebo Liu , Heyan Huang , Min Zhang

GECToR -- Grammatical Error Correction: Tag, Not Rewrite

In this paper, we present a simple and efficient GEC sequence tagger using a Transformer encoder. Our system is pre-trained on synthetic data and then fine-tuned in two stages: first on errorful corpora, and second on a combination of…

Computation and Language · Computer Science 2020-06-01 Kostiantyn Omelianchuk , Vitaliy Atrasevych , Artem Chernodub , Oleksandr Skurzhanskyi

Chinese Grammatical Correction Using BERT-based Pre-trained Model

In recent years, pre-trained models have been extensively studied, and several downstream tasks have benefited from their utilization. In this study, we verify the effectiveness of two methods that incorporate a BERT-based pre-trained model…

Computation and Language · Computer Science 2020-11-05 Hongfei Wang , Michiki Kurosawa , Satoru Katsumata , Mamoru Komachi

Data Augmentation using Pre-trained Transformer Models

Language model based pre-trained models such as BERT have provided significant gains across different NLP tasks. In this paper, we study different types of transformer based pre-trained models such as auto-regressive models (GPT-2),…

Computation and Language · Computer Science 2021-02-02 Varun Kumar , Ashutosh Choudhary , Eunah Cho

Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation

Grammatical error correction (GEC) is a well-explored problem in English with many existing models and datasets. However, research on GEC in morphologically rich languages has been limited due to challenges such as data scarcity and…

Computation and Language · Computer Science 2023-11-10 Bashar Alhafni , Go Inoue , Christian Khairallah , Nizar Habash

Do Grammatical Error Correction Models Realize Grammatical Generalization?

There has been an increased interest in data generation approaches to grammatical error correction (GEC) using pseudo data. However, these approaches suffer from several issues that make them inconvenient for real-world deployment including…

Computation and Language · Computer Science 2021-06-08 Masato Mita , Hitomi Yanaka

Multi-head Sequence Tagging Model for Grammatical Error Correction

To solve the Grammatical Error Correction (GEC) problem , a mapping between a source sequence and a target one is needed, where the two differ only on few spans. For this reason, the attention has been shifted to the non-autoregressive or…

Computation and Language · Computer Science 2024-10-23 Kamal Al-Sabahi , Kang Yang , Wangwang Liu , Guanyu Jiang , Xian Li , Ming Yang

The Unreasonable Effectiveness of Transformer Language Models in Grammatical Error Correction

Recent work on Grammatical Error Correction (GEC) has highlighted the importance of language modeling in that it is certainly possible to achieve good performance by comparing the probabilities of the proposed edits. At the same time,…

Computation and Language · Computer Science 2019-06-06 Dimitrios Alikaniotis , Vipul Raheja

Neural Grammatical Error Correction with Finite State Transducers

Grammatical error correction (GEC) is one of the areas in natural language processing in which purely neural models have not yet superseded more traditional symbolic models. Hybrid systems combining phrase-based statistical machine…

Computation and Language · Computer Science 2019-04-08 Felix Stahlberg , Christopher Bryant , Bill Byrne

Data Weighted Training Strategies for Grammatical Error Correction

Recent progress in the task of Grammatical Error Correction (GEC) has been driven by addressing data sparsity, both through new methods for generating large and noisy pretraining data and through the publication of small and higher-quality…

Computation and Language · Computer Science 2020-09-10 Jared Lichtarge , Chris Alberti , Shankar Kumar

Interpretability for Language Learners Using Example-Based Grammatical Error Correction

Grammatical Error Correction (GEC) should not focus only on high accuracy of corrections but also on interpretability for language learning. However, existing neural-based GEC models mainly aim at improving accuracy, and their…

Computation and Language · Computer Science 2022-03-15 Masahiro Kaneko , Sho Takase , Ayana Niwa , Naoaki Okazaki

Improving Grammatical Error Correction via Pre-Training a Copy-Augmented Architecture with Unlabeled Data

Neural machine translation systems have become state-of-the-art approaches for Grammatical Error Correction (GEC) task. In this paper, we propose a copy-augmented architecture for the GEC task by copying the unchanged words from the source…

Computation and Language · Computer Science 2019-06-12 Wei Zhao , Liang Wang , Kewei Shen , Ruoyu Jia , Jingming Liu

Exploiting Pre-trained Encoder-Decoder Transformers for Sequence-to-Sequence Constituent Parsing

To achieve deep natural language understanding, syntactic constituent parsing plays a crucial role and is widely required by many artificial intelligence systems for processing both text and speech. A recent approach involves using standard…

Computation and Language · Computer Science 2026-05-14 Daniel Fernández-González , Cristina Outeiriño Cid