Related papers: Stronger Baselines for Grammatical Error Correctio…

200 papers

Grammatical error correction (GEC) suffers from a lack of sufficient parallel data. Therefore, GEC studies have developed various methods to generate pseudo data, which comprise pairs of grammatical and artificially produced ungrammatical…

Computation and Language · Computer Science · 2021-04-19 · Aomi Koyama, Kengo Hotate, Masahiro Kaneko, Mamoru Komachi

This paper investigates how to effectively incorporate a pre-trained masked language model (MLM), such as BERT, into an encoder-decoder (EncDec) model for grammatical error correction (GEC). The answer to this question is not as…

Computation and Language · Computer Science · 2020-06-02 · Masahiro Kaneko, Masato Mita, Shun Kiyono, Jun Suzuki, Kentaro Inui

Current grammatical error correction (GEC) models typically consider the task as sequence generation, which requires large amounts of annotated data and limits their application in data-limited settings. We try to incorporate contextual…

Computation and Language · Computer Science · 2020-01-13 · Yiyuan Li, Antonios Anastasopoulos, Alan W Black

Grammatical Error Correction (GEC) aims to automatically detect and correct grammatical errors. In this aspect, dominant models are trained by one-iteration learning while performing multiple iterations of corrections during inference.…

Computation and Language · Computer Science · 2022-03-18 · Shaopeng Lai, Qingyu Zhou, Jiali Zeng, Zhongli Li, Chao Li, Yunbo Cao, Jinsong Su

Pre-training a transformer-based model for the language modeling task on a large dataset and then fine-tuning it for downstream tasks has been found very useful in recent years. One major advantage of such pre-trained language models is…

Computation and Language · Computer Science · 2020-11-17 · Md Tahmid Rahman Laskar, Enamul Hoque, Jimmy Xiangji Huang

Progress in neural grammatical error correction (GEC) is hindered by the lack of annotated training data. Sufficient amounts of high-quality manually annotated data are not available, so recent research has relied on generating synthetic…

Computation and Language · Computer Science · 2023-11-21 · Andrey Bout, Alexander Podolskiy, Sergey Nikolenko, Irina Piontkovskaya

We present BART, a denoising autoencoder for pretraining sequence-to-sequence models. BART is trained by (1) corrupting text with an arbitrary noising function, and (2) learning a model to reconstruct the original text. It uses a standard…

Computation and Language · Computer Science · 2019-10-31 · Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Ves Stoyanov, Luke Zettlemoyer

Pretraining-based (PT-based) automatic evaluation metrics (e.g., BERTScore and BARTScore) have been widely used in several sentence generation tasks (e.g., machine translation and text summarization) due to their better correlation with…

Computation and Language · Computer Science · 2022-11-04 · Peiyuan Gong, Xuebo Liu, Heyan Huang, Min Zhang

In this paper, we present a simple and efficient GEC sequence tagger using a Transformer encoder. Our system is pre-trained on synthetic data and then fine-tuned in two stages: first on errorful corpora, and second on a combination of…

Computation and Language · Computer Science · 2020-06-01 · Kostiantyn Omelianchuk, Vitaliy Atrasevych, Artem Chernodub, Oleksandr Skurzhanskyi

In recent years, pre-trained models have been extensively studied, and several downstream tasks have benefited from their utilization. In this study, we verify the effectiveness of two methods that incorporate a BERT-based pre-trained model…

Computation and Language · Computer Science · 2020-11-05 · Hongfei Wang, Michiki Kurosawa, Satoru Katsumata, Mamoru Komachi

Language model based pre-trained models such as BERT have provided significant gains across different NLP tasks. In this paper, we study different types of transformer based pre-trained models such as auto-regressive models (GPT-2),…

Computation and Language · Computer Science · 2021-02-02 · Varun Kumar, Ashutosh Choudhary, Eunah Cho

Grammatical error correction (GEC) is a well-explored problem in English with many existing models and datasets. However, research on GEC in morphologically rich languages has been limited due to challenges such as data scarcity and…

Computation and Language · Computer Science · 2023-11-10 · Bashar Alhafni, Go Inoue, Christian Khairallah, Nizar Habash

There has been an increased interest in data generation approaches to grammatical error correction (GEC) using pseudo data. However, these approaches suffer from several issues that make them inconvenient for real-world deployment including…

Computation and Language · Computer Science · 2021-06-08 · Masato Mita, Hitomi Yanaka

To solve the Grammatical Error Correction (GEC) problem, a mapping between a source sequence and a target one is needed, where the two differ in only a few spans. For this reason, attention has shifted to the non-autoregressive or…

Computation and Language · Computer Science · 2024-10-23 · Kamal Al-Sabahi, Kang Yang, Wangwang Liu, Guanyu Jiang, Xian Li, Ming Yang

Recent work on Grammatical Error Correction (GEC) has highlighted the importance of language modeling in that it is certainly possible to achieve good performance by comparing the probabilities of the proposed edits. At the same time,…

Computation and Language · Computer Science · 2019-06-06 · Dimitrios Alikaniotis, Vipul Raheja

Grammatical error correction (GEC) is one of the areas in natural language processing in which purely neural models have not yet superseded more traditional symbolic models. Hybrid systems combining phrase-based statistical machine…

Computation and Language · Computer Science · 2019-04-08 · Felix Stahlberg, Christopher Bryant, Bill Byrne

Recent progress in the task of Grammatical Error Correction (GEC) has been driven by addressing data sparsity, both through new methods for generating large and noisy pretraining data and through the publication of small and higher-quality…

Computation and Language · Computer Science · 2020-09-10 · Jared Lichtarge, Chris Alberti, Shankar Kumar

Grammatical Error Correction (GEC) should not focus only on high accuracy of corrections but also on interpretability for language learning. However, existing neural-based GEC models mainly aim at improving accuracy, and their…

Computation and Language · Computer Science · 2022-03-15 · Masahiro Kaneko, Sho Takase, Ayana Niwa, Naoaki Okazaki

Neural machine translation systems have become state-of-the-art approaches for the Grammatical Error Correction (GEC) task. In this paper, we propose a copy-augmented architecture for the GEC task by copying the unchanged words from the source…

Computation and Language · Computer Science · 2019-06-12 · Wei Zhao, Liang Wang, Kewei Shen, Ruoyu Jia, Jingming Liu

To achieve deep natural language understanding, syntactic constituent parsing plays a crucial role and is widely required by many artificial intelligence systems for processing both text and speech. A recent approach involves using standard…

Computation and Language · Computer Science · 2026-05-14 · Daniel Fernández-González, Cristina Outeiriño Cid