Related papers: Parallel Spell-Checking Algorithm Based on Yahoo! …

Context-sensitive Spelling Correction Using Google Web 1T 5-Gram Information

In computing, spell checking is the process of detecting and sometimes providing spelling suggestions for incorrectly spelled words in a text. Basically, a spell checker is a computer program that uses a dictionary of words to perform spell…

Computation and Language · Computer Science 2012-04-27 Youssef Bassil , Mohammad Alwani

OCR Post-Processing Error Correction Algorithm using Google Online Spelling Suggestion

With the advent of digital optical scanners, a lot of paper-based books, textbooks, magazines, articles, and documents are being transformed into an electronic version that can be manipulated by a computer. For this purpose, OCR, short for…

Computation and Language · Computer Science 2012-04-03 Youssef Bassil , Mohammad Alwani

A context sensitive real-time Spell Checker with language adaptability

We present a novel language adaptable spell checking system which detects spelling errors and suggests context sensitive corrections in real-time. We show that our system can be extended to new languages with minimal language-specific…

Computation and Language · Computer Science 2019-10-25 Prabhakar Gupta

Neural spell-checker: Beyond words with synthetic data generation

Spell-checkers are valuable tools that enhance communication by identifying misspelled words in written texts. Recent improvements in deep learning, and in particular in large language models, have opened new opportunities to improve…

Computation and Language · Computer Science 2024-11-01 Matej Klemen , Martin Božič , Špela Arhar Holdt , Marko Robnik-Šikonja

Towards a single proposal is spelling correction

The study presented here relies on the integrated use of different kinds of knowledge in order to improve first-guess accuracy in non-word context-sensitive correction for general unrestricted texts. State of the art spelling correction…

cmp-lg · Computer Science 2007-05-23 E. Agirre , K. Gojenola , K. Sarasola

ASR Context-Sensitive Error Correction Based on Microsoft N-Gram Dataset

At the present time, computers are employed to solve complex tasks and problems ranging from simple calculations to intensive digital image processing and intricate algorithmic optimization problems to computationally-demanding weather…

Computation and Language · Computer Science 2012-03-26 Youssef Bassil , Paul Semaan

Contextual Multilingual Spellchecker for User Queries

Spellchecking is one of the most fundamental and widely used search features. Correcting incorrectly spelled user queries not only enhances the user experience but is expected by the user. However, most widely available spellchecking…

Computation and Language · Computer Science 2024-04-16 Sanat Sharma , Josep Valls-Vargas , Tracy Holloway King , Francois Guerin , Chirag Arora

Correcting Real-Word Spelling Errors: A New Hybrid Approach

Spelling correction is one of the main tasks in the field of Natural Language Processing. Contrary to common spelling errors, real-word errors cannot be detected by conventional spelling correction methods. The real-word correction model…

Computation and Language · Computer Science 2023-02-14 Seyed MohammadSadegh Dashti , Amid Khatibi Bardsiri , Vahid Khatibi Bardsiri

Detecting Machine-Translated Paragraphs by Matching Similar Words

Machine-translated text plays an important role in modern life by smoothing communication from various communities using different languages. However, unnatural translation may lead to misunderstanding, a detector is thus needed to avoid…

Computation and Language · Computer Science 2019-04-25 Hoang-Quoc Nguyen-Son , Tran Phuong Thao , Seira Hidano , Shinsaku Kiyomoto

Detecting Cross-Lingual Plagiarism Using Simulated Word Embeddings

Cross-lingual plagiarism (CLP) occurs when texts written in one language are translated into a different language and used without acknowledging the original sources. One of the most common methods for detecting CLP requires online machine…

Computation and Language · Computer Science 2018-01-04 Victor Thompson

Detection is the central problem in real-word spelling correction

Real-word spelling correction differs from non-word spelling correction in its aims and its challenges. Here we show that the central problem in real-word spelling correction is detection. Methods from non-word spelling correction, which…

Computation and Language · Computer Science 2014-08-18 L. Amber Wilcox-O'Hearn

NeuSpell: A Neural Spelling Correction Toolkit

We introduce NeuSpell, an open-source toolkit for spelling correction in English. Our toolkit comprises ten different models, and benchmarks them on naturally occurring misspellings from multiple sources. We find that many systems do not…

Computation and Language · Computer Science 2020-10-22 Sai Muralidhar Jayanthi , Danish Pruthi , Graham Neubig

SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings

Contextual spelling correction models are an alternative to shallow fusion to improve automatic speech recognition (ASR) quality given user vocabulary. To deal with large user vocabularies, most of these models include candidate retrieval…

Computation and Language · Computer Science 2023-06-06 Alexandra Antonova , Evelina Bakhturina , Boris Ginsburg

A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages

Modern large language models demonstrate impressive capabilities in text generation and generalization. However, they often struggle with solving text editing tasks, particularly when it comes to correcting spelling errors and mistypings.…

Computation and Language · Computer Science 2023-09-14 Nikita Martynov , Mark Baushenko , Anastasia Kozlova , Katerina Kolomeytseva , Aleksandr Abramov , Alena Fenogenova

Methods for Detecting Paraphrase Plagiarism

Paraphrase plagiarism is one of the difficult challenges facing plagiarism detection systems. Paraphrasing occur when texts are lexically or syntactically altered to look different, but retain their original meaning. Most plagiarism…

Information Retrieval · Computer Science 2018-01-01 Victor Thompson

Cooperative Error Handling and Shallow Processing

This paper is concerned with the detection and correction of sub-sentential English text errors. Previous spelling programs, unless restricted to a very small set of words, have operated as post-processors. And to date, grammar checkers and…

cmp-lg · Computer Science 2016-08-31 Tanya Bowden

A Simple and Practical Approach to Improve Misspellings in OCR Text

The focus of our paper is the identification and correction of non-word errors in OCR text. Such errors may be the result of incorrect insertion, deletion, or substitution of a character, or the transposition of two adjacent characters…

Computation and Language · Computer Science 2021-06-24 Junxia Lin , Johannes Ledolter

Plagiarism Detection Using Machine Learning

Plagiarism is an act of using someone else's work without proper acknowledgment, and this sin is seen to cut across various arenas including the academy, publishing, and other similar arenas. The traditional methods of plagiarism detection…

Emerging Technologies · Computer Science 2024-12-10 Omraj Kamat , Tridib Ghosh , Kalaivani J , Angayarkanni V , Rama P

Comparing Neural- and N-Gram-Based Language Models for Word Segmentation

Word segmentation is the task of inserting or deleting word boundary characters in order to separate character sequences that correspond to words in some language. In this article we propose an approach based on a beam search algorithm and…

Computation and Language · Computer Science 2018-12-04 Yerai Doval , Carlos Gómez-Rodríguez

OCR Context-Sensitive Error Correction Based on Google Web 1T 5-Gram Data Set

Since the dawn of the computing era, information has been represented digitally so that it can be processed by electronic computers. Paper books and documents were abundant and widely being published at that time; and hence, there was a…

Computation and Language · Computer Science 2012-04-03 Youssef Bassil , Mohammad Alwani