Related papers: Automatic Error Type Annotation for Arabic

Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection

Automated Essay Scoring (AES) plays a crucial role in assessing language learners' writing quality, reducing grading workload, and providing real-time feedback. The lack of annotated essay datasets inhibits the development of Arabic AES…

Computation and Language · Computer Science 2025-06-11 Chatrine Qwaider , Bashar Alhafni , Kirill Chirkunov , Nizar Habash , Ted Briscoe

Arabic Spelling Correction using Supervised Learning

In this work, we address the problem of spelling correction in the Arabic language utilizing the new corpus provided by QALB (Qatar Arabic Language Bank) project which is an annotated corpus of sentences with errors and their corrections.…

Machine Learning · Computer Science 2014-10-01 Youssef Hassan , Mohamed Aly , Amir Atiya

Multi-Level Analysis and Annotation of Arabic Corpora for Text-to-Sign Language MT

In this paper, we present an ongoing effort in lexical semantic analysis and annotation of Modern Standard Arabic (MSA) text, a semi automatic annotation tool concerned with the morphologic, syntactic, and semantic levels of description.

Computation and Language · Computer Science 2016-05-25 Abdelaziz Lakhfif , Mohammed T. Laskri , Eric Atwell

Automated essay scoring in Arabic: a dataset and analysis of a BERT-based system

Automated Essay Scoring (AES) holds significant promise in the field of education, helping educators to mark larger volumes of essays and provide timely feedback. However, Arabic AES research has been limited by the lack of publicly…

Computation and Language · Computer Science 2024-07-17 Rayed Ghazawi , Edwin Simpson

Strategies for Arabic Readability Modeling

Automatic readability assessment is relevant to building NLP applications for education, content analysis, and accessibility. However, Arabic readability assessment is a challenging task due to Arabic's morphological richness and limited…

Computation and Language · Computer Science 2024-07-04 Juan Piñeros Liberato , Bashar Alhafni , Muhamed Al Khalil , Nizar Habash

Enhancing Text Editing for Grammatical Error Correction: Arabic as a Case Study

Text editing frames grammatical error correction (GEC) as a sequence tagging problem, where edit tags are assigned to input tokens, and applying these edits results in the corrected text. This approach has gained attention for its…

Computation and Language · Computer Science 2025-06-03 Bashar Alhafni , Nizar Habash

ArbESC+: Arabic Enhanced Edit Selection System Combination for Grammatical Error Correction Resolving conflict and improving system combination in Arabic GEC

Grammatical Error Correction (GEC) is an important aspect of natural language processing. Arabic has a complicated morphological and syntactic structure, posing a greater challenge than other languages. Even though modern neural models have…

Computation and Language · Computer Science 2025-11-19 Ahlam Alrehili , Areej Alhothali

Developing a New Approach for Arabic Morphological Analysis and Generation

Arabic morphological analysis is one of the essential stages in Arabic Natural Language Processing. In this paper we present an approach for Arabic morphological analysis. This approach is based on Arabic morphological automaton (AMAUT).…

Computation and Language · Computer Science 2011-01-31 Mourad Gridach , Noureddine Chenfour

LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring

Automated Essay Scoring (AES) has gained increasing attention in recent years, yet research on Arabic AES remains limited due to the lack of publicly available datasets. To address this, we introduce LAILA, the largest publicly available…

Computation and Language · Computer Science 2026-01-27 May Bashendy , Walid Massoud , Sohaila Eltanbouly , Salam Albatarni , Marwan Sayed , Abrar Abir , Houda Bouamor , Tamer Elsayed

ChatGPT for Arabic Grammatical Error Correction

Recently, large language models (LLMs) fine-tuned to follow human instruction have exhibited significant capabilities in various English NLP tasks. However, their performance in grammatical error correction (GEC) tasks, particularly in…

Artificial Intelligence · Computer Science 2023-08-10 Sang Yun Kwon , Gagan Bhatia , El Moatez Billah Nagoud , Muhammad Abdul-Mageed

AraT5: Text-to-Text Transformers for Arabic Language Generation

Transfer learning with a unified Transformer framework (T5) that converts all language problems into a text-to-text format was recently proposed as a simple and effective transfer learning approach. Although a multilingual version of the T5…

Computation and Language · Computer Science 2022-03-16 El Moatez Billah Nagoudi , AbdelRahim Elmadany , Muhammad Abdul-Mageed

Automatic Difficulty Classification of Arabic Sentences

In this paper, we present a Modern Standard Arabic (MSA) Sentence difficulty classifier, which predicts the difficulty of sentences for language learners using either the CEFR proficiency levels or the binary classification as simple or…

Computation and Language · Computer Science 2021-03-09 Nouran Khallaf , Serge Sharoff

AraSpell: A Deep Learning Approach for Arabic Spelling Correction

Spelling correction is the task of identifying spelling mistakes, typos, and grammatical mistakes in a given text and correcting them according to their context and grammatical structure. This work introduces "AraSpell," a framework for…

Computation and Language · Computer Science 2024-05-14 Mahmoud Salhab , Faisal Abu-Khzam

Question-Answering (QA) Model for a Personalized Learning Assistant for Arabic Language

This paper describes the creation, optimization, and assessment of a question-answering (QA) model for a personalized learning assistant that uses BERT transformers customized for the Arabic language. The model was particularly finetuned on…

Computation and Language · Computer Science 2024-06-14 Mohammad Sammoudi , Ahmad Habaybeh , Huthaifa I. Ashqar , Mohammed Elhenawy

Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation

Grammatical error correction (GEC) is a well-explored problem in English with many existing models and datasets. However, research on GEC in morphologically rich languages has been limited due to challenges such as data scarcity and…

Computation and Language · Computer Science 2023-11-10 Bashar Alhafni , Go Inoue , Christian Khairallah , Nizar Habash

ArTST: Arabic Text and Speech Transformer

We present ArTST, a pre-trained Arabic text and speech transformer for supporting open-source speech technologies for the Arabic language. The model architecture follows the unified-modal framework, SpeechT5, that was recently released for…

Computation and Language · Computer Science 2023-10-26 Hawau Olamide Toyin , Amirbek Djanibekov , Ajinkya Kulkarni , Hanan Aldarmaki

Beyond English: Evaluating LLMs for Arabic Grammatical Error Correction

Large language models (LLMs) finetuned to follow human instruction have recently exhibited significant capabilities in various English NLP tasks. However, their performance in grammatical error correction (GEC), especially on languages…

Computation and Language · Computer Science 2023-12-15 Sang Yun Kwon , Gagan Bhatia , El Moatez Billah Nagoudi , Muhammad Abdul-Mageed

BERT-Based Arabic Social Media Author Profiling

We report our models for detecting age, language variety, and gender from social media data in the context of the Arabic author profiling and deception detection shared task (APDA). We build simple models based on pre-trained bidirectional…

Computation and Language · Computer Science 2019-11-01 Chiyu Zhang , Muhammad Abdul-Mageed

Open Automatic Speech Recognition Models for Classical and Modern Standard Arabic

Despite Arabic being one of the most widely spoken languages, the development of Arabic Automatic Speech Recognition (ASR) systems faces significant challenges due to the language's complexity, and only a limited number of public Arabic ASR…

Computation and Language · Computer Science 2025-07-21 Lilit Grigoryan , Nikolay Karpov , Enas Albasiri , Vitaly Lavrukhin , Boris Ginsburg

MADARi: A Web Interface for Joint Arabic Morphological Annotation and Spelling Correction

In this paper, we introduce MADARi, a joint morphological annotation and spelling correction system for texts in Standard and Dialectal Arabic. The MADARi framework provides intuitive interfaces for annotating text and managing the…

Computation and Language · Computer Science 2018-08-28 Ossama Obeid , Salam Khalifa , Nizar Habash , Houda Bouamor , Wajdi Zaghouani , Kemal Oflazer