Related papers: Arabic Language Text Classification Using Dependen…

200 papers

Unlike other languages, the Arabic language has a morphological complexity that makes Arabic sentiment analysis a challenging task. Moreover, the presence of dialects in Arabic texts has made the sentiment analysis task…

Computation and Language · Computer Science 2019-02-19 Omar Al-Harbi

Matching texts in highly inflected languages such as Arabic by a simple stemming strategy is unlikely to perform well. In this paper, we present an automatic text matching technique for inflectional languages, using Arabic as…

Computation and Language · Computer Science 2014-03-25 Tarek El-Shishtawy, Fatma El-Ghannam

Text categorization is the process of grouping documents into categories based on their contents. This process is important for making information retrieval easier, and it has become more important due to the huge amount of textual information available…

Information Retrieval · Computer Science 2015-01-08 Ashraf Odeh, Aymen Abu-Errub, Qusai Shambour, Nidal Turab

We present new methods for pruning and enhancing itemsets for text classification via association rule mining. Pruning methods are based on dependency syntax and enhancing methods are based on replacing words by their hyperonyms of…

Information Retrieval · Computer Science 2014-07-29 Yannis Haralambous, Philippe Lenca

Representation of the semantic information contained in words is needed for any Arabic text mining application. More precisely, the purpose is to better take into account the semantic dependencies between words expressed by the…

Computation and Language · Computer Science 2012-12-18 Hanane Froud, Abdelmonaim Lachkar, Said Alaoui Ouatik

In this paper, we address the problems of Arabic Text Classification and stemming using Transducers and Rational Kernels. We introduce a new stemming technique based on the use of Arabic patterns (Pattern Based Stemmer). Patterns are…

Computation and Language · Computer Science 2015-02-27 Attia Nehar, Djelloul Ziadi, Hadda Cherroun

Classifying text is a method for categorizing documents into pre-established groups. Text documents must be prepared and represented in a way that is appropriate for the algorithms used for data mining prior to classification. As a result,…

Computation and Language · Computer Science 2024-02-26 Esra'a Alhenawi, Ruba Abu Khurma, Pedro A. Castillo, Maribel G. Arenas

This paper presents an approach based on supervised machine learning methods to build a classifier that can identify text complexity in order to present Arabic language learners with texts suitable to their levels. The approach is based on…

Computation and Language · Computer Science 2021-09-20 Sadik Bessou, Ghozlane Chenni

Automatic readability assessment is relevant to building NLP applications for education, content analysis, and accessibility. However, Arabic readability assessment is a challenging task due to Arabic's morphological richness and limited…

Computation and Language · Computer Science 2024-07-04 Juan Piñeros Liberato, Bashar Alhafni, Muhamed Al Khalil, Nizar Habash

Arabic word segmentation is essential for a variety of NLP applications such as machine translation and information retrieval. Segmentation entails breaking words into their constituent stems, affixes and clitics. In this paper, we compare…

Computation and Language · Computer Science 2017-08-22 Mohamed Eldesouki, Younes Samih, Ahmed Abdelali, Mohammed Attia, Hamdy Mubarak, Kareem Darwish, Kallmeyer Laura

The complexities of the Arabic language in morphology, orthography and dialects make sentiment analysis for Arabic more challenging. Also, text feature extraction from short messages like tweets, in order to gauge the sentiment, makes this…

Computation and Language · Computer Science 2018-10-17 Abdulaziz M. Alayba, Vasile Palade, Matthew England, Rahat Iqbal

Research into statistical parsing for English has enjoyed over a decade of successful results. However, adapting these models to other languages has met with difficulties. Previous comparative work has shown that Modern Arabic is one of the…

Computation and Language · Computer Science 2015-10-27 Kais Dukes

Word segmentation plays a pivotal role in improving any Arabic NLP application. Therefore, a lot of research has been devoted to improving its accuracy. Off-the-shelf tools, however, are: i) complicated to use and ii) domain/dialect…

Computation and Language · Computer Science 2017-09-05 Hassan Sajjad, Fahim Dalvi, Nadir Durrani, Ahmed Abdelali, Yonatan Belinkov, Stephan Vogel

Recently, string kernels have obtained state-of-the-art results in various text classification tasks such as Arabic dialect identification or native language identification. In this paper, we apply two simple yet effective transductive…

Computation and Language · Computer Science 2018-09-03 Radu Tudor Ionescu, Andrei M. Butnaru

The rapid growth of the internet has increased the number of online texts. This led to the rapid growth of the number of online texts in the Arabic language. The enormous amount of text must be organized into classes to make the analysis…

Information Retrieval · Computer Science 2022-11-08 Sumaia Mohammed AL-Ghuribi, Shahrul Azman Mohd Noah

We observe a recent behaviour on social media, in which users intentionally remove consonantal dots from Arabic letters, in order to bypass content-classification algorithms. Content classification is typically done by fine-tuning…

Computation and Language · Computer Science 2021-11-19 Aviad Rom, Kfir Bar

Large language models (LLMs) perform strongly on many NLP tasks, but their ability to produce explicit linguistic structure remains unclear. We evaluate instruction-tuned LLMs on two structured prediction tasks for Standard Arabic:…

Computation and Language · Computer Science 2026-03-18 Mohamed Adel, Bashar Alhafni, Nizar Habash

We investigate different approaches for dialect identification in Arabic broadcast speech, using phonetic and lexical features obtained from a speech recognition system, and acoustic features using the i-vector framework. We studied both…

Computation and Language · Computer Science 2016-08-12 Ahmed Ali, Najim Dehak, Patrick Cardinal, Sameer Khurana, Sree Harsha Yella, James Glass, Peter Bell, Steve Renals

Today, text classification has become a critical task for concerned individuals for numerous purposes. Hence, several studies have been conducted to develop automatic text classification for national and international languages. However, the…

Computation and Language · Computer Science 2023-05-09 Jawid Ahmad Baktash, Mursal Dawodi, Mohammad Zarif Joya, Nematullah Hassanzada

Social media such as Twitter and Facebook have generated a growing number of comments that contain users' opinions. Sentiment analysis research deals with these comments to extract opinions which are positive or negative. Arabic…

Computation and Language · Computer Science 2021-01-11 Anwar Alnawas, Nursal ARICI