Related papers: Early text classification: a Naive solution

Naive Bayes and Text Classification I - Introduction and Theory

Naive Bayes classifiers, a family of classifiers that are based on the popular Bayes' probability theorem, are known for creating simple yet well performing models, especially in the fields of document classification and disease prediction.…

Machine Learning · Computer Science 2017-02-16 Sebastian Raschka

Naive Bayes with Correlation Factor for Text Classification Problem

Naive Bayes estimator is widely used in text classification problems. However, it doesn't perform well with small-size training dataset. We propose a new method based on Naive Bayes estimator to solve this problem. A correlation factor is…

Information Retrieval · Computer Science 2019-05-16 Jiangning Chen , Zhibo Dai , Juntao Duan , Heinrich Matzinger , Ionel Popescu

A Survey of Na\"ive Bayes Machine Learning approach in Text Document Classification

Text Document classification aims in associating one or more predefined categories based on the likelihood suggested by the training set of labeled documents. Many machine learning algorithms play a vital role in training the system with…

Machine Learning · Computer Science 2010-03-10 Vidhya. K. A , G. Aghila

A hybrid learning algorithm for text classification

Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper…

Neural and Evolutionary Computing · Computer Science 2010-09-27 S. M. Kamruzzaman , Farhana Haider

Text Classification using Artificial Intelligence

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman

Text Classification using Association Rule with a Hybrid Concept of Naive Bayes Classifier and Genetic Algorithm

Text classification is the automated assignment of natural language texts to predefined categories based on their content. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman , Farhana Haider , Ahmed Ryadh Hasan

Text Classification using Data Mining

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman , Farhana Haider , Ahmed Ryadh Hasan

Text Categorization using Association Rule and Naive Bayes Classifier

As the amount of online text increases, the demand for text categorization to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive.…

Information Retrieval · Computer Science 2010-09-28 S M Kamruzzaman , Chowdhury Mofizur Rahman

Web Document Categorization Using Naive Bayes Classifier and Latent Semantic Analysis

A rapid growth of web documents due to heavy use of World Wide Web necessitates efficient techniques to efficiently classify the document on the web. It is thus produced High volumes of data per second with high diversity. Automatically…

Computation and Language · Computer Science 2020-06-03 Alireza Saleh Sedghpour , Mohammad Reza Saleh Sedghpour

Improved Naive Bayes with Mislabeled Data

Labeling mistakes are frequently encountered in real-world applications. If not treated well, the labeling mistakes can deteriorate the classification performances of a model seriously. To address this issue, we propose an improved Naive…

Machine Learning · Computer Science 2023-04-14 Qianhan Zeng , Yingqiu Zhu , Xuening Zhu , Feifei Wang , Weichen Zhao , Shuning Sun , Meng Su , Hansheng Wang

Classifying text using machine learning models and determining conversation drift

Text classification helps analyse texts for semantic meaning and relevance, by mapping the words against this hierarchy. An analysis of various types of texts is invaluable to understanding both their semantic meaning, as well as their…

Machine Learning · Computer Science 2022-11-16 Chaitanya Chadha , Vandit Gupta , Deepak Gupta , Ashish Khanna

The performance of multiple language models in identifying offensive language on social media

Text classification is an important topic in the field of natural language processing. It has been preliminarily applied in information retrieval, digital library, automatic abstracting, text filtering, word semantic discrimination and many…

Computation and Language · Computer Science 2023-12-20 Hao Li , Brandon Bennett

The Mysterious Optimality of Naive Bayes: Estimation of the Probability in the System of "Classifiers"

Bayes Classifiers are widely used currently for recognition, identification and knowledge discovery. The fields of application are, for example, image processing, medicine, chemistry (QSAR). But by mysterious way the Naive Bayes Classifier…

Computer Vision and Pattern Recognition · Computer Science 2013-12-30 Oleg Kupervasser , Alexsander Vardy

Text Classification using the Concept of Association Rule of Data Mining

As the amount of online text increases, the demand for text classification to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive.…

Machine Learning · Computer Science 2010-09-27 Chowdhury Mofizur Rahman , Ferdous Ahmed Sohel , Parvez Naushad , S. M. Kamruzzaman

Improved Text Language Identification for the South African Languages

Virtual assistants and text chatbots have recently been gaining popularity. Given the short message nature of text-based chat interactions, the language identification systems of these bots might only have 15 or 20 characters to make a…

Computation and Language · Computer Science 2017-11-02 Bernardt Duvenhage , Mfundo Ntini , Phala Ramonyai

Natural Language Processing Models for Robust Document Categorization

This article presents an evaluation of several machine learning methods applied to automated text classification, alongside the design of a demonstrative system for unbalanced document categorization and distribution. The study focuses on…

Computation and Language · Computer Science 2026-02-25 Radoslaw Roszczyk , Pawel Tecza , Maciej Stodolski , Krzysztof Siwek

Exact and efficient inference for Partial Bayes problems

Bayesian methods are useful for statistical inference. However, real-world problems can be challenging using Bayesian methods when the data analyst has only limited prior knowledge. In this paper we consider a class of problems, called…

Methodology · Statistics 2019-11-20 Yixuan Qiu , Lingsong Zhang , Chuanhai Liu

Differential Bias: On the Perceptibility of Stance Imbalance in Argumentation

Most research on natural language processing treats bias as an absolute concept: Based on a (probably complex) algorithmic analysis, a sentence, an article, or a text is classified as biased or not. Given the fact that for humans the…

Computation and Language · Computer Science 2022-10-14 Alonso Palomino , Martin Potthast , Khalid Al-Khatib , Benno Stein

Conical Classification For Computationally Efficient One-Class Topic Determination

As the Internet grows in size, so does the amount of text based information that exists. For many application spaces it is paramount to isolate and identify texts that relate to a particular topic. While one-class classification would be…

Artificial Intelligence · Computer Science 2021-11-02 Sameer Khanna

Text Classification of the Precursory Accelerating Seismicity Corpus: Inference on some Theoretical Trends in Earthquake Predictability Research from 1988 to 2018

Text analytics based on supervised machine learning classifiers has shown great promise in a multitude of domains, but has yet to be applied to Seismology. We test various standard models (Naive Bayes, k-Nearest Neighbors, Support Vector…

Computation and Language · Computer Science 2019-04-19 Arnaud Mignan