Related papers: Early text classification: a Naive solution

200 papers

Naive Bayes classifiers, a family of classifiers that are based on the popular Bayes' probability theorem, are known for creating simple yet well performing models, especially in the fields of document classification and disease prediction.…

Machine Learning · Computer Science 2017-02-16 Sebastian Raschka

Naive Bayes estimator is widely used in text classification problems. However, it doesn't perform well with small-size training dataset. We propose a new method based on Naive Bayes estimator to solve this problem. A correlation factor is…

Information Retrieval · Computer Science 2019-05-16 Jiangning Chen, Zhibo Dai, Juntao Duan, Heinrich Matzinger, Ionel Popescu

Text Document classification aims in associating one or more predefined categories based on the likelihood suggested by the training set of labeled documents. Many machine learning algorithms play a vital role in training the system with…

Machine Learning · Computer Science 2010-03-10 Vidhya. K. A, G. Aghila

Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper…

Neural and Evolutionary Computing · Computer Science 2010-09-27 S. M. Kamruzzaman, Farhana Haider

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman

Text classification is the automated assignment of natural language texts to predefined categories based on their content. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman, Farhana Haider, Ahmed Ryadh Hasan

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman, Farhana Haider, Ahmed Ryadh Hasan

As the amount of online text increases, the demand for text categorization to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive.…

Information Retrieval · Computer Science 2010-09-28 S M Kamruzzaman, Chowdhury Mofizur Rahman

A rapid growth of web documents due to heavy use of World Wide Web necessitates efficient techniques to efficiently classify the document on the web. It is thus produced High volumes of data per second with high diversity. Automatically…

Computation and Language · Computer Science 2020-06-03 Alireza Saleh Sedghpour, Mohammad Reza Saleh Sedghpour

Labeling mistakes are frequently encountered in real-world applications. If not treated well, the labeling mistakes can deteriorate the classification performances of a model seriously. To address this issue, we propose an improved Naive…

Machine Learning · Computer Science 2023-04-14 Qianhan Zeng, Yingqiu Zhu, Xuening Zhu, Feifei Wang, Weichen Zhao, Shuning Sun, Meng Su, Hansheng Wang

Text classification helps analyse texts for semantic meaning and relevance, by mapping the words against this hierarchy. An analysis of various types of texts is invaluable to understanding both their semantic meaning, as well as their…

Machine Learning · Computer Science 2022-11-16 Chaitanya Chadha, Vandit Gupta, Deepak Gupta, Ashish Khanna

Text classification is an important topic in the field of natural language processing. It has been preliminarily applied in information retrieval, digital library, automatic abstracting, text filtering, word semantic discrimination and many…

Computation and Language · Computer Science 2023-12-20 Hao Li, Brandon Bennett

Bayes Classifiers are widely used currently for recognition, identification and knowledge discovery. The fields of application are, for example, image processing, medicine, chemistry (QSAR). But by mysterious way the Naive Bayes Classifier…

Computer Vision and Pattern Recognition · Computer Science 2013-12-30 Oleg Kupervasser, Alexsander Vardy

As the amount of online text increases, the demand for text classification to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive.…

Machine Learning · Computer Science 2010-09-27 Chowdhury Mofizur Rahman, Ferdous Ahmed Sohel, Parvez Naushad, S. M. Kamruzzaman

Virtual assistants and text chatbots have recently been gaining popularity. Given the short message nature of text-based chat interactions, the language identification systems of these bots might only have 15 or 20 characters to make a…

Computation and Language · Computer Science 2017-11-02 Bernardt Duvenhage, Mfundo Ntini, Phala Ramonyai

This article presents an evaluation of several machine learning methods applied to automated text classification, alongside the design of a demonstrative system for unbalanced document categorization and distribution. The study focuses on…

Computation and Language · Computer Science 2026-02-25 Radoslaw Roszczyk, Pawel Tecza, Maciej Stodolski, Krzysztof Siwek

Bayesian methods are useful for statistical inference. However, real-world problems can be challenging using Bayesian methods when the data analyst has only limited prior knowledge. In this paper we consider a class of problems, called…

Methodology · Statistics 2019-11-20 Yixuan Qiu, Lingsong Zhang, Chuanhai Liu

Most research on natural language processing treats bias as an absolute concept: Based on a (probably complex) algorithmic analysis, a sentence, an article, or a text is classified as biased or not. Given the fact that for humans the…

Computation and Language · Computer Science 2022-10-14 Alonso Palomino, Martin Potthast, Khalid Al-Khatib, Benno Stein

As the Internet grows in size, so does the amount of text based information that exists. For many application spaces it is paramount to isolate and identify texts that relate to a particular topic. While one-class classification would be…

Artificial Intelligence · Computer Science 2021-11-02 Sameer Khanna

Text analytics based on supervised machine learning classifiers has shown great promise in a multitude of domains, but has yet to be applied to Seismology. We test various standard models (Naive Bayes, k-Nearest Neighbors, Support Vector…

Computation and Language · Computer Science 2019-04-19 Arnaud Mignan