Related papers: Document Classification Using Distributed Machine …

Classification of Scientific Papers With Big Data Technologies

Data sizes that cannot be processed by conventional data storage and analysis systems are named as Big Data.It also refers to nex technologies developed to store, process and analyze large amounts of data. Automatic information retrieval…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-02-15 Selen Gurbuz , Galip Aydin

A Survey of Na\"ive Bayes Machine Learning approach in Text Document Classification

Text Document classification aims in associating one or more predefined categories based on the likelihood suggested by the training set of labeled documents. Many machine learning algorithms play a vital role in training the system with…

Machine Learning · Computer Science 2010-03-10 Vidhya. K. A , G. Aghila

Categorical Classification of Book Summaries Using Word Embedding Techniques

In this study, book summaries and categories taken from book sites were classified using word embedding methods, natural language processing techniques and machine learning algorithms. In addition, one hot encoding, Word2Vec and Term…

Computation and Language · Computer Science 2025-07-30 Kerem Keskin , Mümine Kaya Keleş

Preparation of Improved Turkish DataSet for Sentiment Analysis in Social Media

A public dataset, with a variety of properties suitable for sentiment analysis [1], event prediction, trend detection and other text mining applications, is needed in order to be able to successfully perform analysis studies. The vast…

Computation and Language · Computer Science 2018-02-01 Semiha Makinist , Ibrahim Riza Hallac , Betul Ay Karakus , Galip Aydin

Distributed Readability Analysis Of Turkish Elementary School Textbooks

The readability assessment deals with estimating the level of difficulty in reading texts.Many readability tests, which do not indicate execution efficiency, have been applied on specific texts to measure the reading grade level in science…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-02-13 Betul Karakus , Ibrahim Riza Hallac , Galip Aydin

Text Classification for Azerbaijani Language Using Machine Learning and Embedding

Text classification systems will help to solve the text clustering problem in the Azerbaijani language. There are some text-classification applications for foreign languages, but we tried to build a newly developed system to solve this…

Computation and Language · Computer Science 2020-01-01 Umid Suleymanov , Behnam Kiani Kalejahi , Elkhan Amrahov , Rashid Badirkhanli

Turkish Text Classification: From Lexicon Analysis to Bidirectional Transformer

Text classification has seen an increased use in both academic and industry settings. Though rule based methods have been fairly successful, supervised machine learning has been shown to be most successful for most languages, where most…

Computation and Language · Computer Science 2021-04-26 Deniz Kavi

Text Classification Using Hybrid Machine Learning Algorithms on Big Data

Recently, there are unprecedented data growth originating from different online platforms which contribute to big data in terms of volume, velocity, variety and veracity (4Vs). Given this nature of big data which is unstructured, performing…

Information Retrieval · Computer Science 2021-04-01 D. C. Asogwa , S. O. Anigbogu , I. E. Onyenwe , F. A. Sani

Text classification using machine learning methods

In this paper we present the results of an experiment aimed to use machine learning methods to obtain models that can be used for the automatic classification of products. In order to apply automatic classification methods, we transformed…

Computation and Language · Computer Science 2025-02-28 Bogdan Oancea

A Robust Hybrid Approach for Textual Document Classification

Text document classification is an important task for diverse natural language processing based applications. Traditional machine learning approaches mainly focused on reducing dimensionality of textual data to perform classification. This…

Computation and Language · Computer Science 2019-09-13 Muhammad Nabeel Asim , Muhammad Usman Ghani Khan , Muhammad Imran Malik , Andreas Dengel , Sheraz Ahmed

Performance Analysis of Supervised Machine Learning Algorithms for Text Classification

The demand for text classification is growing significantly in web searching, data mining, web ranking, recommendation systems, and so many other fields of information and technology. This paper illustrates the text classification process…

Computation and Language · Computer Science 2025-09-03 Sadia Zaman Mishu , S M Rafiuddin

Machine Learning Technique Based Fake News Detection

False news has received attention from both the general public and the scholarly world. Such false information has the ability to affect public perception, giving nefarious groups the chance to influence the results of public events like…

Computation and Language · Computer Science 2023-09-26 Biplob Kumar Sutradhar , Md. Zonaid , Nushrat Jahan Ria , Sheak Rashed Haider Noori

Natural Language Processing Models for Robust Document Categorization

This article presents an evaluation of several machine learning methods applied to automated text classification, alongside the design of a demonstrative system for unbalanced document categorization and distribution. The study focuses on…

Computation and Language · Computer Science 2026-02-25 Radoslaw Roszczyk , Pawel Tecza , Maciej Stodolski , Krzysztof Siwek

Development of Fake News Model using Machine Learning through Natural Language Processing

Fake news detection research is still in the early stage as this is a relatively new phenomenon in the interest raised by society. Machine learning helps to solve complex problems and to build AI systems nowadays and especially in those…

Computation and Language · Computer Science 2022-01-20 Sajjad Ahmed , Knut Hinkelmann , Flavio Corradini

Document Classification by Inversion of Distributed Language Representations

There have been many recent advances in the structure and measurement of distributed language models: those that map from words to a vector-space that is rich in information about word choice and composition. This vector-space is the…

Computation and Language · Computer Science 2015-07-27 Matt Taddy

A hybrid learning algorithm for text classification

Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper…

Neural and Evolutionary Computing · Computer Science 2010-09-27 S. M. Kamruzzaman , Farhana Haider

A Comparative Study on Different Types of Approaches to Bengali document Categorization

Document categorization is a technique where the category of a document is determined. In this paper three well-known supervised learning techniques which are Support Vector Machine(SVM), Na\"ive Bayes(NB) and Stochastic Gradient…

Computation and Language · Computer Science 2017-01-31 Md. Saiful Islam , Fazla Elahi Md Jubayer , Syed Ikhtiar Ahmed

An Analysis of Hierarchical Text Classification Using Word Embeddings

Efficient distributed numerical word representation models (word embeddings) combined with modern machine learning algorithms have recently yielded considerable improvement on automatic document classification tasks. However, the…

Computation and Language · Computer Science 2018-09-07 Roger A. Stein , Patricia A. Jaques , Joao F. Valiati

Machine Learning Pipelines with Modern Big Data Tools for High Energy Physics

The effective utilization at scale of complex machine learning (ML) techniques for HEP use cases poses several technological challenges, most importantly on the actual implementation of dedicated end-to-end data pipelines. A solution to…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-17 Matteo Migliorini , Riccardo Castellotti , Luca Canali , Marco Zanetti

Bug Classification: Feature Extraction and Comparison of Event Model using Na\"ive Bayes Approach

In software industries, individuals at different levels from customer to an engineer apply diverse mechanisms to detect to which class a particular bug should be allocated. Sometimes while a simple search in Internet might help, in many…

Software Engineering · Computer Science 2013-04-08 Sunil Joy Dommati , Ruchi Agrawal , Ram Mohana Reddy G. , S. Sowmya Kamath