Related papers: Supervised learning Methods for Bangla Web Documen…

A Comparative Study on Different Types of Approaches to Bengali document Categorization

Document categorization is a technique where the category of a document is determined. In this paper three well-known supervised learning techniques which are Support Vector Machine (SVM), Naïve Bayes (NB) and Stochastic Gradient…

Computation and Language · Computer Science 2017-01-31 Md. Saiful Islam, Fazla Elahi Md Jubayer, Syed Ikhtiar Ahmed

Machine and Deep Learning Methods with Manual and Automatic Labelling for News Classification in Bangla Language

Research in Natural Language Processing (NLP) has increasingly become important due to applications such as text classification, text mining, sentiment analysis, POS tagging, named entity recognition, textual entailment, and many others.…

Artificial Intelligence · Computer Science 2022-10-21 Istiak Ahmad, Fahad AlQurashi, Rashid Mehmood

Comparison of Classical Machine Learning Approaches on Bangla Textual Emotion Analysis

Detecting emotions from text is an extension of simple sentiment polarity detection. Instead of considering only positive or negative sentiments, emotions are conveyed in a more tangible manner; thus, they can be expressed as many shades…

Computation and Language · Computer Science 2019-07-19 Md. Ataur Rahman, Md. Hanif Seddiqui

A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit Recognition

Bangla handwritten digit recognition is a significant step forward in the development of Bangla OCR. However, the intricate shape, structural likeness and distinctive composition style of Bangla digits make it relatively challenging to…

Computer Vision and Pattern Recognition · Computer Science 2022-01-26 Md. Ferdous Wahid, Md. Fahim Shahriar, Md. Shohanur Islam Sobuj

Bangla Text Classification using Transformers

Text classification has been one of the earliest problems in NLP. Over time the scope of application areas has broadened and the difficulty of dealing with new areas (e.g., noisy social media content) has increased. The problem-solving…

Computation and Language · Computer Science 2020-11-10 Tanvirul Alam, Akib Khan, Firoj Alam

Feature Extraction Using Deep Generative Models for Bangla Text Classification on a New Comprehensive Dataset

The selection of features for text classification is a fundamental task in text mining and information retrieval. Despite being the sixth most widely spoken language in the world, Bangla has received little attention due to the scarcity of…

Information Retrieval · Computer Science 2023-08-29 Md. Rafi-Ur-Rashid, Sami Azam, Mirjam Jonkman

Robust and Consistent Estimation of Word Embedding for Bangla Language by fine-tuning Word2Vec Model

Word embedding, or the vector representation of a word, holds syntactic and semantic characteristics of the word, which can be an informative feature for any machine learning-based model in natural language processing. There are several deep…

Computation and Language · Computer Science 2021-05-05 Rifat Rahman

Bangla Music Genre Classification Using Bidirectional LSTMS

Bangla music is rich in its own musical culture. Nowadays, music genre classification is very significant because of the exponential increase in available music, both in digital and physical formats. It is necessary to index them…

Sound · Computer Science 2026-01-22 Muntakimur Rahaman, Md Mahmudul Hoque, Md Mehedi Hassain

Bangla Word Clustering Based on Tri-gram, 4-gram and 5-gram Language Model

In this paper, we describe a research method that generates Bangla word clusters on the basis of relating to meaning in language and contextual similarity. The importance of word clustering is in parts of speech (POS) tagging, word sense…

Computation and Language · Computer Science 2017-01-31 Dipaloke Saha, Md Saddam Hossain, MD. Saiful Islam, Sabir Ismail

A Survey of Naïve Bayes Machine Learning approach in Text Document Classification

Text document classification aims at associating one or more predefined categories based on the likelihood suggested by the training set of labeled documents. Many machine learning algorithms play a vital role in training the system with…

Machine Learning · Computer Science 2010-03-10 Vidhya. K. A, G. Aghila

Sentiment Classification in Bangla Textual Content: A Comparative Study

Sentiment analysis has been widely used to understand our views on social and political agendas or user experiences over a product. It is one of the core, well-researched areas in NLP. However, for low-resource languages, like Bangla,…

Computation and Language · Computer Science 2020-11-23 Md. Arid Hasan, Jannatul Tajrin, Shammur Absar Chowdhury, Firoj Alam

Performance Analysis of Supervised Machine Learning Algorithms for Text Classification

The demand for text classification is growing significantly in web searching, data mining, web ranking, recommendation systems, and so many other fields of information and technology. This paper illustrates the text classification process…

Computation and Language · Computer Science 2025-09-03 Sadia Zaman Mishu, S M Rafiuddin

Efficient Measuring of Readability to Improve Documents Accessibility for Arabic Language Learners

This paper presents an approach based on supervised machine learning methods to build a classifier that can identify text complexity in order to present Arabic language learners with texts suitable to their levels. The approach is based on…

Computation and Language · Computer Science 2021-09-20 Sadik Bessou, Ghozlane Chenni

Bengali Text Classification: An Evaluation of Large Language Model Approaches

Bengali text classification is a significant task in natural language processing (NLP), where text is categorized into predefined labels. Unlike English, Bengali faces challenges due to the lack of extensive annotated datasets and…

Computation and Language · Computer Science 2026-01-21 Md Mahmudul Hoque, Md Mehedi Hassain, Md Hojaifa Tanvir, Rahul Nandy

Study and Observation of the Variation of Accuracies of KNN, SVM, LMNN, ENN Algorithms on Eleven Different Datasets from UCI Machine Learning Repository

Machine learning enables computers to learn from data without being explicitly programmed [1, 2]. Machine learning can be classified as supervised and unsupervised learning. In supervised learning, computers learn an objective that…

Machine Learning · Computer Science 2019-02-06 Mohammad Mahmudur Rahman Khan, Rezoana Bente Arif, Md. Abu Bakr Siddique, Mahjabin Rahman Oishe

BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text Images

Handwriting recognition remains challenging for some of the most spoken languages, like Bangla, due to the complexity of line and word segmentation brought by the curvilinear nature of writing and lack of quality datasets. This paper solves…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Sheikh Mohammad Jubaer, Nazifa Tabassum, Md. Ataur Rahman, Mohammad Khairul Islam

Abstractive Text Summarization for Bangla Language Using NLP and Machine Learning Approaches

Text summarization involves reducing extensive documents to short sentences that encapsulate the essential ideas. The goal is to create a summary that effectively conveys the main points of the original text. We spend a significant amount…

Computation and Language · Computer Science 2025-01-28 Asif Ahammad Miazee, Tonmoy Roy, Md Robiul Islam, Yeamin Safat

A comparison of SVM and RVM for Document Classification

Document classification is the task of assigning a new unclassified document to one of a predefined set of classes. Content-based document classification uses the content of the document with some weighting criteria to assign it to one…

Information Retrieval · Computer Science 2013-01-15 Muhammad Rafi, Mohammad Shahid Shaikh

Tuning Traditional Language Processing Approaches for Pashto Text Classification

Today, text classification has become a critical task for concerned individuals for numerous purposes. Hence, several studies have been conducted to develop automatic text classification for national and international languages. However, the…

Computation and Language · Computer Science 2023-05-09 Jawid Ahmad Baktash, Mursal Dawodi, Mohammad Zarif Joya, Nematullah Hassanzada

Empirical Study of Deep Learning for Text Classification in Legal Document Review

Predictive coding has been widely used in legal matters to find relevant or privileged documents in large sets of electronically stored information. It saves significant time and cost. Logistic Regression (LR) and Support Vector…

Information Retrieval · Computer Science 2019-04-04 Fusheng Wei, Han Qin, Shi Ye, Haozhen Zhao