Related papers: Natural Language Processing Models for Robust Docu…

Comparative Study of Long Document Classification

The amount of information stored in the form of documents on the internet has been increasing rapidly. Thus it has become a necessity to organize and maintain these documents in an optimum manner. Text classification algorithms study the…

Computation and Language · Computer Science 2022-02-22 Vedangi Wagh, Snehal Khandve, Isha Joshi, Apurva Wani, Geetanjali Kale, Raviraj Joshi

A Survey of Naïve Bayes Machine Learning approach in Text Document Classification

Text Document classification aims in associating one or more predefined categories based on the likelihood suggested by the training set of labeled documents. Many machine learning algorithms play a vital role in training the system with…

Machine Learning · Computer Science 2010-03-10 Vidhya. K. A, G. Aghila

LegaLMFiT: Efficient Short Legal Text Classification with LSTM Language Model Pre-Training

Large Transformer-based language models such as BERT have led to broad performance improvements on many NLP tasks. Domain-specific variants of these models have demonstrated excellent performance on a variety of specialised tasks. In legal…

Computation and Language · Computer Science 2021-09-16 Benjamin Clavié, Akshita Gheewala, Paul Briton, Marc Alphonsus, Rym Laabiyad, Francesco Piccoli

Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?

The rapid adoption of LLMs has overshadowed the potential advantages of traditional BERT-like models in text classification. This study challenges the prevailing "LLM-centric" trend by systematically comparing three category methods, i.e.,…

Computation and Language · Computer Science 2025-05-27 Junyan Zhang, Yiming Huang, Shuliang Liu, Yubo Gao, Xuming Hu

Text Classification: Neural Networks VS Machine Learning Models VS Pre-trained Models

Text classification is a very common task nowadays and there are many efficient methods and algorithms that we can employ to accomplish it. Transformers have revolutionized the field of deep learning, particularly in Natural Language…

Machine Learning · Computer Science 2024-12-31 Christos Petridis

Comparison of Machine Learning Models to Classify Documents on Digital Development

Automated document classification is a trending topic in Natural Language Processing (NLP) due to the extensive growth in digital databases. However, a model that fits well for a specific classification task might perform weakly for another…

Machine Learning · Computer Science 2025-10-03 Uvini Ranaweera, Bawun Mawitagama, Sanduni Liyanage, Sandupa Keshan, Tiloka de Silva, Supun Hewawalpita

Advancements in Natural Language Processing: Exploring Transformer-Based Architectures for Text Understanding

Natural Language Processing (NLP) has witnessed a transformative leap with the advent of transformer-based architectures, which have significantly enhanced the ability of machines to understand and generate human-like text. This paper…

Computation and Language · Computer Science 2025-03-27 Tianhao Wu, Yu Wang, Ngoc Quach

Detecting AI Generated Text Based on NLP and Machine Learning Approaches

Recent advances in natural language processing (NLP) may enable artificial intelligence (AI) models to generate writing that is identical to human written form in the future. This might have profound ethical, legal, and social…

Machine Learning · Computer Science 2024-04-17 Nuzhat Prova

Transfer Learning Robustness in Multi-Class Categorization by Fine-Tuning Pre-Trained Contextualized Language Models

This study compares the effectiveness and robustness of multi-class categorization of Amazon product data using transfer learning on pre-trained contextualized language models. Specifically, we fine-tuned BERT and XLNet, two bidirectional…

Machine Learning · Statistics 2019-09-24 Xinyi Liu, Artit Wangperawong

Web Document Categorization Using Naive Bayes Classifier and Latent Semantic Analysis

The rapid growth of web documents due to heavy use of the World Wide Web necessitates efficient techniques to classify documents on the web. High volumes of data with high diversity are thus produced every second. Automatically…

Computation and Language · Computer Science 2020-06-03 Alireza Saleh Sedghpour, Mohammad Reza Saleh Sedghpour

Profitable Trade-Off Between Memory and Performance In Multi-Domain Chatbot Architectures

Text classification problem is a very broad field of study in the field of natural language processing. In short, the text classification problem is to determine which of the previously determined classes the given text belongs to.…

Computation and Language · Computer Science 2021-12-28 D. Emre Taşar, Şükrü Ozan, M. Fatih Akca, Oğuzhan Ölmez, Semih Gülüm, Seçilay Kutal, Ceren Belhan

An Automated Knowledge Mining and Document Classification System with Multi-model Transfer Learning

Service manual documents are crucial to the engineering company as they provide guidelines and knowledge to service engineers. However, it has become inconvenient and inefficient for service engineers to retrieve specific knowledge from…

Computation and Language · Computer Science 2021-06-25 Jia Wei Chong, Zhiyuan Chen, Mei Shin Oh

BERT_SE: A Pre-trained Language Representation Model for Software Engineering

The application of Natural Language Processing (NLP) has achieved a high level of relevance in several areas. In the field of software engineering (SE), NLP applications are based on the classification of similar texts (e.g. software…

Software Engineering · Computer Science 2021-12-02 Eliane Maria De Bortoli Fávero, Dalcimar Casanova

The Unreasonable Effectiveness of the Baseline: Discussing SVMs in Legal Text Classification

We aim to highlight an interesting trend to contribute to the ongoing debate around advances within legal Natural Language Processing. Recently, the focus for most legal text classification tasks has shifted towards large pre-trained deep…

Computation and Language · Computer Science 2021-10-25 Benjamin Clavié, Marc Alphonsus

AI-Generated Text Detection and Classification Based on BERT Deep Learning Algorithm

AI-generated text detection plays an increasingly important role in various fields. In this study, we developed an efficient AI-generated text detection model based on the BERT algorithm, which provides new ideas and methods for solving…

Computation and Language · Computer Science 2024-10-15 Hao Wang, Jianwei Li, Zhengyu Li

Comparative Analysis of Libraries for the Sentimental Analysis

This study's main goal is to provide a comparison of libraries using machine learning methods. Experts in natural language processing (NLP) are becoming more and more interested in sentiment analysis (SA) of text changes. The…

Computation and Language · Computer Science 2023-07-27 Wendy Ccoya, Edson Pinto

Low-Shot Classification: A Comparison of Classical and Deep Transfer Machine Learning Approaches

Despite the recent success of deep transfer learning approaches in NLP, there is a lack of quantitative studies demonstrating the gains these models offer in low-shot text classification tasks over existing paradigms. Deep transfer learning…

Machine Learning · Computer Science 2019-07-18 Peter Usherwood, Steven Smit

A Natural Language Processing and Deep Learning based Model for Automated Vehicle Diagnostics using Free-Text Customer Service Reports

Initial fault detection and diagnostics are imperative measures to improve the efficiency, safety, and stability of vehicle operation. In recent years, numerous studies have investigated data-driven approaches to improve the vehicle…

Systems and Control · Electrical Eng. & Systems 2021-12-01 Ali Khodadadi, Soroush Ghandiparsi, Chen-Nee Chuah

Automated Text Mining of Experimental Methodologies from Biomedical Literature

Biomedical literature is a rapidly expanding field of science and technology. Classification of biomedical texts is an essential part of biomedicine research, especially in the field of biology. This work proposes the fine-tuned DistilBERT,…

Computation and Language · Computer Science 2024-04-23 Ziqing Guo

A hybrid learning algorithm for text classification

Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper…

Neural and Evolutionary Computing · Computer Science 2010-09-27 S. M. Kamruzzaman, Farhana Haider