Related papers: Classifying textual data: shallow, deep and ensemb…

Text Classification: A Perspective of Deep Learning Methods

In recent years, with the rapid development of information on the Internet, the number of complex texts and documents has increased exponentially, which requires a deeper understanding of deep learning methods in order to accurately…

Computation and Language · Computer Science 2023-09-26 Zhongwei Wan

Text Classification Algorithms: A Survey

In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine…

Machine Learning · Computer Science 2020-05-21 Kamran Kowsari , Kiana Jafari Meimandi , Mojtaba Heidarysafa , Sanjana Mendu , Laura E. Barnes , Donald E. Brown

A Survey on Text Classification: From Shallow to Deep Learning

Text classification is the most fundamental and essential task in natural language processing. The last decade has seen a surge of research in this area due to the unprecedented success of deep learning. Numerous methods, datasets, and…

Computation and Language · Computer Science 2021-12-23 Qian Li , Hao Peng , Jianxin Li , Congying Xia , Renyu Yang , Lichao Sun , Philip S. Yu , Lifang He

Ensemble deep learning: A review

Ensemble learning combines several individual models to obtain better generalization performance. Currently, deep learning architectures are showing better performance compared to the shallow or traditional models. Deep ensemble learning…

Machine Learning · Computer Science 2022-08-09 M. A. Ganaie , Minghui Hu , A. K. Malik , M. Tanveer , P. N. Suganthan

Comparison of Machine Learning Methods for Assigning Software Issues to Team Members

Software issues contain units of work to fix, improve, or create new threads during the development and facilitate communication among the team members. Assigning an issue to the most relevant team member and determining a category of an…

Computation and Language · Computer Science 2023-08-24 Büşra Tabak , Fatma Başak Aydemir

Deep Super Learner: A Deep Ensemble for Classification Problems

Deep learning has become very popular for tasks such as predictive modeling and pattern recognition in handling big data. Deep learning is a powerful machine learning method that extracts lower level features and feeds them forward for the…

Machine Learning · Computer Science 2018-03-07 Steven Young , Tamer Abdou , Ayse Bener

A Robust Hybrid Approach for Textual Document Classification

Text document classification is an important task for diverse natural language processing based applications. Traditional machine learning approaches mainly focused on reducing dimensionality of textual data to perform classification. This…

Computation and Language · Computer Science 2019-09-13 Muhammad Nabeel Asim , Muhammad Usman Ghani Khan , Muhammad Imran Malik , Andreas Dengel , Sheraz Ahmed

Deep Learning Based Text Classification: A Comprehensive Review

Deep learning based models have surpassed classical machine learning based approaches in various text classification tasks, including sentiment analysis, news categorization, question answering, and natural language inference. In this…

Computation and Language · Computer Science 2021-01-05 Shervin Minaee , Nal Kalchbrenner , Erik Cambria , Narjes Nikzad , Meysam Chenaghlu , Jianfeng Gao

A Unifying Review of Deep and Shallow Anomaly Detection

Deep learning approaches to anomaly detection have recently improved the state of the art in detection performance on complex datasets such as large collections of images or text. These results have sparked a renewed interest in the anomaly…

Machine Learning · Computer Science 2021-02-09 Lukas Ruff , Jacob R. Kauffmann , Robert A. Vandermeulen , Grégoire Montavon , Wojciech Samek , Marius Kloft , Thomas G. Dietterich , Klaus-Robert Müller

DOC: Deep Open Classification of Text Documents

Traditional supervised learning makes the closed-world assumption that the classes appeared in the test data must have appeared in training. This also applies to text learning or text classification. As learning is used increasingly in…

Computation and Language · Computer Science 2017-09-27 Lei Shu , Hu Xu , Bing Liu

A Comprehensive Survey of Text Classification Techniques and Their Research Applications: Observational and Experimental Insights

The exponential growth of textual data presents substantial challenges in management and analysis, notably due to high storage and processing costs. Text classification, a vital aspect of text mining, provides robust solutions by enabling…

Computation and Language · Computer Science 2025-01-22 Kamal Taha , Paul D. Yoo , Chan Yeun , Aya Taha

Automated Single-Label Patent Classification using Ensemble Classifiers

Many thousands of patent applications arrive at patent offices around the world every day. One important subtask when a patent application is submitted is to assign one or more classification codes from the complex and hierarchical patent…

Information Retrieval · Computer Science 2022-03-08 Eleni Kamateri , Vasileios Stamatis , Konstantinos Diamantaras , Michail Salampasis

On the Effectiveness of Neural Ensembles for Image Classification with Small Datasets

Deep neural networks represent the gold standard for image classification. However, they usually need large amounts of data to reach superior performance. In this work, we focus on image classification problems with a few labeled examples…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Lorenzo Brigato , Luca Iocchi

Semantic classifier approach to document classification

In this paper we propose a new document classification method, bridging discrepancies (so-called semantic gap) between the training set and the application sets of textual data. We demonstrate its superiority over classical text…

Information Retrieval · Computer Science 2017-01-17 Piotr Borkowski , Krzysztof Ciesielski , Mieczysław A. Kłopotek

A Comprehensive Survey on Deep Clustering: Taxonomy, Challenges, and Future Directions

Clustering is a fundamental machine learning task which has been widely studied in the literature. Classic clustering methods follow the assumption that data are represented as features in a vectorized form through various representation…

Machine Learning · Computer Science 2022-06-16 Sheng Zhou , Hongjia Xu , Zhuonan Zheng , Jiawei Chen , Zhao li , Jiajun Bu , Jia Wu , Xin Wang , Wenwu Zhu , Martin Ester

Understanding Text Classification Data and Models Using Aggregated Input Salience

Realizing when a model is right for a wrong reason is not trivial and requires a significant effort by model developers. In some cases an input salience method, which highlights the most important parts of the input, may reveal problematic…

Computation and Language · Computer Science 2023-01-12 Sebastian Ebert , Alice Shoshana Jakobovits , Katja Filippova

Misclassification cost-sensitive ensemble learning: A unifying framework

Over the years, a plethora of cost-sensitive methods have been proposed for learning on data when different types of misclassification errors incur different costs. Our contribution is a unifying framework that provides a comprehensive and…

Machine Learning · Computer Science 2020-07-16 George Petrides , Wouter Verbeke

Efficient Classification of Long Documents Using Transformers

Several methods have been proposed for classifying long textual documents using Transformers. However, there is a lack of consensus on a benchmark to enable a fair comparison among different approaches. In this paper, we provide a…

Computation and Language · Computer Science 2022-03-23 Hyunji Hayley Park , Yogarshi Vyas , Kashif Shah

Investigating the Working of Text Classifiers

Text classification is one of the most widely studied tasks in natural language processing. Motivated by the principle of compositionality, large multilayer neural network models have been employed for this task in an attempt to effectively…

Computation and Language · Computer Science 2018-08-07 Devendra Singh Sachan , Manzil Zaheer , Ruslan Salakhutdinov

Combining Self-labeling with Selective Sampling

Since data is the fuel that drives machine learning models, and access to labeled data is generally expensive, semi-supervised methods are constantly popular. They enable the acquisition of large datasets without the need for too many…

Machine Learning · Computer Science 2023-01-12 Jędrzej Kozal , Michał Woźniak