Related papers: Text Classification Using Association Rules, Depen…

Text Classification using the Concept of Association Rule of Data Mining

As the amount of online text increases, the demand for text classification to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive.…

Machine Learning · Computer Science 2010-09-27 Chowdhury Mofizur Rahman, Ferdous Ahmed Sohel, Parvez Naushad, S. M. Kamruzzaman

Arabic Language Text Classification Using Dependency Syntax-Based Feature Selection

We study the performance of Arabic text classification combining various techniques: (a) tfidf vs. dependency syntax, for feature selection and weighting; (b) class association rules vs. support vector machines, for classification. The…

Computation and Language · Computer Science 2014-10-21 Yannis Haralambous, Yassir Elidrissi, Philippe Lenca

Mining Multi-Level Frequent Itemsets under Constraints

Mining association rules is a task of data mining, which extracts knowledge in the form of significant implication relation of useful items (objects) from a database. Mining multilevel association rules uses concept hierarchies, also called…

Databases · Computer Science 2010-12-30 Mohamed Salah Gouider, Amine Farhat

Text Classification using Data Mining

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman, Farhana Haider, Ahmed Ryadh Hasan

Text Categorization using Association Rule and Naive Bayes Classifier

As the amount of online text increases, the demand for text categorization to aid the analysis and management of text is increasing. Text is cheap, but information, in the form of knowing what classes a text belongs to, is expensive.…

Information Retrieval · Computer Science 2010-09-28 S M Kamruzzaman, Chowdhury Mofizur Rahman

Text Classification using Association Rule with a Hybrid Concept of Naive Bayes Classifier and Genetic Algorithm

Text classification is the automated assignment of natural language texts to predefined categories based on their content. Text classification is the primary requirement of text retrieval systems, which retrieve texts in response to a user…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman, Farhana Haider, Ahmed Ryadh Hasan

Association rule mining and itemset-correlation based variants

Association rules express implication formed relations among attributes in databases of itemsets. The apriori algorithm is presented, the basis for most association rule mining algorithms. It works by pruning away rules that need not be…

Databases · Computer Science 2019-07-24 Niels Mündler

Text Classification using Artificial Intelligence

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman

Adverb Is the Key: Simple Text Data Augmentation with Adverb Deletion

In the field of text data augmentation, rule-based methods are widely adopted for real-world applications owing to their cost-efficiency. However, conventional rule-based approaches suffer from the possibility of losing the original…

Computation and Language · Computer Science 2024-04-01 Juhwan Choi, YoungBin Kim

A Hybrid Web Recommendation System based on the Improved Association Rule Mining Algorithm

Given the growing interest in web recommendation systems that deliver customized data to their users, we started working on this system. Generally, recommendation systems are divided into two major categories, such as…

Information Retrieval · Computer Science 2013-12-02 Ujwala Wanaskar, Sheetal Vij, Debajyoti Mukhopadhyay

Leveraging Association Rules for Better Predictions and Better Explanations

We present a new approach to classification that combines data and knowledge. In this approach, data mining is used to derive association rules (possibly with negations) from data. Those rules are leveraged to increase the predictive…

Artificial Intelligence · Computer Science 2025-10-22 Gilles Audemard, Sylvie Coste-Marquis, Pierre Marquis, Mehdi Sabiri, Nicolas Szczepanski

Exploiting Class Labels to Boost Performance on Embedding-based Text Classification

Text classification is one of the most frequent tasks for processing textual data, facilitating among others research from large-scale datasets. Embeddings of different kinds have recently become the de facto standard as features used for…

Computation and Language · Computer Science 2020-09-03 Arkaitz Zubiaga

Selective Text Augmentation with Word Roles for Low-Resource Text Classification

Data augmentation techniques are widely used in text classification tasks to improve the performance of classifiers, especially in low-resource scenarios. Most previous methods conduct text augmentation without considering the different…

Computation and Language · Computer Science 2022-09-07 Biyang Guo, Songqiao Han, Hailiang Huang

A hybrid learning algorithm for text classification

Text classification is the process of classifying documents into predefined categories based on their content. Existing supervised learning algorithms to automatically classify text need sufficient documents to learn accurately. This paper…

Neural and Evolutionary Computing · Computer Science 2010-09-27 S. M. Kamruzzaman, Farhana Haider

A Comprehensive Survey of Text Classification Techniques and Their Research Applications: Observational and Experimental Insights

The exponential growth of textual data presents substantial challenges in management and analysis, notably due to high storage and processing costs. Text classification, a vital aspect of text mining, provides robust solutions by enabling…

Computation and Language · Computer Science 2025-01-22 Kamal Taha, Paul D. Yoo, Chan Yeun, Aya Taha

Efficient strategies for hierarchical text classification: External knowledge and auxiliary tasks

In hierarchical text classification, we perform a sequence of inference steps to predict the category of a document from top to bottom of a given class taxonomy. Most of the studies have focused on developing novel neural network…

Computation and Language · Computer Science 2020-05-25 Kervy Rivas Rojas, Gina Bustamante, Arturo Oncevay, Marco A. Sobrevilla Cabezudo

Enhancing Vision-Language Model Pre-training with Image-text Pair Pruning Based on Word Frequency

We propose Word-Frequency-based Image-Text Pair Pruning (WFPP), a novel data pruning method that improves the efficiency of VLMs. Unlike MetaCLIP, our method does not need metadata for pruning, but selects text-image pairs to prune based on…

Machine Learning · Computer Science 2024-12-11 Mingliang Liang, Martha Larson

Discovering Categorical Main and Interaction Effects Based on Association Rule Mining

With the growing size of data sets, feature selection becomes increasingly important. Taking interactions of original features into consideration will lead to extremely high dimension, especially when the features are categorical and…

Databases · Computer Science 2021-04-13 Qiuqiang Lin, Chuanhou Gao

Automatic Counterfactual Augmentation for Robust Text Classification Based on Word-Group Search

Although large-scale pre-trained language models have achieved striking results for text classification, recent work has raised concerns about the challenge of shortcut learning. In general, a keyword is regarded as a shortcut if it creates a…

Computation and Language · Computer Science 2023-07-06 Rui Song, Fausto Giunchiglia, Yingji Li, Hao Xu

A Simple and Effective Approach for Fine Tuning Pre-trained Word Embeddings for Improved Text Classification

This work presents a new and simple approach for fine-tuning pretrained word embeddings for text classification tasks. In this approach, the class in which a term appears acts as an additional contextual variable during the fine-tuning…

Computation and Language · Computer Science 2019-12-17 Amr Al-Khatib, Samhaa R. El-Beltagy