Related papers: Exploiting Class Labels to Boost Performance on Em…

TF-CR: Weighting Embeddings for Text Classification

Text classification, as the task consisting in assigning categories to textual instances, is a very common task in information science. Methods learning distributed representations of words, such as word embeddings, have become popular in…

Computation and Language · Computer Science 2020-12-15 Arkaitz Zubiaga

An Analysis of Hierarchical Text Classification Using Word Embeddings

Efficient distributed numerical word representation models (word embeddings) combined with modern machine learning algorithms have recently yielded considerable improvement on automatic document classification tasks. However, the…

Computation and Language · Computer Science 2018-09-07 Roger A. Stein , Patricia A. Jaques , Joao F. Valiati

Joint Embedding of Words and Labels for Text Classification

Word embeddings are effective intermediate representations for capturing semantic regularities between words, when learning the representations of text sequences. We propose to view text classification as a label-word joint embedding…

Computation and Language · Computer Science 2018-05-14 Guoyin Wang , Chunyuan Li , Wenlin Wang , Yizhe Zhang , Dinghan Shen , Xinyuan Zhang , Ricardo Henao , Lawrence Carin

An empirical study on large scale text classification with skip-gram embeddings

We investigate the integration of word embeddings as classification features in the setting of large scale text classification. Such representations have been used in a plethora of tasks, however their application in classification…

Computation and Language · Computer Science 2016-06-22 Georgios Balikas , Massih-Reza Amini

Compositional Coding Capsule Network with K-Means Routing for Text Classification

Text classification is a challenging problem which aims to identify the category of texts. In the process of training, word embeddings occupy a large part of parameters. Under the limitation of limited computing resources, it indirectly…

Machine Learning · Computer Science 2022-06-03 Hao Ren , Hong Lu

Utility of General and Specific Word Embeddings for Classifying Translational Stages of Research

Conventional text classification models make a bag-of-words assumption reducing text into word occurrence counts per document. Recent algorithms such as word2vec are capable of learning semantic meaning and similarity between words in an…

Computation and Language · Computer Science 2018-07-11 Vincent Major , Alisa Surkis , Yindalon Aphinyanaphongs

A Comparative Study on TF-IDF feature Weighting Method and its Analysis using Unstructured Dataset

Text Classification is the process of categorizing text into the relevant categories and its algorithms are at the core of many Natural Language Processing (NLP). Term Frequency-Inverse Document Frequency (TF-IDF) and NLP are the most…

Computation and Language · Computer Science 2023-08-09 Mamata Das , Selvakumar K. , P. J. A. Alphonse

A Simple and Effective Approach for Fine Tuning Pre-trained Word Embeddings for Improved Text Classification

This work presents a new and simple approach for fine-tuning pretrained word embeddings for text classification tasks. In this approach, the class in which a term appears, acts as an additional contextual variable during the fine tuning…

Computation and Language · Computer Science 2019-12-17 Amr Al-Khatib , Samhaa R. El-Beltagy

Effects of term weighting approach with and without stop words removing on Arabic text classification

Classifying text is a method for categorizing documents into pre-established groups. Text documents must be prepared and represented in a way that is appropriate for the algorithms used for data mining prior to classification. As a result,…

Computation and Language · Computer Science 2024-02-26 Esra'a Alhenawi , Ruba Abu Khurma , Pedro A. Castillo , Maribel G. Arenas

Domain Lexical Knowledge-based Word Embedding Learning for Text Classification under Small Data

Pre-trained language models such as BERT have been proved to be powerful in many natural language processing tasks. But in some text classification applications such as emotion recognition and sentiment analysis, BERT may not lead to…

Computation and Language · Computer Science 2025-06-03 Zixiao Zhu , Kezhi Mao

Contextually Propagated Term Weights for Document Representation

Word embeddings predict a word from its neighbours by learning small, dense embedding vectors. In practice, this prediction corresponds to a semantic score given to the predicted word (or term weight). We present a novel model that, given a…

Information Retrieval · Computer Science 2019-06-04 Casper Hansen , Christian Hansen , Stephen Alstrup , Jakob Grue Simonsen , Christina Lioma

Improve Text Classification Accuracy with Intent Information

Text classification, a core component of task-oriented dialogue systems, attracts continuous research from both the research and industry community, and has resulted in tremendous progress. However, existing method does not consider the use…

Computation and Language · Computer Science 2022-12-16 Yifeng Xie

Text Classification based on Word Subspace with Term-Frequency

Text classification has become indispensable due to the rapid increase of text in digital form. Over the past three decades, efforts have been made to approach this task using various learning algorithms and statistical models based on…

Machine Learning · Statistics 2018-06-11 Erica K. Shimomoto , Lincon S. Souza , Bernardo B. Gatto , Kazuhiro Fukui

On the Learnability of Concepts: With Applications to Comparing Word Embedding Algorithms

Word Embeddings are used widely in multiple Natural Language Processing (NLP) applications. They are coordinates associated with each word in a dictionary, inferred from statistical properties of these words in a large corpus. In this paper…

Computation and Language · Computer Science 2020-06-18 Adam Sutton , Nello Cristianini

Word Embedding Techniques for Classification of Star Ratings

Telecom services are at the core of today's societies' everyday needs. The availability of numerous online forums and discussion platforms enables telecom providers to improve their services by exploring the views of their customers to…

Computation and Language · Computer Science 2025-04-21 Hesham Abdelmotaleb , Craig McNeile , Malgorzata Wojtys

Multi-Task Label Embedding for Text Classification

Multi-task learning in text classification leverages implicit correlations among related tasks to extract common features and yield performance gains. However, most previous works treat labels of each task as independent and meaningless…

Computation and Language · Computer Science 2017-10-20 Honglun Zhang , Liqiang Xiao , Wenqing Chen , Yongkun Wang , Yaohui Jin

Learning to Weight for Text Classification

In information retrieval (IR) and related tasks, term weighting approaches typically consider the frequency of the term in the document and in the collection in order to compute a score reflecting the importance of the term for the…

Machine Learning · Computer Science 2021-09-22 Alejandro Moreo Fernández , Andrea Esuli , Fabrizio Sebastiani

Incorporating Word Embeddings into Open Directory Project based Large-scale Classification

Recently, implicit representation models, such as embedding or deep learning, have been successfully adopted to text classification task due to their outstanding performance. However, these approaches are limited to small- or moderate-scale…

Computation and Language · Computer Science 2018-04-04 Kang-Min Kim , Aliyeva Dinara , Byung-Ju Choi , SangKeun Lee

A Curriculum Learning Approach for Multi-domain Text Classification Using Keyword weight Ranking

Text classification is a very classic NLP task, but it has two prominent shortcomings: On the one hand, text classification is deeply domain-dependent. That is, a classifier trained on the corpus of one domain may not perform so well in…

Computation and Language · Computer Science 2022-10-28 Zilin Yuan , Yinghui Li , Yangning Li , Rui Xie , Wei Wu , Hai-Tao Zheng

Multi-Label Clinical Text Eligibility Classification and Summarization System

Clinical trials are central to medical progress because they help improve understanding of human health and the healthcare system. They play a key role in discovering new ways to detect, prevent, or treat diseases, and it is essential that…

Computation and Language · Computer Science 2025-10-16 Surya Tejaswi Yerramsetty , Almas Fathimah