Related papers: Conical Classification For Computationally Efficie…

Classifying text using machine learning models and determining conversation drift

Text classification helps analyse texts for semantic meaning and relevance, by mapping the words against this hierarchy. An analysis of various types of texts is invaluable to understanding both their semantic meaning, as well as their…

Machine Learning · Computer Science 2022-11-16 Chaitanya Chadha , Vandit Gupta , Deepak Gupta , Ashish Khanna

Rationale-Augmented Convolutional Neural Networks for Text Classification

We present a new Convolutional Neural Network (CNN) model for text classification that jointly exploits labels on documents and their component sentences. Specifically, we consider scenarios in which annotators explicitly mark sentences (or…

Computation and Language · Computer Science 2016-09-27 Ye Zhang , Iain Marshall , Byron C. Wallace

Text Classification: A Perspective of Deep Learning Methods

In recent years, with the rapid development of information on the Internet, the number of complex texts and documents has increased exponentially, which requires a deeper understanding of deep learning methods in order to accurately…

Computation and Language · Computer Science 2023-09-26 Zhongwei Wan

Machine Identification of High Impact Research through Text and Image Analysis

The volume of academic paper submissions and publications is growing at an ever increasing rate. While this flood of research promises progress in various fields, the sheer volume of output inherently increases the amount of noise. We…

Information Retrieval · Computer Science 2020-05-22 Marko Stamenovic , Jeibo Luo

The Effectiveness of Classification on Information Retrieval System (Case Study)

Large amount of unstructured designed information is difficult to deal with. Obtaining specific information is a hard mission and takes a lot of time. Information Retrieval System (IR) is a way to solve this kind of problem. IR is a good…

Information Retrieval · Computer Science 2018-04-03 Maher Abdullah , Mohammed GH. I. Al Zamil

A Comprehensive Survey of Text Classification Techniques and Their Research Applications: Observational and Experimental Insights

The exponential growth of textual data presents substantial challenges in management and analysis, notably due to high storage and processing costs. Text classification, a vital aspect of text mining, provides robust solutions by enabling…

Computation and Language · Computer Science 2025-01-22 Kamal Taha , Paul D. Yoo , Chan Yeun , Aya Taha

Machine learning approach for text and document mining

Text Categorization (TC), also known as Text Classification, is the task of automatically classifying a set of text documents into different categories from a predefined set. If a document belongs to exactly one of the categories, it is a…

Information Retrieval · Computer Science 2014-06-09 Vishwanath Bijalwan , Pinki Kumari , Jordan Pascual , Vijay Bhaskar Semwal

Light-Weighted CNN for Text Classification

For management, documents are categorized into a specific category, and to do these, most of the organizations use manual labor. In today's automation era, manual efforts on such a task are not justified, and to avoid this, we have so many…

Machine Learning · Computer Science 2020-04-20 Ritu Yadav

Text Classification Algorithms: A Survey

In recent years, there has been an exponential growth in the number of complex documents and texts that require a deeper understanding of machine learning methods to be able to accurately classify texts in many applications. Many machine…

Machine Learning · Computer Science 2020-05-21 Kamran Kowsari , Kiana Jafari Meimandi , Mojtaba Heidarysafa , Sanjana Mendu , Laura E. Barnes , Donald E. Brown

Empirical Comparisons of CNN with Other Learning Algorithms for Text Classification in Legal Document Review

Research has shown that Convolutional Neural Networks (CNN) can be effectively applied to text classification as part of a predictive coding protocol. That said, most research to date has been conducted on data sets with short documents…

Information Retrieval · Computer Science 2019-12-23 Robert Keeling , Rishi Chhatwal , Nathaniel Huber-Fliflet , Jianping Zhang , Fusheng Wei , Haozhen Zhao , Shi Ye , Han Qin

Text Classification: A Sequential Reading Approach

We propose to model the text classification process as a sequential decision process. In this process, an agent learns to classify documents into topics while reading the document sentences sequentially and learns to stop as soon as enough…

Artificial Intelligence · Computer Science 2015-03-19 Gabriel Dulac-Arnold , Ludovic Denoyer , Patrick Gallinari

Computerized document classification already orders the news articles that Apple's "News" app or Google's "personalized search" feature groups together to match a reader's interests. The invisible and therefore illegible decisions that go…

Computation and Language · Computer Science 2018-12-17 Ashley Lee , Jo Guldi , Andras Zsom

Efficient Classification of Long Documents Using Transformers

Several methods have been proposed for classifying long textual documents using Transformers. However, there is a lack of consensus on a benchmark to enable a fair comparison among different approaches. In this paper, we provide a…

Computation and Language · Computer Science 2022-03-23 Hyunji Hayley Park , Yogarshi Vyas , Kashif Shah

A Framework for Explainable Text Classification in Legal Document Review

Companies regularly spend millions of dollars producing electronically-stored documents in legal matters. Recently, parties on both sides of the 'legal aisle' are accepting the use of machine learning techniques like text classification to…

Information Retrieval · Computer Science 2019-12-23 Christian J. Mahoney , Jianping Zhang , Nathaniel Huber-Fliflet , Peter Gronvall , Haozhen Zhao

Efficient Large-Scale Multi-Modal Classification

While the incipient internet was largely text-based, the modern digital world is becoming increasingly multi-modal. Here, we examine multi-modal classification where one modality is discrete, e.g. text, and the other is continuous, e.g.…

Computation and Language · Computer Science 2018-02-09 D. Kiela , E. Grave , A. Joulin , T. Mikolov

Text Classification using Artificial Intelligence

Text classification is the process of classifying documents into predefined categories based on their content. It is the automated assignment of natural language texts to predefined categories. Text classification is the primary requirement…

Information Retrieval · Computer Science 2010-09-28 S. M. Kamruzzaman

Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets

US corporations regularly spend millions of dollars reviewing electronically-stored documents in legal matters. Recently, attorneys apply text classification to efficiently cull massive volumes of data to identify responsive documents for…

Information Retrieval · Computer Science 2023-11-16 Christian Mahoney , Peter Gronvall , Nathaniel Huber-Fliflet , Jianping Zhang

Unified Line and Paragraph Detection by Graph Convolutional Networks

We formulate the task of detecting lines and paragraphs in a document into a unified two-level clustering problem. Given a set of text detection boxes that roughly correspond to words, a text line is a cluster of boxes and a paragraph is a…

Computer Vision and Pattern Recognition · Computer Science 2022-03-21 Shuang Liu , Renshen Wang , Michalis Raptis , Yasuhisa Fujii

Convolutional Neural Networks for Toxic Comment Classification

Flood of information is produced in a daily basis through the global Internet usage arising from the on-line interactive communications among users. While this situation contributes significantly to the quality of human life, unfortunately…

Computation and Language · Computer Science 2024-06-04 Spiros V. Georgakopoulos , Sotiris K. Tasoulis , Aristidis G. Vrahatis , Vassilis P. Plagianakos

Efficient strategies for hierarchical text classification: External knowledge and auxiliary tasks

In hierarchical text classification, we perform a sequence of inference steps to predict the category of a document from top to bottom of a given class taxonomy. Most of the studies have focused on developing novels neural network…

Computation and Language · Computer Science 2020-05-25 Kervy Rivas Rojas , Gina Bustamante , Arturo Oncevay , Marco A. Sobrevilla Cabezudo