Related papers: hep-th

Identifying the Development and Application of Artificial Intelligence in Scientific Text

We describe a strategy for identifying the universe of research publications relevant to the application and development of artificial intelligence. The approach leverages the arXiv corpus of scientific preprints, in which authors choose…

Digital Libraries · Computer Science 2020-05-29 James Dunham , Jennifer Melot , Dewey Murdick

Hierarchical Attentional Hybrid Neural Networks for Document Classification

Document classification is a challenging task with important applications. The deep learning approaches to the problem have gained much attention recently. Despite the progress, the proposed models do not incorporate the knowledge of the…

Computation and Language · Computer Science 2019-10-15 Jader Abreu , Luis Fred , David Macêdo , Cleber Zanchettin

Hierarchical Classification of Research Fields in the "Web of Science" Using Deep Learning

This paper presents a hierarchical classification system that automatically categorizes a scholarly publication using its abstract into a three-tier hierarchical label set (discipline, field, subfield) in a multi-class setting. This system…

Digital Libraries · Computer Science 2024-07-26 Susie Xi Rao , Peter H. Egger , Ce Zhang

Scientific Paper Classification Based on Graph Neural Network with Hypergraph Self-attention Mechanism

The number of scientific papers has increased rapidly in recent years. How to make good use of scientific papers for research is very important. Through the high-quality classification of scientific papers, researchers can quickly find the…

Information Retrieval · Computer Science 2022-10-10 Jiashun Liu , Zhe Xue , Ang Li

Classification and Clustering of arXiv Documents, Sections, and Abstracts, Comparing Encodings of Natural and Mathematical Language

In this paper, we show how selecting and combining encodings of natural and mathematical language affect classification and clustering of documents with mathematical content. We demonstrate this by using sets of documents, sections, and…

Digital Libraries · Computer Science 2020-05-25 Philipp Scharpf , Moritz Schubotz , Abdou Youssef , Felix Hamborg , Norman Meuschke , Bela Gipp

Efficient strategies for hierarchical text classification: External knowledge and auxiliary tasks

In hierarchical text classification, we perform a sequence of inference steps to predict the category of a document from top to bottom of a given class taxonomy. Most of the studies have focused on developing novels neural network…

Computation and Language · Computer Science 2020-05-25 Kervy Rivas Rojas , Gina Bustamante , Arturo Oncevay , Marco A. Sobrevilla Cabezudo

Predicting Research Trends From Arxiv

We perform trend detection on two datasets of Arxiv papers, derived from its machine learning (cs.LG) and natural language processing (cs.CL) categories. Our approach is bottom-up: we first rank papers by their normalized citation counts,…

Computation and Language · Computer Science 2019-03-08 Steffen Eger , Chao Li , Florian Netzer , Iryna Gurevych

Automated Research Article Classification and Recommendation Using NLP and ML

In the digital era, the exponential growth of scientific publications has made it increasingly difficult for researchers to efficiently identify and access relevant work. This paper presents an automated framework for research article…

Information Retrieval · Computer Science 2025-10-08 Shadikur Rahman , Hasibul Karim Shanto , Umme Ayman Koana , Syed Muhammad Danish

Efficient Character-level Document Classification by Combining Convolution and Recurrent Layers

Document classification tasks were primarily tackled at word level. Recent research that works with character-level inputs shows several benefits over word-level approaches such as natural incorporation of morphemes and better handling of…

Computation and Language · Computer Science 2016-02-02 Yijun Xiao , Kyunghyun Cho

Improving Human Text Comprehension through Semi-Markov CRF-based Neural Section Title Generation

Titles of short sections within long documents support readers by guiding their focus towards relevant passages and by providing anchor-points that help to understand the progression of the document. The positive effects of section titles…

Computation and Language · Computer Science 2019-04-16 Sebastian Gehrmann , Steven Layne , Franck Dernoncourt

Text Classification using Graph Convolutional Networks: A Comprehensive Survey

Text classification is a quintessential and practical problem in natural language processing with applications in diverse domains such as sentiment analysis, fake news detection, medical diagnosis, and document classification. A sizable…

Computation and Language · Computer Science 2024-10-15 Syed Mustafa Haider Rizvi , Ramsha Imran , Arif Mahmood

Beyond original Research Articles Categorization via NLP

This work proposes a novel approach to text categorization -- for unknown categories -- in the context of scientific literature, using Natural Language Processing techniques. The study leverages the power of pre-trained language models,…

Computation and Language · Computer Science 2023-09-14 Rosanna Turrisi

On the Use of ArXiv as a Dataset

The arXiv has collected 1.5 million pre-print articles over 28 years, hosting literature from scientific fields including Physics, Mathematics, and Computer Science. Each pre-print features text, figures, authors, citations, categories, and…

Information Retrieval · Computer Science 2019-05-02 Colin B. Clement , Matthew Bierbaum , Kevin P. O'Keeffe , Alexander A. Alemi

SECTOR: A Neural Model for Coherent Topic Segmentation and Classification

When searching for information, a human reader first glances over a document, spots relevant sections and then focuses on a few sentences for resolving her intention. However, the high variance of document structure complicates to identify…

Computation and Language · Computer Science 2019-02-14 Sebastian Arnold , Rudolf Schneider , Philippe Cudré-Mauroux , Felix A. Gers , Alexander Löser

Understanding the Logical and Semantic Structure of Large Documents

Current language understanding approaches focus on small documents, such as newswire articles, blog posts, product reviews and discussion forum entries. Understanding and extracting information from large documents like legal briefs,…

Computation and Language · Computer Science 2017-09-05 Muhammad Mahbubur Rahman , Tim Finin

Multi-Perspective Relevance Matching with Hierarchical ConvNets for Social Media Search

Despite substantial interest in applications of neural networks to information retrieval, neural ranking models have only been applied to standard ad hoc retrieval tasks over web pages and newswire documents. This paper proposes MP-HCNN…

Information Retrieval · Computer Science 2019-06-25 Jinfeng Rao , Wei Yang , Yuhao Zhang , Ferhan Ture , Jimmy Lin

Semantic Hypergraphs

Approaches to Natural language processing (NLP) may be classified along a double dichotomy open/opaque - strict/adaptive. The former axis relates to the possibility of inspecting the underlying processing rules, the latter to the use of…

Information Retrieval · Computer Science 2021-02-19 Telmo Menezes , Camille Roth

Recurrent Neural Networks with Mixed Hierarchical Structures for Natural Language Processing

Hierarchical structures exist in both linguistics and Natural Language Processing (NLP) tasks. How to design RNNs to learn hierarchical representations of natural languages remains a long-standing challenge. In this paper, we define two…

Computation and Language · Computer Science 2021-06-07 Zhaoxin Luo , Michael Zhu

A semantic hierarchical graph neural network for text classification

The key to the text classification task is language representation and important information extraction, and there are many related studies. In recent years, the research on graph neural network (GNN) in text classification has gradually…

Computation and Language · Computer Science 2022-09-16 Shuai Hua , Xinxin Li , Yunpeng Jing , Qunfeng Liu

Hierarchical Bi-Directional Self-Attention Networks for Paper Review Rating Recommendation

Review rating prediction of text reviews is a rapidly growing technology with a wide range of applications in natural language processing. However, most existing methods either use hand-crafted features or learn features using deep learning…

Computation and Language · Computer Science 2020-11-03 Zhongfen Deng , Hao Peng , Congying Xia , Jianxin Li , Lifang He , Philip S. Yu