Related papers: Multi-class Text Classification using BERT-based A…

200 papers

Text classification is a broad area of study within natural language processing. In short, the task is to determine which of a set of predefined classes a given text belongs to.…

Computation and Language · Computer Science 2021-12-28 D. Emre Taşar, Şükrü Ozan, M. Fatih Akca, Oğuzhan Ölmez, Semih Gülüm, Seçilay Kutal, Ceren Belhan

We introduce a novel multi-agent collaboration framework designed to enhance the accuracy and robustness of text classification models. Leveraging BERT as the primary classifier, our framework dynamically escalates low-confidence…

Computation and Language · Computer Science 2025-02-27 Hediyeh Baban, Sai A Pidapar, Aashutosh Nema, Sichen Lu

This study compares the effectiveness and robustness of multi-class categorization of Amazon product data using transfer learning on pre-trained contextualized language models. Specifically, we fine-tuned BERT and XLNet, two bidirectional…

Machine Learning · Statistics 2019-09-24 Xinyi Liu, Artit Wangperawong

In pickup and delivery services, transaction classification based on customer-provided free text is a challenging problem. It involves the association of a wide variety of customer inputs to a fixed set of categories while adapting to the…

Computation and Language · Computer Science 2021-02-16 Rohan Sukumaran

Machine learning-based classifiers have been used for text classification, such as sentiment analysis, news classification, and toxic comment classification. However, supervised machine learning models often require large amounts of labeled…

Computation and Language · Computer Science 2025-05-06 Yejian Zhang, Shingo Takada

Educational content labeled with proper knowledge components (KCs) is particularly useful to teachers or content organizers. However, manually labeling educational content is labor-intensive and error-prone. To address this challenge,…

Computation and Language · Computer Science 2021-05-25 Jia Tracy Shen, Michiharu Yamashita, Ethan Prihar, Neil Heffernan, Xintao Wu, Sean McGrew, Dongwon Lee

Manual coding of text data from open-ended questions into different categories is time-consuming and expensive. Automated coding uses statistical/machine learning to train on a small subset of manually coded text answers. Recently,…

Applications · Statistics 2023-10-25 Hyukjun Gweon, Matthias Schonlau

Service manual documents are crucial to the engineering company as they provide guidelines and knowledge to service engineers. However, it has become inconvenient and inefficient for service engineers to retrieve specific knowledge from…

Computation and Language · Computer Science 2021-06-25 Jia Wei Chong, Zhiyuan Chen, Mei Shin Oh

Manually labelling large collections of text data is a time-consuming, expensive, and laborious task, but one that is necessary to support machine learning based on text datasets. Active learning has been shown to be an effective way to…

Computation and Language · Computer Science 2019-10-11 Jinghui Lu, Maeve Henchion, Brian Mac Namee

Although BERT is widely used by the NLP community, little is known about its inner workings. Several attempts have been made to shed light on certain aspects of BERT, often with contradictory conclusions. A much-raised concern focuses on…

Computation and Language · Computer Science 2020-10-13 Nikolaos Manginas, Ilias Chalkidis, Prodromos Malakasiotis

This paper studies the performance and behavior of BERT in ranking tasks. We explore several different ways to leverage the pre-trained BERT and fine-tune it on two ranking tasks: MS MARCO passage reranking and TREC Web Track ad hoc…

Information Retrieval · Computer Science 2019-04-29 Yifan Qiao, Chenyan Xiong, Zhenghao Liu, Zhiyuan Liu

In this work, we propose the use of a fully managed machine learning service, which utilizes active learning to directly build models from unstructured data. With this tool, business users can quickly and easily build machine learning…

Machine Learning · Computer Science 2021-02-02 Teja Kanchinadam, Qian You, Keith Westpfahl, James Kim, Siva Gunda, Sebastian Seith, Glenn Fung

BERT, which stands for Bidirectional Encoder Representations from Transformers, is a recently introduced language representation model based upon the transfer learning paradigm. We extend its fine-tuning procedure to address one of its…

Computation and Language · Computer Science 2019-10-25 Raghavendra Pappagari, Piotr Żelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak

Active learning strives to reduce annotation costs by choosing the most critical examples to label. Typically, the active learning strategy is contingent on the classification model. For instance, uncertainty sampling depends on poorly…

Computation and Language · Computer Science 2020-10-26 Michelle Yuan, Hsuan-Tien Lin, Jordan Boyd-Graber

Large-scale pre-trained language models such as BERT are popular solutions for text classification. Due to the superior performance of these advanced methods, nowadays, people often directly train them for a few epochs and deploy the…

Computation and Language · Computer Science 2023-06-13 Yu-Chen Lin, Si-An Chen, Jie-Jyun Liu, Chih-Jen Lin

Named entity recognition (NER) has been studied extensively, and the earlier algorithms were based on sequence labeling models such as Hidden Markov Models (HMM) and conditional random fields (CRF). These were followed by neural network based deep…

Computation and Language · Computer Science 2021-05-10 Shalin Shah, Ryan Siskind

Existing e-commerce platforms heavily rely on manual annotation for product categorization, which is inefficient and inconsistent. These platforms often employ a hierarchical structure for categorizing products; however, few studies have…

Computation and Language · Computer Science 2025-08-26 Kun Liu, Tuozhen Liu, Feifei Wang, Rui Pan

Active learning has been shown to be an effective way to alleviate some of the effort required in utilising large collections of unlabelled data for machine learning tasks without needing to fully label them. The representation mechanism…

Information Retrieval · Computer Science 2020-04-29 Jinghui Lu, Brian MacNamee

Multi-task learning (MTL) has achieved remarkable success in natural language processing applications. In this work, we study a multi-task learning model with multiple decoders on varieties of biomedical and clinical natural language…

Computation and Language · Computer Science 2020-05-07 Yifan Peng, Qingyu Chen, Zhiyong Lu

Search query classification, as an effective way to understand user intent, is of great importance in real-world online ad systems. To ensure lower latency, a shallow model (e.g. FastText) is widely used for efficient online inference.…

Information Retrieval · Computer Science 2023-11-21 Kun-Peng Ning, Ming Pang, Zheng Fang, Xue Jiang, Xi-Wei Zhao, Chang-Ping Peng, Zhan-Gang Lin, Jing-He Hu, Jing-Ping Shao