Related papers: Multi-class Text Classification using BERT-based A…

Profitable Trade-Off Between Memory and Performance In Multi-Domain Chatbot Architectures

Text classification problem is a very broad field of study in the field of natural language processing. In short, the text classification problem is to determine which of the previously determined classes the given text belongs to.…

Computation and Language · Computer Science 2021-12-28 D. Emre Taşar , Şükrü Ozan , M. Fatih Akca , Oğuzhan Ölmez , Semih Gülüm , Seçilay Kutal , Ceren Belhan

Enhancing Text Classification with a Novel Multi-Agent Collaboration Framework Leveraging BERT

We introduce a novel multi-agent collaboration framework designed to enhance the accuracy and robustness of text classification models. Leveraging BERT as the primary classifier, our framework dynamically escalates low-confidence…

Computation and Language · Computer Science 2025-02-27 Hediyeh Baban , Sai A Pidapar , Aashutosh Nema , Sichen Lu

Transfer Learning Robustness in Multi-Class Categorization by Fine-Tuning Pre-Trained Contextualized Language Models

This study compares the effectiveness and robustness of multi-class categorization of Amazon product data using transfer learning on pre-trained contextualized language models. Specifically, we fine-tuned BERT and XLNet, two bidirectional…

Machine Learning · Statistics 2019-09-24 Xinyi Liu , Artit Wangperawong

Improved Customer Transaction Classification using Semi-Supervised Knowledge Distillation

In pickup and delivery services, transaction classification based on customer provided free text is a challenging problem. It involves the association of a wide variety of customer inputs to a fixed set of categories while adapting to the…

Computation and Language · Computer Science 2021-02-16 Rohan Sukumaran

Applying LLMs to Active Learning: Towards Cost-Efficient Cross-Task Text Classification without Manually Labeled Data

Machine learning-based classifiers have been used for text classification, such as sentiment analysis, news classification, and toxic comment classification. However, supervised machine learning models often require large amounts of labeled…

Computation and Language · Computer Science 2025-05-06 Yejian Zhang , Shingo Takada

Classifying Math KCs via Task-Adaptive Pre-Trained BERT

Educational content labeled with proper knowledge components (KCs) are particularly useful to teachers or content organizers. However, manually labeling educational content is labor intensive and error-prone. To address this challenge,…

Computation and Language · Computer Science 2021-05-25 Jia Tracy Shen , Michiharu Yamashita , Ethan Prihar , Neil Heffernan , Xintao Wu , Sean McGrew , Dongwon Lee

Automated classification for open-ended questions with BERT

Manual coding of text data from open-ended questions into different categories is time consuming and expensive. Automated coding uses statistical/machine learning to train on a small subset of manually coded text answers. Recently,…

Applications · Statistics 2023-10-25 Hyukjun Gweon , Matthias Schonlau

An Automated Knowledge Mining and Document Classification System with Multi-model Transfer Learning

Service manual documents are crucial to the engineering company as they provide guidelines and knowledge to service engineers. However, it has become inconvenient and inefficient for service engineers to retrieve specific knowledge from…

Computation and Language · Computer Science 2021-06-25 Jia Wei Chong , Zhiyuan Chen , Mei Shin Oh

Investigating the Effectiveness of Representations Based on Word-Embeddings in Active Learning for Labelling Text Datasets

Manually labelling large collections of text data is a time-consuming, expensive, and laborious task, but one that is necessary to support machine learning based on text datasets. Active learning has been shown to be an effective way to…

Computation and Language · Computer Science 2019-10-11 Jinghui Lu , Maeve Henchion , Brian Mac Namee

Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations

Although BERT is widely used by the NLP community, little is known about its inner workings. Several attempts have been made to shed light on certain aspects of BERT, often with contradicting conclusions. A much raised concern focuses on…

Computation and Language · Computer Science 2020-10-13 Nikolaos Manginas , Ilias Chalkidis , Prodromos Malakasiotis

Understanding the Behaviors of BERT in Ranking

This paper studies the performances and behaviors of BERT in ranking tasks. We explore several different ways to leverage the pre-trained BERT and fine-tune it on two ranking tasks: MS MARCO passage reranking and TREC Web Track ad hoc…

Information Retrieval · Computer Science 2019-04-29 Yifan Qiao , Chenyan Xiong , Zhenghao Liu , Zhiyuan Liu

A Simple yet Brisk and Efficient Active Learning Platform for Text Classification

In this work, we propose the use of a fully managed machine learning service, which utilizes active learning to directly build models from unstructured data. With this tool, business users can quickly and easily build machine learning…

Machine Learning · Computer Science 2021-02-02 Teja Kanchinadam , Qian You , Keith Westpfahl , James Kim , Siva Gunda , Sebastian Seith , Glenn Fung

Hierarchical Transformers for Long Document Classification

BERT, which stands for Bidirectional Encoder Representations from Transformers, is a recently introduced language representation model based upon the transfer learning paradigm. We extend its fine-tuning procedure to address one of its…

Computation and Language · Computer Science 2019-10-25 Raghavendra Pappagari , Piotr Żelasko , Jesús Villalba , Yishay Carmiel , Najim Dehak

Cold-start Active Learning through Self-supervised Language Modeling

Active learning strives to reduce annotation costs by choosing the most critical examples to label. Typically, the active learning strategy is contingent on the classification model. For instance, uncertainty sampling depends on poorly…

Computation and Language · Computer Science 2020-10-26 Michelle Yuan , Hsuan-Tien Lin , Jordan Boyd-Graber

Linear Classifier: An Often-Forgotten Baseline for Text Classification

Large-scale pre-trained language models such as BERT are popular solutions for text classification. Due to the superior performance of these advanced methods, nowadays, people often directly train them for a few epochs and deploy the…

Computation and Language · Computer Science 2023-06-13 Yu-Chen Lin , Si-An Chen , Jie-Jyun Liu , Chih-Jen Lin

Multi-Task Learning of Query Intent and Named Entities using Transfer Learning

Named entity recognition (NER) has been studied extensively and the earlier algorithms were based on sequence labeling like Hidden Markov Models (HMM) and conditional random fields (CRF). These were followed by neural network based deep…

Computation and Language · Computer Science 2021-05-10 Shalin Shah , Ryan Siskind

A BERT-based Hierarchical Classification Model with Applications in Chinese Commodity Classification

Existing e-commerce platforms heavily rely on manual annotation for product categorization, which is inefficient and inconsistent. These platforms often employ a hierarchical structure for categorizing products; however, few studies have…

Computation and Language · Computer Science 2025-08-26 Kun Liu , Tuozhen Liu , Feifei Wang , Rui Pan

Investigating the Effectiveness of Representations Based on Pretrained Transformer-based Language Models in Active Learning for Labelling Text Datasets

Active learning has been shown to be an effective way to alleviate some of the effort required in utilising large collections of unlabelled data for machine learning tasks without needing to fully label them. The representation mechanism…

Information Retrieval · Computer Science 2020-04-29 Jinghui Lu , Brian MacNamee

An Empirical Study of Multi-Task Learning on BERT for Biomedical Text Mining

Multi-task learning (MTL) has achieved remarkable success in natural language processing applications. In this work, we study a multi-task learning model with multiple decoders on varieties of biomedical and clinical natural language…

Computation and Language · Computer Science 2020-05-07 Yifan Peng , Qingyu Chen , Zhiyong Lu

Towards Better Query Classification with Multi-Expert Knowledge Condensation in JD Ads Search

Search query classification, as an effective way to understand user intents, is of great importance in real-world online ads systems. To ensure a lower latency, a shallow model (e.g. FastText) is widely used for efficient online inference.…

Information Retrieval · Computer Science 2023-11-21 Kun-Peng Ning , Ming Pang , Zheng Fang , Xue Jiang , Xi-Wei Zhao , Chang-Ping Peng , Zhan-Gang Lin , Jing-He Hu , Jing-Ping Shao