Related papers: Improved Customer Transaction Classification using…

Multi-class Text Classification using BERT-based Active Learning

Text Classification finds interesting applications in the pickup and delivery services industry where customers require one or more items to be picked up from a location and delivered to a certain destination. Classifying these customer…

Information Retrieval · Computer Science 2021-09-21 Sumanth Prabhu , Moosa Mohamed , Hemant Misra

Optimizing Multi-Class Text Classification: A Diverse Stacking Ensemble Framework Utilizing Transformers

Customer reviews play a crucial role in assessing customer satisfaction, gathering feedback, and driving improvements for businesses. Analyzing these reviews provides valuable insights into customer sentiments, including compliments,…

Computation and Language · Computer Science 2023-08-23 Anusuya Krishnan

Cross-Lingual NER for Financial Transaction Data in Low-Resource Languages

We propose an efficient modeling framework for cross-lingual named entity recognition in semi-structured text data. Our approach relies on both knowledge distillation and consistency training. The modeling framework leverages knowledge from…

Computation and Language · Computer Science 2023-07-19 Sunisth Kumar , Davide Liu , Alexandre Boulenger

A Semi-supervised Multi-task Learning Approach to Classify Customer Contact Intents

In the area of customer support, understanding customers' intents is a crucial step. Machine learning plays a vital role in this type of intent classification. In reality, it is typical to collect confirmation from customer support…

Information Retrieval · Computer Science 2021-07-30 Li Dong , Matthew C. Spencer , Amir Biagi

BERT2DNN: BERT Distillation with Massive Unlabeled Data for Online E-Commerce Search

Relevance has significant impact on user experience and business profit for e-commerce search platform. In this work, we propose a data-driven framework for search relevance prediction, by distilling knowledge from BERT and related…

Machine Learning · Computer Science 2020-10-21 Yunjiang Jiang , Yue Shang , Ziyang Liu , Hongwei Shen , Yun Xiao , Wei Xiong , Sulong Xu , Weipeng Yan , Di Jin

Simplified TinyBERT: Knowledge Distillation for Document Retrieval

Despite the effectiveness of utilizing the BERT model for document ranking, the high computational cost of such approaches limits their uses. To this end, this paper first empirically investigates the effectiveness of two knowledge…

Information Retrieval · Computer Science 2023-05-05 Xuanang Chen , Ben He , Kai Hui , Le Sun , Yingfei Sun

DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling

Pre-trained models like BERT (Devlin et al., 2018) have dominated NLP / IR applications such as single sentence classification, text pair classification, and question answering. However, deploying these models in real systems is highly…

Computation and Language · Computer Science 2021-05-06 Jiecao Chen , Liu Yang , Karthik Raman , Michael Bendersky , Jung-Jung Yeh , Yun Zhou , Marc Najork , Danyang Cai , Ehsan Emadzadeh

Distilling Dense Representations for Ranking using Tightly-Coupled Teachers

We present an approach to ranking with dense representations that applies knowledge distillation to improve the recently proposed late-interaction ColBERT model. Specifically, we distill the knowledge from ColBERT's expressive MaxSim…

Information Retrieval · Computer Science 2020-10-23 Sheng-Chieh Lin , Jheng-Hong Yang , Jimmy Lin

An Empirical Study of Uniform-Architecture Knowledge Distillation in Document Ranking

Although BERT-based ranking models have been commonly used in commercial search engines, they are usually time-consuming for online ranking tasks. Knowledge distillation, which aims at learning a smaller model with comparable performance to…

Information Retrieval · Computer Science 2023-02-09 Xubo Qin , Xiyuan Liu , Xiongfeng Zheng , Jie Liu , Yutao Zhu

Profitable Trade-Off Between Memory and Performance In Multi-Domain Chatbot Architectures

Text classification problem is a very broad field of study in the field of natural language processing. In short, the text classification problem is to determine which of the previously determined classes the given text belongs to.…

Computation and Language · Computer Science 2021-12-28 D. Emre Taşar , Şükrü Ozan , M. Fatih Akca , Oğuzhan Ölmez , Semih Gülüm , Seçilay Kutal , Ceren Belhan

On the Efficiency of Subclass Knowledge Distillation in Classification Tasks

This work introduces a novel knowledge distillation framework for classification tasks where information on existing subclasses is available and taken into consideration. In classification tasks with a small number of classes or binary…

Machine Learning · Computer Science 2022-07-06 Ahmad Sajedi , Konstantinos N. Plataniotis

Adversarial Self-Supervised Data-Free Distillation for Text Classification

Large pre-trained transformer-based language models have achieved impressive results on a wide range of NLP tasks. In the past few years, Knowledge Distillation(KD) has become a popular paradigm to compress a computationally expensive model…

Computation and Language · Computer Science 2020-10-13 Xinyin Ma , Yongliang Shen , Gongfan Fang , Chen Chen , Chenghao Jia , Weiming Lu

RecoBERT: A Catalog Language Model for Text-Based Recommendations

Language models that utilize extensive self-supervised pre-training from unlabeled text, have recently shown to significantly advance the state-of-the-art performance in a variety of language understanding tasks. However, it is yet unclear…

Information Retrieval · Computer Science 2020-09-29 Itzik Malkiel , Oren Barkan , Avi Caciularu , Noam Razin , Ori Katz , Noam Koenigstein

Larger models yield better results? Streamlined severity classification of ADHD-related concerns using BERT-based knowledge distillation

This work focuses on the efficiency of the knowledge distillation approach in generating a lightweight yet powerful BERT based model for natural language processing applications. After the model creation, we applied the resulting model,…

Computation and Language · Computer Science 2024-11-04 Ahmed Akib Jawad Karim , Kazi Hafiz Md. Asad , Md. Golam Rabiul Alam

Data-Efficient Ranking Distillation for Image Retrieval

Recent advances in deep learning has lead to rapid developments in the field of image retrieval. However, the best performing architectures incur significant computational cost. Recent approaches tackle this issue using knowledge…

Computer Vision and Pattern Recognition · Computer Science 2020-07-14 Zakaria Laskar , Juho Kannala

Text classification in shipping industry using unsupervised models and Transformer based supervised models

Obtaining labelled data in a particular context could be expensive and time consuming. Although different algorithms, including unsupervised learning, semi-supervised learning, self-learning have been adopted, the performance of text…

Computation and Language · Computer Science 2022-12-26 Ying Xie , Dongping Song

Towards Better Query Classification with Multi-Expert Knowledge Condensation in JD Ads Search

Search query classification, as an effective way to understand user intents, is of great importance in real-world online ads systems. To ensure a lower latency, a shallow model (e.g. FastText) is widely used for efficient online inference.…

Information Retrieval · Computer Science 2023-11-21 Kun-Peng Ning , Ming Pang , Zheng Fang , Xue Jiang , Xi-Wei Zhao , Chang-Ping Peng , Zhan-Gang Lin , Jing-He Hu , Jing-Ping Shao

Semi-Supervised Learning Approach to Discover Enterprise User Insights from Feedback and Support

With the evolution of the cloud and customer centric culture, we inherently accumulate huge repositories of textual reviews, feedback, and support data.This has driven enterprises to seek and research engagement patterns, user network…

Machine Learning · Computer Science 2020-07-23 Xin Deng , Ross Smith , Genevieve Quintin

A BERT-based Hierarchical Classification Model with Applications in Chinese Commodity Classification

Existing e-commerce platforms heavily rely on manual annotation for product categorization, which is inefficient and inconsistent. These platforms often employ a hierarchical structure for categorizing products; however, few studies have…

Computation and Language · Computer Science 2025-08-26 Kun Liu , Tuozhen Liu , Feifei Wang , Rui Pan

MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models

Knowledge distillation is an effective technique for pre-trained language model compression. Although existing knowledge distillation methods perform well for the most typical model BERT, they could be further improved in two aspects: the…

Computation and Language · Computer Science 2024-07-04 Ying Zhang , Ziheng Yang , Shufan Ji