Related papers: Imbalanced multi-label classification using multi-…

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution

Multi-label text classification is a challenging task because it requires capturing label dependencies. It becomes even more challenging when class distribution is long-tailed. Resampling and re-weighting are common approaches used for…

Computation and Language · Computer Science 2021-10-19 Yi Huang , Buse Giledereli , Abdullatif Köksal , Arzucan Özgür , Elif Ozkirimli

In real-world applications, as data availability increases, obtaining labeled data for machine learning (ML) projects remains challenging due to the high costs and intensive efforts required for data annotation. Many ML projects,…

Machine Learning · Computer Science 2024-12-24 Ismail Hakki Karaman , Gulser Koksal , Levent Eriskin , Salih Salihoglu

Extrapolated Markov Chain Oversampling Method for Imbalanced Text Classification

Text classification is the task of automatically assigning text documents correct labels from a predefined set of categories. In real-life (text) classification tasks, observations and misclassification costs are often unevenly distributed…

Machine Learning · Computer Science 2025-09-03 Aleksi Avela , Pauliina Ilmonen

M2m: Imbalanced Classification via Major-to-minor Translation

In most real-world scenarios, labeled training datasets are highly class-imbalanced, where deep neural networks suffer from generalizing to a balanced testing criterion. In this paper, we explore a novel yet simple way to alleviate this…

Computer Vision and Pattern Recognition · Computer Science 2020-12-22 Jaehyung Kim , Jongheon Jeong , Jinwoo Shin

Noised Consistency Training for Text Summarization

Neural abstractive summarization methods often require large quantities of labeled training data. However, labeling large amounts of summarization data is often prohibitive due to time, financial, and expertise constraints, which has…

Computation and Language · Computer Science 2022-02-09 Junnan Liu , Qianren Mao , Bang Liu , Hao Peng , Hongdong Zhu , Jianxin Li

Retrieval-augmented Multi-label Text Classification

Multi-label text classification (MLC) is a challenging task in settings of large label sets, where label support follows a Zipfian distribution. In this paper, we address this problem through retrieval augmentation, aiming to improve the…

Computation and Language · Computer Science 2023-05-23 Ilias Chalkidis , Yova Kementchedjhieva

Multi-task Learning for Universal Sentence Embeddings: A Thorough Evaluation using Transfer and Auxiliary Tasks

Learning distributed sentence representations is one of the key challenges in natural language processing. Previous work demonstrated that a recurrent neural network (RNNs) based sentence encoder trained on a large collection of annotated…

Computation and Language · Computer Science 2018-08-20 Wasi Uddin Ahmad , Xueying Bai , Zhechao Huang , Chao Jiang , Nanyun Peng , Kai-Wei Chang

Exploring Multitask Learning for Low-Resource AbstractiveSummarization

This paper explores the effect of using multitask learning for abstractive summarization in the context of small training corpora. In particular, we incorporate four different tasks (extractive summarization, language modeling, concept…

Computation and Language · Computer Science 2021-09-20 Ahmed Magooda , Mohamed Elaraby , Diane Litman

Multitask Learning for Class-Imbalanced Discourse Classification

Small class-imbalanced datasets, common in many high-level semantic tasks like discourse analysis, present a particular challenge to current deep-learning architectures. In this work, we perform an extensive analysis on sentence-level…

Computation and Language · Computer Science 2021-01-05 Alexander Spangher , Jonathan May , Sz-rung Shiang , Lingjia Deng

Multi-Task Label Embedding for Text Classification

Multi-task learning in text classification leverages implicit correlations among related tasks to extract common features and yield performance gains. However, most previous works treat labels of each task as independent and meaningless…

Computation and Language · Computer Science 2017-10-20 Honglun Zhang , Liqiang Xiao , Wenqing Chen , Yongkun Wang , Yaohui Jin

Multi-Task Learning for Robot Perception with Imbalanced Data

Multi-task problem solving has been shown to improve the accuracy of the individual tasks, which is an important feature for robots, as they have a limited resource. However, when the number of labels for each task is not equal, namely…

Robotics · Computer Science 2026-02-03 Ozgur Erkent

Enhancing Coherence of Extractive Summarization with Multitask Learning

This study proposes a multitask learning architecture for extractive summarization with coherence boosting. The architecture contains an extractive summarizer and coherent discriminator module. The coherent discriminator is trained online…

Computation and Language · Computer Science 2023-07-24 Renlong Jie , Xiaojun Meng , Lifeng Shang , Xin Jiang , Qun Liu

Learning Multi-Tasks with Inconsistent Labels by using Auxiliary Big Task

Multi-task learning is to improve the performance of the model by transferring and exploiting common knowledge among tasks. Existing MTL works mainly focus on the scenario where label sets among multiple tasks (MTs) are usually the same,…

Machine Learning · Computer Science 2022-01-10 Quan Feng , Songcan Chen

Enhancement Encoding: A Novel Imbalanced Classification Approach via Encoding the Training Labels

Class imbalance, which is also called long-tailed distribution, is a common problem in classification tasks based on machine learning. If it happens, the minority data will be overwhelmed by the majority, which presents quite a challenge…

Machine Learning · Computer Science 2023-03-29 Jia-Chen Zhao

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Many tasks in natural language processing can be viewed as multi-label classification problems. However, most of the existing models are trained with the standard cross-entropy loss function and use a fixed prediction policy (e.g., a…

Computation and Language · Computer Science 2019-09-11 Jiawei Wu , Wenhan Xiong , William Yang Wang

Towards Imbalanced Large Scale Multi-label Classification with Partially Annotated Labels

Multi-label classification is a widely encountered problem in daily life, where an instance can be associated with multiple classes. In theory, this is a supervised learning method that requires a large amount of labeling. However,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-02 XIn Zhang , Yuqi Song , Fei Zuo , Xiaofeng Wang

Multiple Relations Classification using Imbalanced Predictions Adaptation

The relation classification task assigns the proper semantic relation to a pair of subject and object entities; the task plays a crucial role in various text mining applications, such as knowledge graph construction and entities interaction…

Computation and Language · Computer Science 2023-09-26 Sakher Khalil Alqaaidi , Elika Bozorgi , Krzysztof J. Kochut

The Multiplex Classification Framework: optimizing multi-label classifiers through problem transformation, ontology engineering, and model ensembling

Classification is a fundamental task in machine learning. While conventional methods-such as binary, multiclass, and multi-label classification-are effective for simpler problems, they may not adequately address the complexities of some…

Machine Learning · Computer Science 2024-12-20 Mauro Nievas Offidani , Facundo Roffet , Claudio Augusto Delrieux , Maria Carolina Gonzalez Galtier , Marcos Zarate

Multi-task Learning of Pairwise Sequence Classification Tasks Over Disparate Label Spaces

We combine multi-task learning and semi-supervised learning by inducing a joint embedding space between disparate label spaces and learning transfer functions between label embeddings, enabling us to jointly leverage unlabelled data and…

Computation and Language · Computer Science 2018-04-10 Isabelle Augenstein , Sebastian Ruder , Anders Søgaard

Enhancing Extractive Text Summarization with Topic-Aware Graph Neural Networks

Text summarization aims to compress a textual document to a short summary while keeping salient information. Extractive approaches are widely used in text summarization because of their fluency and efficiency. However, most of existing…

Computation and Language · Computer Science 2020-10-14 Peng Cui , Le Hu , Yuanchao Liu