Related papers: Multitask Learning for Class-Imbalanced Discourse …

A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing

Many natural language processing (NLP) tasks are naturally imbalanced, as some target categories occur much more frequently than others in the real world. In such scenarios, current NLP models still tend to perform poorly on less frequent…

Computation and Language · Computer Science 2023-02-23 Sophie Henning, William Beluch, Alexander Fraser, Annemarie Friedrich

Imbalanced multi-label classification using multi-task learning with extractive summarization

Extractive summarization and imbalanced multi-label classification often require vast amounts of training data to avoid overfitting. In situations where training data is expensive to generate, leveraging information between tasks is an…

Computation and Language · Computer Science 2019-03-19 John Brandt

Fairness-aware Class Imbalanced Learning

Class imbalance is a common challenge in many NLP tasks, and has clear connections to bias, in that bias in training data often leads to higher accuracy for majority groups at the expense of minority groups. However, there has traditionally…

Computation and Language · Computer Science 2021-09-23 Shivashankar Subramanian, Afshin Rahimi, Timothy Baldwin, Trevor Cohn, Lea Frermann

Striking a Balance: Alleviating Inconsistency in Pre-trained Models for Symmetric Classification Tasks

While fine-tuning pre-trained models for downstream classification is the conventional paradigm in NLP, often task-specific nuances may not get captured in the resultant models. Specifically, for tasks that take two inputs and require the…

Computation and Language · Computer Science 2022-03-28 Ashutosh Kumar, Aditya Joshi

Imbalanced Classification via Explicit Gradient Learning From Augmented Data

Learning from imbalanced data is one of the most significant challenges in real-world classification tasks. In such cases, neural networks' performance is substantially impaired due to a preference towards the majority class. Existing…

Machine Learning · Computer Science 2022-11-13 Bronislav Yasinnik, Moshe Salhov, Ofir Lindenbaum, Amir Averbuch

Multitask Learning for Low Resource Spoken Language Understanding

We explore the benefits that multitask learning offers to speech processing as we train models on dual objectives with automatic speech recognition and intent classification or sentiment classification. Our models, although being of modest…

Computation and Language · Computer Science 2022-11-28 Quentin Meeus, Marie-Francine Moens, Hugo Van hamme

Order Matters in the Presence of Dataset Imbalance for Multilingual Learning

In this paper, we empirically study the optimization dynamics of multi-task learning, particularly focusing on those that govern a collection of tasks with significant data imbalance. We present a simple yet effective method of pre-training…

Computation and Language · Computer Science 2023-12-12 Dami Choi, Derrick Xin, Hamid Dadkhahi, Justin Gilmer, Ankush Garg, Orhan Firat, Chih-Kuan Yeh, Andrew M. Dai, Behrooz Ghorbani

Handling Class Imbalance in Low-Resource Dialogue Systems by Combining Few-Shot Classification and Interpolation

Utterance classification performance in low-resource dialogue systems is constrained by an inevitably high degree of data imbalance in class labels. We present a new end-to-end pairwise learning framework that is designed specifically to…

Computation and Language · Computer Science 2020-10-29 Vishal Sunder, Eric Fosler-Lussier

Learning to Paraphrase Sentences to Different Complexity Levels

While sentence simplification is an active research topic in NLP, its adjacent tasks of sentence complexification and same-level paraphrasing are not. To train models on all three tasks, we present two new unsupervised datasets. We compare…

Computation and Language · Computer Science 2023-11-22 Alison Chi, Li-Kuang Chen, Yi-Chen Chang, Shu-Hui Lee, Jason S. Chang

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution

Multi-label text classification is a challenging task because it requires capturing label dependencies. It becomes even more challenging when class distribution is long-tailed. Resampling and re-weighting are common approaches used for…

Computation and Language · Computer Science 2021-10-19 Yi Huang, Buse Giledereli, Abdullatif Köksal, Arzucan Özgür, Elif Ozkirimli

Multiple Relations Classification using Imbalanced Predictions Adaptation

The relation classification task assigns the proper semantic relation to a pair of subject and object entities; the task plays a crucial role in various text mining applications, such as knowledge graph construction and entities interaction…

Computation and Language · Computer Science 2023-09-26 Sakher Khalil Alqaaidi, Elika Bozorgi, Krzysztof J. Kochut

M2m: Imbalanced Classification via Major-to-minor Translation

In most real-world scenarios, labeled training datasets are highly class-imbalanced, where deep neural networks suffer from generalizing to a balanced testing criterion. In this paper, we explore a novel yet simple way to alleviate this…

Computer Vision and Pattern Recognition · Computer Science 2020-12-22 Jaehyung Kim, Jongheon Jeong, Jinwoo Shin

Optimizing Deep Learning Models to Address Class Imbalance in Code Comment Classification

Developers rely on code comments to document their work, track issues, and understand the source code. As such, comments provide valuable insights into developers' understanding of their code and describe their various intentions in writing…

Software Engineering · Computer Science 2025-07-03 Moritz Mock, Thomas Borsani, Giuseppe Di Fatta, Barbara Russo

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Large pretrained language models (LMs) like BERT have improved performance in many disparate natural language processing (NLP) tasks. However, fine-tuning such models requires a large number of training examples for each target task.…

Computation and Language · Computer Science 2022-01-28 Jixuan Wang, Kuan-Chieh Wang, Frank Rudzicz, Michael Brudno

Survey of resampling techniques for improving classification performance in unbalanced datasets

A number of classification problems need to deal with data imbalance between classes. Often it is desired to have a high recall on the minority class while maintaining a high precision on the majority class. In this paper, we review a…

Applications · Statistics 2016-08-23 Ajinkya More

Multi-Task Learning for Argumentation Mining in Low-Resource Settings

We investigate whether and where multi-task learning (MTL) can improve performance on NLP problems related to argumentation mining (AM), in particular argument component identification. Our results show that MTL performs particularly well…

Computation and Language · Computer Science 2018-05-07 Claudia Schulz, Steffen Eger, Johannes Daxenberger, Tobias Kahse, Iryna Gurevych

Multi-Task Learning for Robot Perception with Imbalanced Data

Multi-task problem solving has been shown to improve the accuracy of the individual tasks, which is an important feature for robots, as they have limited resources. However, when the number of labels for each task is not equal, namely…

Robotics · Computer Science 2026-02-03 Ozgur Erkent

Enhancing Science Classroom Discourse Analysis through Joint Multi-Task Learning for Reasoning-Component Classification

Analyzing the reasoning patterns of students in science classrooms is critical for understanding knowledge construction mechanism and improving instructional practice to maximize cognitive engagement, yet manual coding of classroom…

Computation and Language · Computer Science 2026-05-08 Jiho Noh, Mukhesh Raghava Katragadda, Raymond Carl, Soon Lee

Multi-Task Learning for Conversational Question Answering over a Large-Scale Knowledge Base

We consider the problem of conversational question answering over a large-scale knowledge base. To handle the huge entity vocabulary of a large-scale knowledge base, recent neural semantic parsing based approaches usually decompose the task…

Computation and Language · Computer Science 2019-10-14 Tao Shen, Xiubo Geng, Tao Qin, Daya Guo, Duyu Tang, Nan Duan, Guodong Long, Daxin Jiang

Improve Meta-learning for Few-Shot Text Classification with All You Can Acquire from the Tasks

Meta-learning has emerged as a prominent technology for few-shot text classification and has achieved promising performance. However, existing methods often encounter difficulties in drawing accurate class prototypes from support set…

Computer Vision and Pattern Recognition · Computer Science 2024-10-15 Xinyue Liu, Yunlong Gao, Linlin Zong, Bo Xu