Related papers: Learning Personalized Attribute Preference via Mul…

Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning

The development of largely human-annotated benchmarks has driven the success of deep neural networks in various NLP tasks. To enhance the effectiveness of existing benchmarks, collecting new additional input-output pairs is often too costly…

Computation and Language · Computer Science 2023-06-09 Jaehyung Kim , Jinwoo Shin , Dongyeop Kang

Enhancing Personalized Ranking With Differentiable Group AUC Optimization

AUC is a common metric for evaluating the performance of a classifier. However, most classifiers are trained with cross entropy, and it does not optimize the AUC metric directly, which leaves a gap between the training and evaluation stage.…

Machine Learning · Computer Science 2023-04-20 Xiao Sun , Bo Zhang , Chenrui Zhang , Han Ren , Mingchen Cai

Evolutionary Multitasking AUC Optimization

Learning to optimize the area under the receiver operating characteristics curve (AUC) performance for imbalanced data has attracted much attention in recent years. Although there have been several methods of AUC optimization, scaling up…

Machine Learning · Computer Science 2024-10-28 Chao Wang , Kai Wu , Jing Liu

Capturing Perspectives of Crowdsourced Annotators in Subjective Learning Tasks

Supervised classification heavily depends on datasets annotated by humans. However, in subjective tasks such as toxicity classification, these annotations often exhibit low agreement among raters. Annotations have commonly been aggregated…

Computation and Language · Computer Science 2024-05-17 Negar Mokhberian , Myrl G. Marmarelis , Frederic R. Hopp , Valerio Basile , Fred Morstatter , Kristina Lerman

Noise Correction on Subjective Datasets

Incorporating every annotator's perspective is crucial for unbiased data modeling. Annotator fatigue and changing opinions over time can distort dataset annotations. To combat this, we propose to learn a more accurate representation of…

Machine Learning · Computer Science 2024-06-05 Uthman Jinadu , Yi Ding

Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems

Data annotated by humans is a source of knowledge by describing the peculiarities of the problem and therefore fueling the decision process of the trained model. Unfortunately, the annotation process for subjective natural language…

Computation and Language · Computer Science 2023-12-14 Kamil Kanclerz , Julita Bielaniewicz , Marcin Gruza , Jan Kocon , Stanisław Woźniak , Przemysław Kazienko

Learning with Multiclass AUC: Theory and Algorithms

The Area under the ROC curve (AUC) is a well-known ranking metric for problems such as imbalanced learning and recommender systems. The vast majority of existing AUC-optimization-based machine learning methods only focus on binary-class…

Machine Learning · Computer Science 2021-07-29 Zhiyong Yang , Qianqian Xu , Shilong Bao , Xiaochun Cao , Qingming Huang

Post-Training Attribute Unlearning in Recommender Systems

With the growing privacy concerns in recommender systems, recommendation unlearning is getting increasing attention. Existing studies predominantly use training data, i.e., model inputs, as unlearning target. However, attackers can extract…

Information Retrieval · Computer Science 2024-10-25 Chaochao Chen , Yizhao Zhang , Yuyuan Li , Jun Wang , Lianyong Qi , Xiaolong Xu , Xiaolin Zheng , Jianwei Yin

Generalizing AUC Optimization to Multiclass Classification for Audio Segmentation With Limited Training Data

Area under the ROC curve (AUC) optimisation techniques developed for neural networks have recently demonstrated their capabilities in different audio and speech related tasks. However, due to its intrinsic nature, AUC optimisation has…

Sound · Computer Science 2021-10-28 Pablo Gimeno , Victoria Mingote , Alfonso Ortega , Antonio Miguel , Eduardo Lleida

Annotation-Efficient Language Model Alignment via Diverse and Representative Response Texts

Preference optimization is a standard approach to fine-tuning large language models to align with human preferences. The quantity, diversity, and representativeness of the preference dataset are critical to the effectiveness of preference…

Computation and Language · Computer Science 2025-09-18 Yuu Jinnai , Ukyo Honda

Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations

Majority voting and averaging are common approaches employed to resolve annotator disagreements and derive single ground truth labels from multiple annotations. However, annotators may systematically disagree with one another, often…

Computation and Language · Computer Science 2021-10-13 Aida Mostafazadeh Davani , Mark Díaz , Vinodkumar Prabhakaran

UmBERTo-MTSA @ AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations

This work describes a self-supervised data augmentation approach used to improve learning models' performances when only a moderate amount of labeled data is available. Multiple copies of the original model are initially trained on the…

Computation and Language · Computer Science 2020-12-18 Gabriele Sarti

AdaTask: A Task-aware Adaptive Learning Rate Approach to Multi-task Learning

Multi-task learning (MTL) models have demonstrated impressive results in computer vision, natural language processing, and recommender systems. Even though many approaches have been proposed, how well these approaches balance different…

Machine Learning · Computer Science 2024-05-06 Enneng Yang , Junwei Pan , Ximei Wang , Haibin Yu , Li Shen , Xihua Chen , Lei Xiao , Jie Jiang , Guibing Guo

From Common to Special: When Multi-Attribute Learning Meets Personalized Opinions

Visual attributes, which refer to human-labeled semantic annotations, have gained increasing popularity in a wide range of real world applications. Generally, the existing attribute learning methods fall into two categories: one focuses on…

Machine Learning · Computer Science 2018-08-07 Zhiyong Yang , Qianqian Xu , Xiaochun Cao , Qingming Huang

Improving Attributed Text Generation of Large Language Models via Preference Learning

Large language models have been widely adopted in natural language processing, yet they face the challenge of generating unreliable content. Recent works aim to reduce misinformation and hallucinations by resorting to attribution as a means…

Computation and Language · Computer Science 2024-03-28 Dongfang Li , Zetian Sun , Baotian Hu , Zhenyu Liu , Xinshuo Hu , Xuebo Liu , Min Zhang

Improving One-class Recommendation with Multi-tasking on Various Preference Intensities

In the one-class recommendation problem, it's required to make recommendations basing on users' implicit feedback, which is inferred from their action and inaction. Existing works obtain representations of users and items by encoding…

Information Retrieval · Computer Science 2024-01-22 Chu-Jen Shao , Hao-Ming Fu , Pu-Jen Cheng

Multi-utility Learning: Structured-output Learning with Multiple Annotation-specific Loss Functions

Structured-output learning is a challenging problem; particularly so because of the difficulty in obtaining large datasets of fully labelled instances for training. In this paper we try to overcome this difficulty by presenting a…

Computer Vision and Pattern Recognition · Computer Science 2014-06-24 Roman Shapovalov , Dmitry Vetrov , Anton Osokin , Pushmeet Kohli

Efficient AUC Optimization for Information Ranking Applications

Adequate evaluation of an information retrieval system to estimate future performance is a crucial task. Area under the ROC curve (AUC) is widely used to evaluate the generalization of a retrieval system. However, the objective function…

Information Retrieval · Computer Science 2016-04-26 Sean J. Welleck

Meta-learning Representations for Learning from Multiple Annotators

We propose a meta-learning method for learning from multiple noisy annotators. In many applications such as crowdsourcing services, labels for supervised learning are given by multiple annotators. Since the annotators have different skills…

Machine Learning · Computer Science 2025-06-13 Atsutoshi Kumagai , Tomoharu Iwata , Taishi Nishiyama , Yasutoshi Ida , Yasuhiro Fujiwara

Preference Consistency Matters: Enhancing Preference Learning in Language Models with Automated Self-Curation of Training Corpora

Inconsistent annotations in training corpora, particularly within preference learning datasets, pose challenges in developing advanced language models. These inconsistencies often arise from variability among annotators and inherent…

Computation and Language · Computer Science 2025-02-03 JoonHo Lee , JuYoun Son , Juree Seok , Wooseok Jang , Yeong-Dae Kwon