Related papers: Avoiding spurious correlations via logit correctio…

200 papers

Spurious correlations pose a major challenge for robust machine learning. Models trained with empirical risk minimization (ERM) may learn to rely on correlations between class labels and spurious attributes, leading to poor performance on…

Machine Learning · Computer Science 2024-12-12 Michael Zhang, Nimit S. Sohoni, Hongyang R. Zhang, Chelsea Finn, Christopher Ré

Machine learning models are known to learn spurious correlations, i.e., features having strong relations with class labels but no causal relation. Relying on those correlations leads to poor performance in the data groups without these…

Machine Learning · Computer Science 2026-04-28 Phuong Quynh Le, Jörg Schlötterer, Christin Seifert

Standard empirical risk minimization (ERM) models may prioritize learning spurious correlations between spurious features and true labels, leading to poor accuracy on groups where these correlations do not hold. Mitigating this issue often…

Machine Learning · Computer Science 2024-06-05 Yujin Han, Difan Zou

Deep learning has seen widespread success in various domains such as science, industry, and society. However, it is acknowledged that certain approaches suffer from non-robustness, relying on spurious correlations for predictions.…

Machine Learning · Computer Science 2025-05-22 Xiaoling Zhou, Wei Ye, Rui Xie, Shikun Zhang

Deep learning models are known to often learn features that spuriously correlate with the class label during training but are irrelevant to the prediction task. Existing methods typically address this issue by annotating potential spurious…

Computer Vision and Pattern Recognition · Computer Science 2025-12-30 Weiwei Li, Junzhuo Liu, Yuanyuan Ren, Yuchen Zheng, Yahao Liu, Wen Li

Classifiers trained with Empirical Risk Minimization (ERM) tend to rely on attributes that have high spurious correlation with the target. This can degrade the performance on underrepresented (or 'minority') groups that lack these…

Language models (LMs) have achieved notable success in numerous NLP tasks, employing both fine-tuning and in-context learning (ICL) methods. While language models demonstrate exceptional performance, they face robustness challenges due to…

Computation and Language · Computer Science 2024-06-18 Yuhang Zhou, Paiheng Xu, Xiaoyu Liu, Bang An, Wei Ai, Furong Huang

Recent studies highlight that deep learning models often learn spurious features mistakenly linked to labels, compromising their reliability in real-world scenarios where such correlations do not hold. Despite the increasing research…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Xiwei Xuan, Ziquan Deng, Hsuan-Tien Lin, Kwan-Liu Ma

Recent research has revealed that deep neural networks often take dataset biases as a shortcut to make decisions rather than understand tasks, leading to failures in real-world applications. In this study, we focus on the spurious…

Computation and Language · Computer Science 2023-06-23 Yanrui Du, Jing Yan, Yan Chen, Jing Liu, Sendong Zhao, Qiaoqiao She, Hua Wu, Haifeng Wang, Bing Qin

We present a general methodology for using unlabeled data to design semi-supervised learning (SSL) variants of the Empirical Risk Minimization (ERM) learning process. Focusing on generalized linear regression, we analyze of the…

Machine Learning · Statistics 2022-03-08 Oren Yuval, Saharon Rosset

Spurious correlations that degrade model generalization or lead the model to be right for the wrong reasons are one of the main robustness concerns for real-world deployments. However, mitigating these correlations during pre-training for…

Machine Learning · Computer Science 2023-06-01 Yu Yang, Besmira Nushi, Hamid Palangi, Baharan Mirzasoleiman

Models trained with empirical risk minimization (ERM) are prone to be biased towards spurious correlations between target labels and bias attributes, which leads to poor performance on data groups lacking spurious correlations. It is…

Machine Learning · Computer Science 2024-12-23 Hyeonggeun Han, Sehwan Kim, Hyungjun Joo, Sangwoo Hong, Jungwoo Lee

The multi-label classification problem has generated significant interest in recent years. However, existing approaches do not adequately address two key challenges: (a) the ability to tackle problems with a large number (say millions) of…

Machine Learning · Computer Science 2013-11-26 Hsiang-Fu Yu, Prateek Jain, Purushottam Kar, Inderjit S. Dhillon

Predictive performance of machine learning models trained with empirical risk minimization (ERM) can degrade considerably under distribution shifts. The presence of spurious correlations in training datasets leads ERM-trained models to…

Machine Learning · Computer Science 2023-02-08 Simon Roburin, Charles Corbière, Gilles Puy, Nicolas Thome, Matthieu Aubry, Renaud Marlet, Patrick Pérez

We propose self-adaptive training---a new training algorithm that dynamically corrects problematic training labels by model predictions without incurring extra computational cost---to improve generalization of deep learning for potentially…

Machine Learning · Computer Science 2020-10-01 Lang Huang, Chao Zhang, Hongyang Zhang

To address the problem of NLP classifiers learning spurious correlations between training features and target labels, a common approach is to make the model's predictions invariant to these features. However, this can be counter-productive…

Machine Learning · Computer Science 2023-06-22 Parikshit Bansal, Amit Sharma

Deep neural classifiers tend to rely on spurious correlations between spurious attributes of inputs and targets to make predictions, which could jeopardize their generalization capability. Training classifiers robust to spurious…

Machine Learning · Computer Science 2024-05-07 Guangtao Zheng, Wenqian Ye, Aidong Zhang

Real-world classification problems typically exhibit an imbalanced or long-tailed label distribution, wherein many labels are associated with only a few samples. This poses a challenge for generalisation on such labels, and also makes…

Machine Learning · Computer Science 2021-07-13 Aditya Krishna Menon, Sadeep Jayasumana, Ankit Singh Rawat, Himanshu Jain, Andreas Veit, Sanjiv Kumar

In text classification tasks, models often rely on spurious correlations for predictions, incorrectly associating irrelevant features with the target labels. This issue limits the robustness and generalization of models, especially when…

Machine Learning · Computer Science 2025-02-04 Yuqing Zhou, Ziwei Zhu

While existing social bot detectors perform well on benchmarks, their robustness across diverse real-world scenarios remains limited due to unclear ground truth and varied misleading cues. In particular, the impact of shortcut learning,…

Computation and Language · Computer Science 2026-03-24 Shiyan Zheng, Herun Wan, Minnan Luo, Junhang Huang