Related papers: Identifying Spurious Correlations for Robust Text …

Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis

Recent research has revealed that machine learning models have a tendency to leverage spurious correlations that exist in the training set but may not hold true in general circumstances. For instance, a sentiment classifier may erroneously…

Computation and Language · Computer Science 2024-02-06 Oscar Chew , Hsuan-Tien Lin , Kai-Wei Chang , Kuan-Hao Huang

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Spurious correlations threaten the validity of statistical classifiers. While model accuracy may appear high when the test data is from the same distribution as the training data, it can quickly degrade when the test distribution changes.…

Machine Learning · Computer Science 2020-12-21 Zhao Wang , Aron Culotta

Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective

In text classification tasks, models often rely on spurious correlations for predictions, incorrectly associating irrelevant features with the target labels. This issue limits the robustness and generalization of models, especially when…

Machine Learning · Computer Science 2025-02-04 Yuqing Zhou , Ziwei Zhu

Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language

Spurious correlations are a threat to the trustworthiness of natural language processing systems, motivating research into methods for identifying and eliminating them. However, addressing the problem of spurious correlations requires more…

Computation and Language · Computer Science 2022-05-05 Jacob Eisenstein

Are All Spurious Features in Natural Language Alike? An Analysis through a Causal Lens

The term `spurious correlations' has been used in NLP to informally denote any undesirable feature-label correlations. However, a correlation can be undesirable because (i) the feature is irrelevant to the label (e.g. punctuation in a…

Computation and Language · Computer Science 2022-10-26 Nitish Joshi , Xiang Pan , He He

Detecting Spurious Correlations via Robust Visual Concepts in Real and AI-Generated Image Classification

Often machine learning models tend to automatically learn associations present in the training data without questioning their validity or appropriateness. This undesirable property is the root cause of the manifestation of spurious…

Machine Learning · Computer Science 2023-11-17 Preetam Prabhu Srikar Dammu , Chirag Shah

Learning Robust Classifiers with Self-Guided Spurious Correlation Mitigation

Deep neural classifiers tend to rely on spurious correlations between spurious attributes of inputs and targets to make predictions, which could jeopardize their generalization capability. Training classifiers robust to spurious…

Machine Learning · Computer Science 2024-05-07 Guangtao Zheng , Wenqian Ye , Aidong Zhang

Controlling Learned Effects to Reduce Spurious Correlations in Text Classifiers

To address the problem of NLP classifiers learning spurious correlations between training features and target labels, a common approach is to make the model's predictions invariant to these features. However, this can be counter-productive…

Machine Learning · Computer Science 2023-06-22 Parikshit Bansal , Amit Sharma

Spuriousness-Aware Meta-Learning for Learning Robust Classifiers

Spurious correlations are brittle associations between certain attributes of inputs and target variables, such as the correlation between an image background and an object class. Deep image classifiers often leverage them for predictions,…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Guangtao Zheng , Wenqian Ye , Aidong Zhang

Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models

Recently, NLP models have achieved remarkable progress across a variety of tasks; however, they have also been criticized for being not robust. Many robustness problems can be attributed to models exploiting spurious correlations, or…

Computation and Language · Computer Science 2022-05-26 Tianlu Wang , Rohit Sridhar , Diyi Yang , Xuezhi Wang

Explore Spurious Correlations at the Concept Level in Language Models for Text Classification

Language models (LMs) have achieved notable success in numerous NLP tasks, employing both fine-tuning and in-context learning (ICL) methods. While language models demonstrate exceptional performance, they face robustness challenges due to…

Computation and Language · Computer Science 2024-06-18 Yuhang Zhou , Paiheng Xu , Xiaoyu Liu , Bang An , Wei Ai , Furong Huang

Identifying Spurious Correlations and Correcting them with an Explanation-based Learning

Identifying spurious correlations learned by a trained model is at the core of refining a trained model and building a trustworthy model. We present a simple method to identify spurious correlations that have been learned by a model trained…

Computer Vision and Pattern Recognition · Computer Science 2022-12-07 Misgina Tsighe Hagos , Kathleen M. Curran , Brian Mac Namee

Towards Robust Classification Model by Counterfactual and Invariant Data Generation

Despite the success of machine learning applications in science, industry, and society in general, many approaches are known to be non-robust, often relying on spurious correlations to make predictions. Spuriousness occurs when some…

Computer Vision and Pattern Recognition · Computer Science 2021-06-04 Chun-Hao Chang , George Alexandru Adam , Anna Goldenberg

Measuring Spurious Correlation in Classification: 'Clever Hans' in Translationese

Recent work has shown evidence of 'Clever Hans' behavior in high-performance neural translationese classifiers, where BERT-based classifiers capitalize on spurious correlations, in particular topic information, between data and target…

Computation and Language · Computer Science 2024-06-13 Angana Borah , Daria Pylypenko , Cristina Espana-Bonet , Josef van Genabith

Causal Effect Regularization: Automated Detection and Removal of Spurious Attributes

In many classification datasets, the task labels are spuriously correlated with some input attributes. Classifiers trained on such datasets often rely on these attributes for prediction, especially when the spurious correlation is high, and…

Machine Learning · Computer Science 2023-12-11 Abhinav Kumar , Amit Deshpande , Amit Sharma

Let Samples Speak: Mitigating Spurious Correlation by Exploiting the Clusterness of Samples

Deep learning models are known to often learn features that spuriously correlate with the class label during training but are irrelevant to the prediction task. Existing methods typically address this issue by annotating potential spurious…

Computer Vision and Pattern Recognition · Computer Science 2025-12-30 Weiwei Li , Junzhuo Liu , Yuanyuan Ren , Yuchen Zheng , Yahao Liu , Wen Li

Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests

Informally, a 'spurious correlation' is the dependence of a model on some aspect of the input data that an analyst thinks shouldn't matter. In machine learning, these have a know-it-when-you-see-it character; e.g., changing the gender of a…

Machine Learning · Computer Science 2021-11-04 Victor Veitch , Alexander D'Amour , Steve Yadlowsky , Jacob Eisenstein

Correlation-Based Method for Sentiment Classification

The classic supervised classification algorithms are efficient, but time-consuming, complicated and not interpretable, which makes it difficult to analyze their results that limits the possibility to improve them based on real observations.…

Computation and Language · Computer Science 2018-03-05 Hussam Hamdan

A Toolkit for Detecting Spurious Correlations in Speech Datasets

We introduce a toolkit for uncovering spurious correlations between recording characteristics and target class in speech datasets. Spurious correlations may arise due to heterogeneous recording conditions, a common scenario for…

Sound · Computer Science 2026-04-30 Lara Gauder , Pablo Riera , Andrea Slachevsky , Gonzalo Forno , Adolfo M. García , Luciana Ferrer

Out of Spuriousity: Improving Robustness to Spurious Correlations without Group Annotations

Machine learning models are known to learn spurious correlations, i.e., features having strong relations with class labels but no causal relation. Relying on those correlations leads to poor performance in the data groups without these…

Machine Learning · Computer Science 2026-04-28 Phuong Quynh Le , Jörg Schlötterer , Christin Seifert