Related papers: How Does Counterfactually Augmented Data Impact Mo…

Counterfactually Augmented Data and Unintended Bias: The Case of Sexism and Hate Speech Detection

Counterfactually Augmented Data (CAD) aims to improve out-of-domain generalizability, an indicator of model robustness. The improvement is credited with promoting core features of the construct over spurious artifacts that happen to…

Computation and Language · Computer Science 2022-05-10 Indira Sen , Mattia Samory , Claudia Wagner , Isabelle Augenstein

People Make Better Edits: Measuring the Efficacy of LLM-Generated Counterfactually Augmented Data for Harmful Language Detection

NLP models are used in a variety of critical social computing tasks, such as detecting sexist, racist, or otherwise hateful content. Therefore, it is imperative that these models are robust to spurious features. Past work has attempted to…

Computation and Language · Computer Science 2024-02-27 Indira Sen , Dennis Assenmacher , Mattia Samory , Isabelle Augenstein , Wil van der Aalst , Claudia Wagner

PairCFR: Enhancing Model Training on Paired Counterfactually Augmented Data through Contrastive Learning

Counterfactually Augmented Data (CAD) involves creating new data samples by applying minimal yet sufficient modifications to flip the label of existing data samples to other classes. Training with CAD enhances model robustness against…

Machine Learning · Computer Science 2024-06-12 Xiaoqi Qiu , Yongjie Wang , Xu Guo , Zhiwei Zeng , Yue Yu , Yuhong Feng , Chunyan Miao

An Investigation of the (In)effectiveness of Counterfactually Augmented Data

While pretrained language models achieve excellent performance on natural language understanding benchmarks, they tend to rely on spurious correlations and generalize poorly to out-of-distribution (OOD) data. Recent work has explored using…

Computation and Language · Computer Science 2022-03-17 Nitish Joshi , He He

Explaining The Efficacy of Counterfactually Augmented Data

In attempts to produce ML models less reliant on spurious patterns in NLP datasets, researchers have recently proposed curating counterfactually augmented data (CAD) via a human-in-the-loop process in which given some documents and their…

Computation and Language · Computer Science 2021-03-25 Divyansh Kaushik , Amrith Setlur , Eduard Hovy , Zachary C. Lipton

Exploring the Efficacy of Automatically Generated Counterfactuals for Sentiment Analysis

While state-of-the-art NLP models have been achieving the excellent performance of a wide range of tasks in recent years, important questions are being raised about their robustness and their underlying sensitivity to systematic biases that…

Computation and Language · Computer Science 2022-03-25 Linyi Yang , Jiazheng Li , Pádraig Cunningham , Yue Zhang , Barry Smyth , Ruihai Dong

Does Data Augmentation Improve Generalization in NLP?

Neural models often exploit superficial features to achieve good performance, rather than deriving more general features. Overcoming this tendency is a central challenge in areas such as representation learning and ML fairness. Recent work…

Computation and Language · Computer Science 2020-10-12 Rohan Jha , Charles Lovering , Ellie Pavlick

Towards Robust Aspect-based Sentiment Analysis through Non-counterfactual Augmentations

While state-of-the-art NLP models have demonstrated excellent performance for aspect based sentiment analysis (ABSA), substantial evidence has been presented on their lack of robustness. This is especially manifested as significant…

Computation and Language · Computer Science 2023-07-24 Xinyu Liu , Yan Ding , Kaikai An , Chunyang Xiao , Pranava Madhyastha , Tong Xiao , Jingbo Zhu

SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data

In various natural language processing (NLP) tasks, fine-tuning Pre-trained Language Models (PLMs) often leads to the issue of spurious correlations, which negatively impacts performance, particularly when dealing with out-of-distribution…

Computation and Language · Computer Science 2025-04-17 Suyoung Bae , Hyojun Kim , YunSeok Choi , Jee-Hyong Lee

Addressing Both Statistical and Causal Gender Fairness in NLP Models

Statistical fairness stipulates equivalent outcomes for every protected group, whereas causal fairness prescribes that a model makes the same prediction for an individual regardless of their protected characteristics. Counterfactual data…

Computation and Language · Computer Science 2024-04-02 Hannah Chen , Yangfeng Ji , David Evans

Context-Aware Counterfactual Data Augmentation for Gender Bias Mitigation in Language Models

A challenge in mitigating social bias in fine-tuned language models (LMs) is the potential reduction in language modeling capability, which can harm downstream performance. Counterfactual data augmentation (CDA), a widely used method for…

Computation and Language · Computer Science 2026-02-11 Shweta Parihar , Liu Guangliang , Natalie Parde , Lu Cheng

Topic-aware Causal Intervention for Counterfactual Detection

Counterfactual statements, which describe events that did not or cannot take place, are beneficial to numerous NLP applications. Hence, we consider the problem of counterfactual detection (CFD) and seek to enhance the CFD models. Previous…

Computation and Language · Computer Science 2024-10-01 Thong Nguyen , Truc-My Nguyen

AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning

Recent studies have shown the impressive efficacy of counterfactually augmented data (CAD) for reducing NLU models' reliance on spurious features and improving their generalizability. However, current methods still heavily rely on human…

Artificial Intelligence · Computer Science 2022-11-30 Jiaxin Wen , Yeshuang Zhu , Jinchao Zhang , Jie Zhou , Minlie Huang

Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness

Data-driven predictive solutions predominant in commercial applications tend to suffer from biases and stereotypes, which raises equity concerns. Prediction models may discover, use, or amplify spurious correlations based on gender or other…

Computation and Language · Computer Science 2022-11-28 Abdelrahman Zayed , Prasanna Parthasarathi , Goncalo Mordido , Hamid Palangi , Samira Shabanian , Sarath Chandar

Bias Challenges in Counterfactual Data Augmentation

Deep learning models tend not to be out-of-distribution robust primarily due to their reliance on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that…

Machine Learning · Computer Science 2022-09-15 S Chandra Mouli , Yangze Zhou , Bruno Ribeiro

Robust Models are less Over-Confident

Despite the success of convolutional neural networks (CNNs) in many academic benchmarks for computer vision tasks, their application in the real-world is still facing fundamental challenges. One of these open problems is the inherent lack…

Computer Vision and Pattern Recognition · Computer Science 2022-12-07 Julia Grabinski , Paul Gavrikov , Janis Keuper , Margret Keuper

Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals

The inevitable appearance of spurious correlations in training datasets hurts the generalization of NLP models on unseen data. Previous work has found that datasets with paired inputs are prone to correlations between a specific part of the…

Computation and Language · Computer Science 2024-10-10 Yanai Elazar , Bhargavi Paranjape , Hao Peng , Sarah Wiegreffe , Khyathi Raghavi , Vivek Srikumar , Sameer Singh , Noah A. Smith

FairFlow: An Automated Approach to Model-based Counterfactual Data Augmentation For NLP

Despite the evolution of language models, they continue to portray harmful societal biases and stereotypes inadvertently learned from training data. These inherent biases often result in detrimental effects in various applications.…

Computation and Language · Computer Science 2024-07-24 Ewoenam Kwaku Tokpo , Toon Calders

Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation

The automatic detection of hate speech online is an active research area in NLP. Most of the studies to date are based on social media datasets that contribute to the creation of hate speech detection models trained on them. However, data…

Computation and Language · Computer Science 2023-07-06 Dimosthenis Antypas , Jose Camacho-Collados

Model-based Counterfactual Generator for Gender Bias Mitigation

Counterfactual Data Augmentation (CDA) has been one of the preferred techniques for mitigating gender bias in natural language models. CDA techniques have mostly employed word substitution based on dictionaries. Although such…

Computation and Language · Computer Science 2023-11-07 Ewoenam Kwaku Tokpo , Toon Calders