Related papers: Data Augmentations for Improved (Large) Language M…

Learning the Difference that Makes a Difference with Counterfactually-Augmented Data

Despite alarm over the reliance of machine learning systems on so-called spurious patterns, the term lacks coherent meaning in standard statistical frameworks. However, the language of causality offers clarity: spurious associations are due…

Computation and Language · Computer Science 2020-02-18 Divyansh Kaushik , Eduard Hovy , Zachary C. Lipton

Bias Challenges in Counterfactual Data Augmentation

Deep learning models tend not to be out-of-distribution robust primarily due to their reliance on spurious features to solve the task. Counterfactual data augmentations provide a general way of (approximately) achieving representations that…

Machine Learning · Computer Science 2022-09-15 S Chandra Mouli , Yangze Zhou , Bruno Ribeiro

Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision

One of the primary challenges limiting the applicability of deep learning is its susceptibility to learning spurious correlations rather than the underlying mechanisms of the task of interest. The resulting failure to generalise cannot be…

Computer Vision and Pattern Recognition · Computer Science 2020-04-21 Damien Teney , Ehsan Abbasnedjad , Anton van den Hengel

Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data

A growing body of work shows that models exploit annotation artifacts to achieve state-of-the-art performance on standard crowdsourced benchmarks---datasets collected from crowdworkers to create an evaluation task---while still failing on…

Computation and Language · Computer Science 2020-10-13 William Huang , Haokun Liu , Samuel R. Bowman

Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression

We introduce context augmentation, a data-augmentation approach that uses large language models (LLMs) to generate contexts around observed strings as a means of facilitating valid frequentist inference. These generated contexts serve to…

Methodology · Statistics 2025-07-01 Marc Ratkovic

Does Using Counterfactual Help LLMs Explain Textual Importance in Classification?

Large language models (LLMs) are becoming useful in many domains due to their impressive abilities that arise from large training datasets and large model sizes. More recently, they have been shown to be very effective in textual…

Computation and Language · Computer Science 2025-10-07 Nelvin Tan , James Asikin Cheung , Yu-Ching Shih , Dong Yang , Amol Salunkhe

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Spurious correlations threaten the validity of statistical classifiers. While model accuracy may appear high when the test data is from the same distribution as the training data, it can quickly degrade when the test distribution changes.…

Machine Learning · Computer Science 2020-12-21 Zhao Wang , Aron Culotta

An Investigation of the (In)effectiveness of Counterfactually Augmented Data

While pretrained language models achieve excellent performance on natural language understanding benchmarks, they tend to rely on spurious correlations and generalize poorly to out-of-distribution (OOD) data. Recent work has explored using…

Computation and Language · Computer Science 2022-03-17 Nitish Joshi , He He

Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations

We propose a novel data augmentation for labeled sentences called contextual augmentation. We assume an invariance that sentences are natural even if the words in the sentences are replaced with other words with paradigmatic relations. We…

Computation and Language · Computer Science 2018-05-17 Sosuke Kobayashi

On Counterfactual Data Augmentation Under Confounding

Counterfactual data augmentation has recently emerged as a method to mitigate confounding biases in the training data. These biases, such as spurious correlations, arise due to various observed and unobserved confounding variables in the…

Machine Learning · Computer Science 2023-11-22 Abbavaram Gowtham Reddy , Saketh Bachu , Saloni Dash , Charchit Sharma , Amit Sharma , Vineeth N Balasubramanian

Counterfactual Augmentation for Multimodal Learning Under Presentation Bias

In real-world machine learning systems, labels are often derived from user behaviors that the system wishes to encourage. Over time, new models must be trained as new training examples and features become available. However, feedback loops…

Machine Learning · Computer Science 2023-11-01 Victoria Lin , Louis-Philippe Morency , Dimitrios Dimitriadis , Srinagesh Sharma

Augmentation by Counterfactual Explanation -- Fixing an Overconfident Classifier

A highly accurate but overconfident model is ill-suited for deployment in critical applications such as healthcare and autonomous driving. The classification outcome should reflect a high uncertainty on ambiguous in-distribution samples…

Machine Learning · Computer Science 2022-10-25 Sumedha Singla , Nihal Murali , Forough Arabshahi , Sofia Triantafyllou , Kayhan Batmanghelich

Counterfactual Data Augmentation improves Factuality of Abstractive Summarization

Abstractive summarization systems based on pretrained language models often generate coherent but factually inconsistent sentences. In this paper, we present a counterfactual data augmentation approach where we augment data with perturbed…

Computation and Language · Computer Science 2022-05-26 Dheeraj Rajagopal , Siamak Shakeri , Cicero Nogueira dos Santos , Eduard Hovy , Chung-Ching Chang

Selecting Data Augmentation for Simulating Interventions

Machine learning models trained with purely observational data and the principle of empirical risk minimization \citep{vapnik_principles_1992} can fail to generalize to unseen domains. In this paper, we focus on the case where the problem…

Machine Learning · Statistics 2020-10-27 Maximilian Ilse , Jakub M. Tomczak , Patrick Forré

Making a (Counterfactual) Difference One Rationale at a Time

Rationales, snippets of extracted text that explain an inference, have emerged as a popular framework for interpretable natural language processing (NLP). Rationale models typically consist of two cooperating modules: a selector and a…

Computation and Language · Computer Science 2022-01-17 Mitchell Plyler , Michael Green , Min Chi

Diversity-oriented Data Augmentation with Large Language Models

Data augmentation is an essential technique in natural language processing (NLP) for enriching training datasets by generating diverse samples. This process is crucial for improving the robustness and generalization capabilities of NLP…

Computation and Language · Computer Science 2025-10-16 Zaitian Wang , Jinghan Zhang , Xinhao Zhang , Kunpeng Liu , Pengfei Wang , Yuanchun Zhou

Improving Classifier Robustness through Active Generation of Pairwise Counterfactuals

Counterfactual Data Augmentation (CDA) is a commonly used technique for improving robustness in natural language classifiers. However, one fundamental challenge is how to discover meaningful counterfactuals and efficiently label them, with…

Computation and Language · Computer Science 2023-05-24 Ananth Balashankar , Xuezhi Wang , Yao Qin , Ben Packer , Nithum Thain , Jilin Chen , Ed H. Chi , Alex Beutel

Relation-based Counterfactual Data Augmentation and Contrastive Learning for Robustifying Natural Language Inference Models

Although pre-trained language models show good performance on various natural language processing tasks, they often rely on non-causal features and patterns to determine the outcome. For natural language inference tasks, previous results…

Computation and Language · Computer Science 2024-10-29 Heerin Yang , Sseung-won Hwang , Jungmin So

Aligning Language Models with Observational Data: Opportunities and Risks from a Causal Perspective

Large language models are being widely used across industries to generate content that contributes directly to key performance metrics, such as conversion rates. Pretrained models, however, often fall short when it comes to aligning with…

Machine Learning · Computer Science 2025-06-03 Erfan Loghmani

Deep Learning on a Healthy Data Diet: Finding Important Examples for Fairness

Data-driven predictive solutions predominant in commercial applications tend to suffer from biases and stereotypes, which raises equity concerns. Prediction models may discover, use, or amplify spurious correlations based on gender or other…

Computation and Language · Computer Science 2022-11-28 Abdelrahman Zayed , Prasanna Parthasarathi , Goncalo Mordido , Hamid Palangi , Samira Shabanian , Sarath Chandar