Related papers: Are Sample-Efficient NLP Models More Robust?

The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning

Although machine learning models typically experience a drop in performance on out-of-distribution data, accuracies on in- versus out-of-distribution data are widely observed to follow a single linear trend when evaluated across a testbed…

Machine Learning · Computer Science 2021-07-01 Anders Andreassen , Yasaman Bahri , Behnam Neyshabur , Rebecca Roelofs

Pretrained Transformers Improve Out-of-Distribution Robustness

Although pretrained Transformers such as BERT achieve high accuracy on in-distribution examples, do they generalize to new distributions? We systematically measure out-of-distribution (OOD) generalization for seven NLP datasets by…

Computation and Language · Computer Science 2020-04-17 Dan Hendrycks , Xiaoyuan Liu , Eric Wallace , Adam Dziedzic , Rishabh Krishnan , Dawn Song

Effective Robustness against Natural Distribution Shifts for Models with Different Training Data

"Effective robustness" measures the extra out-of-distribution (OOD) robustness beyond what can be predicted from the in-distribution (ID) performance. Existing effective robustness evaluations typically use a single test set such as…

Machine Learning · Computer Science 2023-10-31 Zhouxing Shi , Nicholas Carlini , Ananth Balashankar , Ludwig Schmidt , Cho-Jui Hsieh , Alex Beutel , Yao Qin

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations

This paper reexamines the research on out-of-distribution (OOD) robustness in the field of NLP. We find that the distribution shift settings in previous studies commonly lack adequate challenges, hindering the accurate evaluation of OOD…

Computation and Language · Computer Science 2023-10-27 Lifan Yuan , Yangyi Chen , Ganqu Cui , Hongcheng Gao , Fangyuan Zou , Xingyi Cheng , Heng Ji , Zhiyuan Liu , Maosong Sun

Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding

Recent work has focused on compressing pre-trained language models (PLMs) like BERT where the major focus has been to improve the in-distribution performance for downstream tasks. However, very few of these studies have analyzed the impact…

Computation and Language · Computer Science 2023-02-28 Mengnan Du , Subhabrata Mukherjee , Yu Cheng , Milad Shokouhi , Xia Hu , Ahmed Hassan Awadallah

Models Out of Line: A Fourier Lens on Distribution Shift Robustness

Improving the accuracy of deep neural networks (DNNs) on out-of-distribution (OOD) data is critical to an acceptance of deep learning (DL) in real world applications. It has been observed that accuracies on in-distribution (ID) versus OOD…

Machine Learning · Computer Science 2022-07-12 Sara Fridovich-Keil , Brian R. Bartoldson , James Diffenderfer , Bhavya Kailkhura , Peer-Timo Bremer

An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration

In the field of computer vision, fine-tuning pre-trained models has become a prevalent strategy for out-of-distribution (OOD) generalization tasks. Different from most prior work that has focused on advancing learning algorithms, we…

Machine Learning · Computer Science 2025-04-29 Hiroki Naganuma , Ryuichiro Hataya , Kotaro Yoshida , Ioannis Mitliagkas

A Meta-Analysis of Distributionally-Robust Models

State-of-the-art image classifiers trained on massive datasets (such as ImageNet) have been shown to be vulnerable to a range of both intentional and incidental distribution shifts. On the other hand, several recent classifiers with…

Computer Vision and Pattern Recognition · Computer Science 2022-06-16 Benjamin Feuer , Ameya Joshi , Chinmay Hegde

Understanding Out-of-distribution: A Perspective of Data Dynamics

Despite machine learning models' success in Natural Language Processing (NLP) tasks, predictions from these models frequently fail on out-of-distribution (OOD) samples. Prior works have focused on developing state-of-the-art methods for…

Computation and Language · Computer Science 2021-11-30 Dyah Adila , Dongyeop Kang

Robustness May be More Brittle than We Think under Different Degrees of Distribution Shifts

Out-of-distribution (OOD) generalization is a complicated problem due to the idiosyncrasies of possible distribution shifts between training and test domains. Most benchmarks employ diverse datasets to address this issue; however, the…

Machine Learning · Computer Science 2023-12-18 Kaican Li , Yifan Zhang , Lanqing Hong , Zhenguo Li , Nevin L. Zhang

Large Pre-Training Datasets Don't Always Guarantee Robustness after Fine-Tuning

Large-scale pretrained models are widely leveraged as foundations for learning new specialized tasks via fine-tuning, with the goal of maintaining the general performance of the model while allowing it to gain new skills. A valuable goal…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Jaedong Hwang , Brian Cheung , Zhang-Wei Hong , Akhilan Boopathy , Pulkit Agrawal , Ila Fiete

GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective

Pre-trained language models (PLMs) are known to improve the generalization performance of natural language understanding models by leveraging large amounts of data during the pre-training phase. However, the out-of-distribution (OOD)…

Computation and Language · Computer Science 2023-05-23 Linyi Yang , Shuibai Zhang , Libo Qin , Yafu Li , Yidong Wang , Hanmeng Liu , Jindong Wang , Xing Xie , Yue Zhang

Energy-bounded Learning for Robust Models of Code

In programming, learning code representations has a variety of applications, including code classification, code search, comment generation, bug prediction, and so on. Various representations of code in terms of tokens, syntax trees,…

Machine Learning · Computer Science 2022-05-10 Nghi D. Q. Bui , Yijun Yu

A Less Biased Evaluation of Out-of-distribution Sample Detectors

In the real world, a learning system could receive an input that is unlike anything it has seen during training. Unfortunately, out-of-distribution samples can lead to unpredictable behaviour. We need to know whether any given input belongs…

Machine Learning · Computer Science 2019-08-21 Alireza Shafaei , Mark Schmidt , James J. Little

On the Out of Distribution Robustness of Foundation Models in Medical Image Segmentation

Constructing a robust model that can effectively generalize to test samples under distribution shifts remains a significant challenge in the field of medical imaging. The foundational models for vision and language, pre-trained on extensive…

Image and Video Processing · Electrical Eng. & Systems 2023-11-21 Duy Minh Ho Nguyen , Tan Ngoc Pham , Nghiem Tuong Diep , Nghi Quoc Phan , Quang Pham , Vinh Tong , Binh T. Nguyen , Ngan Hoang Le , Nhat Ho , Pengtao Xie , Daniel Sonntag , Mathias Niepert

Robust Meta Learning for Image based tasks

A machine learning model that generalizes well should obtain low errors on unseen test examples. Thus, if we learn an optimal model in training data, it could have better generalization performance in testing tasks. However, learning such a…

Computer Vision and Pattern Recognition · Computer Science 2023-02-22 Penghao Jiang , Xin Ke , ZiFeng Wang , Chunxi Li

Exploring The Landscape of Distributional Robustness for Question Answering Models

We conduct a large empirical evaluation to investigate the landscape of distributional robustness in question answering. Our investigation spans over 350 models and 16 question answering datasets, including a diverse set of architectures,…

Computation and Language · Computer Science 2022-10-25 Anas Awadalla , Mitchell Wortsman , Gabriel Ilharco , Sewon Min , Ian Magnusson , Hannaneh Hajishirzi , Ludwig Schmidt

In-N-Out: Pre-Training and Self-Training using Auxiliary Information for Out-of-Distribution Robustness

Consider a prediction setting with few in-distribution labeled examples and many unlabeled examples both in- and out-of-distribution (OOD). The goal is to learn a model which performs well both in-distribution and OOD. In these settings,…

Machine Learning · Computer Science 2021-04-08 Sang Michael Xie , Ananya Kumar , Robbie Jones , Fereshte Khani , Tengyu Ma , Percy Liang

How Does Diffusion Influence Pretrained Language Models on Out-of-Distribution Data?

Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted a lot of work to apply…

Computation and Language · Computer Science 2023-07-27 Huazheng Wang , Daixuan Cheng , Haifeng Sun , Jingyu Wang , Qi Qi , Jianxin Liao , Jing Wang , Cong Liu

Improving Out-of-Distribution Generalization by Adversarial Training with Structured Priors

Deep models often fail to generalize well in test domains when the data distribution differs from that in the training domain. Among numerous approaches to address this Out-of-Distribution (OOD) generalization problem, there has been a…

Machine Learning · Computer Science 2022-10-14 Qixun Wang , Yifei Wang , Hong Zhu , Yisen Wang