Related papers: Are Sample-Efficient NLP Models More Robust?

200 papers

Although machine learning models typically experience a drop in performance on out-of-distribution data, accuracies on in- versus out-of-distribution data are widely observed to follow a single linear trend when evaluated across a testbed…

Machine Learning · Computer Science 2021-07-01 Anders Andreassen, Yasaman Bahri, Behnam Neyshabur, Rebecca Roelofs

Although pretrained Transformers such as BERT achieve high accuracy on in-distribution examples, do they generalize to new distributions? We systematically measure out-of-distribution (OOD) generalization for seven NLP datasets by…

Computation and Language · Computer Science 2020-04-17 Dan Hendrycks, Xiaoyuan Liu, Eric Wallace, Adam Dziedzic, Rishabh Krishnan, Dawn Song

"Effective robustness" measures the extra out-of-distribution (OOD) robustness beyond what can be predicted from the in-distribution (ID) performance. Existing effective robustness evaluations typically use a single test set such as…

Machine Learning · Computer Science 2023-10-31 Zhouxing Shi, Nicholas Carlini, Ananth Balashankar, Ludwig Schmidt, Cho-Jui Hsieh, Alex Beutel, Yao Qin

This paper reexamines the research on out-of-distribution (OOD) robustness in the field of NLP. We find that the distribution shift settings in previous studies commonly lack adequate challenges, hindering the accurate evaluation of OOD…

Computation and Language · Computer Science 2023-10-27 Lifan Yuan, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Fangyuan Zou, Xingyi Cheng, Heng Ji, Zhiyuan Liu, Maosong Sun

Recent work has focused on compressing pre-trained language models (PLMs) like BERT where the major focus has been to improve the in-distribution performance for downstream tasks. However, very few of these studies have analyzed the impact…

Computation and Language · Computer Science 2023-02-28 Mengnan Du, Subhabrata Mukherjee, Yu Cheng, Milad Shokouhi, Xia Hu, Ahmed Hassan Awadallah

Improving the accuracy of deep neural networks (DNNs) on out-of-distribution (OOD) data is critical to an acceptance of deep learning (DL) in real world applications. It has been observed that accuracies on in-distribution (ID) versus OOD…

Machine Learning · Computer Science 2022-07-12 Sara Fridovich-Keil, Brian R. Bartoldson, James Diffenderfer, Bhavya Kailkhura, Peer-Timo Bremer

In the field of computer vision, fine-tuning pre-trained models has become a prevalent strategy for out-of-distribution (OOD) generalization tasks. Different from most prior work that has focused on advancing learning algorithms, we…

Machine Learning · Computer Science 2025-04-29 Hiroki Naganuma, Ryuichiro Hataya, Kotaro Yoshida, Ioannis Mitliagkas

State-of-the-art image classifiers trained on massive datasets (such as ImageNet) have been shown to be vulnerable to a range of both intentional and incidental distribution shifts. On the other hand, several recent classifiers with…

Computer Vision and Pattern Recognition · Computer Science 2022-06-16 Benjamin Feuer, Ameya Joshi, Chinmay Hegde

Despite machine learning models' success in Natural Language Processing (NLP) tasks, predictions from these models frequently fail on out-of-distribution (OOD) samples. Prior works have focused on developing state-of-the-art methods for…

Computation and Language · Computer Science 2021-11-30 Dyah Adila, Dongyeop Kang

Out-of-distribution (OOD) generalization is a complicated problem due to the idiosyncrasies of possible distribution shifts between training and test domains. Most benchmarks employ diverse datasets to address this issue; however, the…

Machine Learning · Computer Science 2023-12-18 Kaican Li, Yifan Zhang, Lanqing Hong, Zhenguo Li, Nevin L. Zhang

Large-scale pretrained models are widely leveraged as foundations for learning new specialized tasks via fine-tuning, with the goal of maintaining the general performance of the model while allowing it to gain new skills. A valuable goal…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Jaedong Hwang, Brian Cheung, Zhang-Wei Hong, Akhilan Boopathy, Pulkit Agrawal, Ila Fiete

Pre-trained language models (PLMs) are known to improve the generalization performance of natural language understanding models by leveraging large amounts of data during the pre-training phase. However, the out-of-distribution (OOD)…

Computation and Language · Computer Science 2023-05-23 Linyi Yang, Shuibai Zhang, Libo Qin, Yafu Li, Yidong Wang, Hanmeng Liu, Jindong Wang, Xing Xie, Yue Zhang

In programming, learning code representations has a variety of applications, including code classification, code search, comment generation, bug prediction, and so on. Various representations of code in terms of tokens, syntax trees,…

Machine Learning · Computer Science 2022-05-10 Nghi D. Q. Bui, Yijun Yu

In the real world, a learning system could receive an input that is unlike anything it has seen during training. Unfortunately, out-of-distribution samples can lead to unpredictable behaviour. We need to know whether any given input belongs…

Machine Learning · Computer Science 2019-08-21 Alireza Shafaei, Mark Schmidt, James J. Little

Constructing a robust model that can effectively generalize to test samples under distribution shifts remains a significant challenge in the field of medical imaging. The foundational models for vision and language, pre-trained on extensive…

A machine learning model that generalizes well should obtain low errors on unseen test examples. Thus, if we learn an optimal model in training data, it could have better generalization performance in testing tasks. However, learning such a…

Computer Vision and Pattern Recognition · Computer Science 2023-02-22 Penghao Jiang, Xin Ke, ZiFeng Wang, Chunxi Li

We conduct a large empirical evaluation to investigate the landscape of distributional robustness in question answering. Our investigation spans over 350 models and 16 question answering datasets, including a diverse set of architectures,…

Computation and Language · Computer Science 2022-10-25 Anas Awadalla, Mitchell Wortsman, Gabriel Ilharco, Sewon Min, Ian Magnusson, Hannaneh Hajishirzi, Ludwig Schmidt

Consider a prediction setting with few in-distribution labeled examples and many unlabeled examples both in- and out-of-distribution (OOD). The goal is to learn a model which performs well both in-distribution and OOD. In these settings,…

Machine Learning · Computer Science 2021-04-08 Sang Michael Xie, Ananya Kumar, Robbie Jones, Fereshte Khani, Tengyu Ma, Percy Liang

Transformer-based pretrained language models (PLMs) have achieved great success in modern NLP. An important advantage of PLMs is good out-of-distribution (OOD) robustness. Recently, diffusion models have attracted a lot of work to apply…

Computation and Language · Computer Science 2023-07-27 Huazheng Wang, Daixuan Cheng, Haifeng Sun, Jingyu Wang, Qi Qi, Jianxin Liao, Jing Wang, Cong Liu

Deep models often fail to generalize well in test domains when the data distribution differs from that in the training domain. Among numerous approaches to address this Out-of-Distribution (OOD) generalization problem, there has been a…

Machine Learning · Computer Science 2022-10-14 Qixun Wang, Yifei Wang, Hong Zhu, Yisen Wang