English
Related papers

Related papers: Predicting Performance for Natural Language Proces…

200 papers

Performance prediction, the task of estimating a system's performance without performing experiments, allows us to reduce the experimental burden caused by the combinatorial explosion of different datasets, languages, tasks, and models. In…

Computation and Language · Computer Science 2021-02-11 Zihuiwen Ye , Pengfei Liu , Jinlan Fu , Graham Neubig

Recent advancements in NLP have given us models like mBERT and XLMR that can serve over 100 languages. The languages that these models are evaluated on, however, are very few in number, and it is unlikely that evaluation datasets will cover…

Computation and Language · Computer Science 2021-10-19 Anirudh Srinivasan , Sunayana Sitaram , Tanuja Ganu , Sandipan Dandapat , Kalika Bali , Monojit Choudhury

This article emphasizes that NLP as a science seeks to make inferences about the performance effects that result from applying one method (compared to another method) in the processing of natural language. Yet NLP research in practice…

Computation and Language · Computer Science 2022-09-15 Sandra Wankmüller

Scaling existing applications and solutions to multiple human languages has traditionally proven to be difficult, mainly due to the language-dependent nature of preprocessing and feature engineering techniques employed in traditional…

Computation and Language · Computer Science 2020-01-01 Xiaotong Liu , Yingbei Tong , Anbang Xu , Rama Akkiraju

Many promising-looking ideas in AI research fail to deliver, but their validation takes substantial human labor and compute. Predicting an idea's chance of success is thus crucial for accelerating empirical AI research, a skill that even…

Artificial Intelligence · Computer Science 2025-06-03 Jiaxin Wen , Chenglei Si , Yueh-han Chen , He He , Shi Feng

As the applications of Natural Language Processing (NLP) in sensitive areas like Political Profiling, Review of Essays in Education, etc. proliferate, there is a great need for increasing transparency in NLP models to build trust with…

Computation and Language · Computer Science 2022-11-29 Adel Rahimi , Shaurya Jain

Although recent Massively Multilingual Language Models (MMLMs) like mBERT and XLMR support around 100 languages, most existing multilingual NLP benchmarks provide evaluation data in only a handful of these languages with little linguistic…

Computation and Language · Computer Science 2022-11-15 Kabir Ahuja , Sandipan Dandapat , Sunayana Sitaram , Monojit Choudhury

Natural language inference (NLI) is among the most challenging tasks in natural language understanding. Recent work on unsupervised pretraining that leverages unsupervised signals such as language-model and sentence prediction objectives…

Computation and Language · Computer Science 2019-04-30 Tianda Li , Xiaodan Zhu , Quan Liu , Qian Chen , Zhigang Chen , Si Wei

We investigate the predictability of large language model (LLM) capabilities: given records of past experiments using different model families, numbers of parameters, tasks, and numbers of in-context examples, can we accurately predict LLM…

Computation and Language · Computer Science 2023-11-01 Qinyuan Ye , Harvey Yiyun Fu , Xiang Ren , Robin Jia

Machine learning practitioners often face significant challenges in formally integrating their prior knowledge and beliefs into predictive models, limiting the potential for nuanced and context-aware analyses. Moreover, the expertise needed…

Machine Learning · Statistics 2024-12-23 James Requeima , John Bronskill , Dami Choi , Richard E. Turner , David Duvenaud

Although neural models have achieved impressive results on several NLP benchmarks, little is understood about the mechanisms they use to perform language tasks. Thus, much recent attention has been devoted to analyzing the sentence…

Computation and Language · Computer Science 2021-03-09 Abhilasha Ravichander , Yonatan Belinkov , Eduard Hovy

Large NLP models have recently shown impressive performance in language understanding tasks, typically evaluated by their fine-tuned performance. Alternatively, probing has received increasing attention as being a lightweight method for…

Computation and Language · Computer Science 2022-10-17 Zining Zhu , Soroosh Shahtalebi , Frank Rudzicz

Assessing instruction quality is a fundamental component of any improvement efforts in the education system. However, traditional manual assessments are expensive, subjective, and heavily dependent on observers' expertise and idiosyncratic…

Computation and Language · Computer Science 2025-01-03 Paiheng Xu , Jing Liu , Nathan Jones , Julie Cohen , Wei Ai

Reliable uncertainty quantification is a first step towards building explainable, transparent, and accountable artificial intelligent systems. Recent progress in Bayesian deep learning has made such quantification realizable. In this paper,…

Computation and Language · Computer Science 2018-11-20 Yijun Xiao , William Yang Wang

Research in natural language processing proceeds, in part, by demonstrating that new models achieve superior performance (e.g., accuracy) on held-out test data, compared to previous results. In this paper, we demonstrate that test-set…

Machine Learning · Computer Science 2019-09-09 Jesse Dodge , Suchin Gururangan , Dallas Card , Roy Schwartz , Noah A. Smith

Large language models (LLMs) have demonstrated unprecedented emergent capabilities, including content generation, translation, and simulation of human behavior. Field experiments, on the other hand, are widely employed in social studies to…

Computers and Society · Computer Science 2025-05-22 Yaoyu Chen , Yuheng Hu , Yingda Lu

The dominating NLP paradigm of training a strong neural predictor to perform one task on a specific dataset has led to state-of-the-art performance in a variety of applications (eg. sentiment classification, span-prediction based question…

Computation and Language · Computer Science 2021-09-06 Paul Michel

Behavioral testing in NLP allows fine-grained evaluation of systems by examining their linguistic capabilities through the analysis of input-output behavior. Unfortunately, existing work on behavioral testing in Machine Translation (MT) is…

Computation and Language · Computer Science 2023-11-06 Javier Ferrando , Matthias Sperber , Hendra Setiawan , Dominic Telaar , Saša Hasan

While NLP models often seek to capture cognitive states via language, the validity of predicted states is determined by comparing them to annotations created without access the cognitive states of the authors. In behavioral sciences,…

Computation and Language · Computer Science 2025-02-20 Vasudha Varadarajan , Syeda Mahwish , Xiaoran Liu , Julia Buffolino , Christian C. Luhmann , Ryan L. Boyd , H. Andrew Schwartz

Despite widespread success in language understanding and generation, large language models (LLMs) exhibit unclear and often inconsistent behavior when faced with tasks that require probabilistic reasoning. In this work, we present the first…

Computation and Language · Computer Science 2025-09-29 Mobina Pournemat , Keivan Rezaei , Gaurang Sriramanan , Arman Zarei , Jiaxiang Fu , Yang Wang , Hamid Eghbalzadeh , Soheil Feizi
‹ Prev 1 2 3 10 Next ›