English
Related papers

Related papers: On measuring linguistic intelligence

200 papers

In this paper I propose a new way of measuring linguistic productivity that objectively assesses the ability of an affix to be used to coin new complex words and, unlike other popular measures, is not directly dependent upon token…

Computation and Language · Computer Science 2023-08-25 Sergei Monakhov

The development of Large Language Models (LLMs) relies on extensive text corpora, which are often unevenly distributed across languages. This imbalance results in LLMs performing significantly better on high-resource languages like English,…

Computation and Language · Computer Science 2024-12-12 Zihao Li , Yucheng Shi , Zirui Liu , Fan Yang , Ali Payani , Ninghao Liu , Mengnan Du

Results reported in large-scale multilingual evaluations are often fragmented and confounded by factors such as target languages, differences in experimental setups, and model choices. We propose a framework that disentangles these…

Computation and Language · Computer Science 2025-08-26 Songbo Hu , Ivan Vulić , Anna Korhonen

Large Language Models (LLMs), often show strong performance on English tasks, while exhibiting limitations on other languages. What is an LLM's multilingual capability when it is trained only on certain languages? The underlying mechanism…

Computation and Language · Computer Science 2024-06-04 Fei Yuan , Shuai Yuan , Zhiyong Wu , Lei Li

We analyse preference inference, through consistency, for general preference languages based on lexicographic models. We identify a property, which we call strong compositionality, that applies for many natural kinds of preference…

Logic in Computer Science · Computer Science 2024-11-01 Nic Wilson , Anne-Marie George

In this paper, we introduce PolyMath, a multilingual mathematical reasoning benchmark covering 18 languages and 4 easy-to-hard difficulty levels. Our benchmark ensures difficulty comprehensiveness, language diversity, and high-quality…

Language models (LMs) estimate a probability distribution over strings in a natural language; these distributions are crucial for computing perplexity and surprisal in linguistics research. While we are usually concerned with measuring…

Computation and Language · Computer Science 2024-10-15 Tiago Pimentel , Clara Meister

Multilingual Large Language Models (LLMs) develop cross-lingual abilities despite being trained on limited parallel data. However, they often struggle to generate responses in the intended language, favoring high-resource languages such as…

Computation and Language · Computer Science 2025-06-02 Elnaz Rahmati , Alireza S. Ziabari , Morteza Dehghani

Large language models (LLMs) provide detailed and impressive responses to queries in English. However, are they really consistent at responding to the same query in other languages? The popular way of evaluating for multilingual performance…

Computation and Language · Computer Science 2025-05-29 Ashim Gupta , Maitrey Mehta , Zhichao Xu , Vivek Srikumar

To ensure equitable access to the benefits of large language models (LLMs), it is essential to evaluate their capabilities across the world's languages. We introduce the AI Language Proficiency Monitor, a comprehensive multilingual…

Computation and Language · Computer Science 2025-07-14 David Pomerenke , Jonas Nothnagel , Simon Ostermann

People vary in their ability to make accurate predictions about the future. Prior studies have shown that some individuals can predict the outcome of future events with consistently better accuracy. This leads to a natural question: what…

Computation and Language · Computer Science 2020-06-17 Shi Zong , Alan Ritter , Eduard Hovy

The linguistic abilities of Large Language Models are a matter of ongoing debate. This study contributes to this discussion by investigating model performance in a morphological generalization task that involves novel words. Using a…

Computation and Language · Computer Science 2026-04-02 Nikoleta Pantelidou , Evelina Leivada , Raquel Montero , Paolo Morosi

This study introduces an innovative multilingual bias evaluation framework for assessing bias in Large Language Models, combining explicit bias assessment through the BBQ benchmark with implicit bias measurement using a prompt-based…

Computers and Society · Computer Science 2025-12-19 Yuxuan Liang , Marwa Mahmoud

Various measures of dispersion have been proposed to paint a fuller picture of a word's distribution in a corpus, but only little has been done to validate them externally. We evaluate a wide range of dispersion measures as predictors of…

Computation and Language · Computer Science 2025-01-14 Adam Nohejl , Taro Watanabe

Even in highly-developed countries, as many as 15-30\% of the population can only understand texts written using a basic vocabulary. Their understanding of everyday texts is limited, which prevents them from taking an active role in society…

Computation and Language · Computer Science 2022-09-13 Sanja Stajner , Daniel Ferres , Matthew Shardlow , Kai North , Marcos Zampieri , Horacio Saggion

Invitation to the statistical study of language: The topic of this presentation is the interdisciplinary nexus between linguistics and statistics. It targets linguists, for whom it may have a theoretical interest, or professionals that work…

Applications · Statistics 2018-04-23 Rogelio Nazar

Large Language Models (LLMs) are increasingly deployed across multilingual applications that handle sensitive data, yet their scale and linguistic variability introduce major privacy risks. Mostly evaluated for English, this paper…

Computation and Language · Computer Science 2025-10-13 Abhishek K. Mishra , Antoine Boutet , Lucas Magnana

Portfolio diversification, traditionally measured through asset correlations and volatilitybased metrics, is fundamental to managing financial risk. However, existing diversification metrics often overlook non-numerical relationships…

Portfolio Management · Quantitative Finance 2024-11-12 Sayyed Faraz Mohseni , Hamid R. Arian , Jean-François Bégin

Large Language Models (LLMs) have been profusely evaluated on their ability to answer questions on many topics and their performance on different natural language understanding tasks. Those tests are usually conducted in English, but most…

Computation and Language · Computer Science 2024-09-25 Marina Mayor-Rocher , Nina Melero , Elena Merino-Gómez , María Grandury , Javier Conde , Pedro Reviriego

Corpora and web texts can become a rich language learning resource if we have a means of assessing whether they are linguistically appropriate for learners at a given proficiency level. In this paper, we aim at addressing this issue by…

Computation and Language · Computer Science 2016-03-30 Ildikó Pilán , Sowmya Vajjala , Elena Volodina
‹ Prev 1 2 3 10 Next ›