Related papers: On measuring linguistic intelligence

Probabilistic Method of Measuring Linguistic Productivity

In this paper I propose a new way of measuring linguistic productivity that objectively assesses the ability of an affix to be used to coin new complex words and, unlike other popular measures, is not directly dependent upon token…

Computation and Language · Computer Science 2023-08-25 Sergei Monakhov

Language Ranker: A Metric for Quantifying LLM Performance Across High and Low-Resource Languages

The development of Large Language Models (LLMs) relies on extensive text corpora, which are often unevenly distributed across languages. This imbalance results in LLMs performing significantly better on high-resource languages like English,…

Computation and Language · Computer Science 2024-12-12 Zihao Li, Yucheng Shi, Zirui Liu, Fan Yang, Ali Payani, Ninghao Liu, Mengnan Du

Quantifying Language Disparities in Multilingual Large Language Models

Results reported in large-scale multilingual evaluations are often fragmented and confounded by factors such as target languages, differences in experimental setups, and model choices. We propose a framework that disentangles these…

Computation and Language · Computer Science 2025-08-26 Songbo Hu, Ivan Vulić, Anna Korhonen

How Vocabulary Sharing Facilitates Multilingualism in LLaMA?

Large Language Models (LLMs) often show strong performance on English tasks, while exhibiting limitations on other languages. What is an LLM's multilingual capability when it is trained only on certain languages? The underlying mechanism…

Computation and Language · Computer Science 2024-06-04 Fei Yuan, Shuai Yuan, Zhiyong Wu, Lei Li

Efficient Inference and Computation of Optimal Alternatives for Preference Languages Based On Lexicographic Models

We analyse preference inference, through consistency, for general preference languages based on lexicographic models. We identify a property, which we call strong compositionality, that applies for many natural kinds of preference…

Logic in Computer Science · Computer Science 2024-11-01 Nic Wilson, Anne-Marie George

PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts

In this paper, we introduce PolyMath, a multilingual mathematical reasoning benchmark covering 18 languages and 4 easy-to-hard difficulty levels. Our benchmark ensures difficulty comprehensiveness, language diversity, and high-quality…

Computation and Language · Computer Science 2025-11-04 Yiming Wang, Pei Zhang, Jialong Tang, Haoran Wei, Baosong Yang, Rui Wang, Chenshu Sun, Feitong Sun, Jiran Zhang, Junxuan Wu, Qiqian Cang, Yichang Zhang, Fei Huang, Junyang Lin, Fei Huang, Jingren Zhou

How to Compute the Probability of a Word

Language models (LMs) estimate a probability distribution over strings in a natural language; these distributions are crucial for computing perplexity and surprisal in linguistics research. While we are usually concerned with measuring…

Computation and Language · Computer Science 2024-10-15 Tiago Pimentel, Clara Meister

CoCo-CoLa: Evaluating and Improving Language Adherence in Multilingual LLMs

Multilingual Large Language Models (LLMs) develop cross-lingual abilities despite being trained on limited parallel data. However, they often struggle to generate responses in the intended language, favoring high-resource languages such as…

Computation and Language · Computer Science 2025-06-02 Elnaz Rahmati, Alireza S. Ziabari, Morteza Dehghani

Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate

Large language models (LLMs) provide detailed and impressive responses to queries in English. However, are they really consistent at responding to the same query in other languages? The popular way of evaluating for multilingual performance…

Computation and Language · Computer Science 2025-05-29 Ashim Gupta, Maitrey Mehta, Zhichao Xu, Vivek Srikumar

The AI Language Proficiency Monitor -- Tracking the Progress of LLMs on Multilingual Benchmarks

To ensure equitable access to the benefits of large language models (LLMs), it is essential to evaluate their capabilities across the world's languages. We introduce the AI Language Proficiency Monitor, a comprehensive multilingual…

Computation and Language · Computer Science 2025-07-14 David Pomerenke, Jonas Nothnagel, Simon Ostermann

Measuring Forecasting Skill from Text

People vary in their ability to make accurate predictions about the future. Prior studies have shown that some individuals can predict the outcome of future events with consistently better accuracy. This leads to a natural question: what…

Computation and Language · Computer Science 2020-06-17 Shi Zong, Alan Ritter, Eduard Hovy

Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test

The linguistic abilities of Large Language Models are a matter of ongoing debate. This study contributes to this discussion by investigating model performance in a morphological generalization task that involves novel words. Using a…

Computation and Language · Computer Science 2026-04-02 Nikoleta Pantelidou, Evelina Leivada, Raquel Montero, Paolo Morosi

Cross-Language Bias Examination in Large Language Models

This study introduces an innovative multilingual bias evaluation framework for assessing bias in Large Language Models, combining explicit bias assessment through the BBQ benchmark with implicit bias measurement using a prompt-based…

Computers and Society · Computer Science 2025-12-19 Yuxuan Liang, Marwa Mahmoud

Dispersion Measures as Predictors of Lexical Decision Time, Word Familiarity, and Lexical Complexity

Various measures of dispersion have been proposed to paint a fuller picture of a word's distribution in a corpus, but little has been done to validate them externally. We evaluate a wide range of dispersion measures as predictors of…

Computation and Language · Computer Science 2025-01-14 Adam Nohejl, Taro Watanabe

Lexical Simplification Benchmarks for English, Portuguese, and Spanish

Even in highly-developed countries, as many as 15-30% of the population can only understand texts written using a basic vocabulary. Their understanding of everyday texts is limited, which prevents them from taking an active role in society…

Computation and Language · Computer Science 2022-09-13 Sanja Stajner, Daniel Ferres, Matthew Shardlow, Kai North, Marcos Zampieri, Horacio Saggion

Invitación al estudio estadístico del lenguaje

Invitation to the statistical study of language: The topic of this presentation is the interdisciplinary nexus between linguistics and statistics. It targets linguists, for whom it may have a theoretical interest, or professionals that work…

Applications · Statistics 2018-04-23 Rogelio Nazar

The Model's Language Matters: A Comparative Privacy Analysis of LLMs

Large Language Models (LLMs) are increasingly deployed across multilingual applications that handle sensitive data, yet their scale and linguistic variability introduce major privacy risks. Mostly evaluated for English, this paper…

Computation and Language · Computer Science 2025-10-13 Abhishek K. Mishra, Antoine Boutet, Lucas Magnana

The lexical ratio: A new perspective on portfolio diversification

Portfolio diversification, traditionally measured through asset correlations and volatility-based metrics, is fundamental to managing financial risk. However, existing diversification metrics often overlook non-numerical relationships…

Portfolio Management · Quantitative Finance 2024-11-12 Sayyed Faraz Mohseni, Hamid R. Arian, Jean-François Bégin

Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail?

Large Language Models (LLMs) have been profusely evaluated on their ability to answer questions on many topics and their performance on different natural language understanding tasks. Those tests are usually conducted in English, but most…

Computation and Language · Computer Science 2024-09-25 Marina Mayor-Rocher, Nina Melero, Elena Merino-Gómez, María Grandury, Javier Conde, Pedro Reviriego

A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity

Corpora and web texts can become a rich language learning resource if we have a means of assessing whether they are linguistically appropriate for learners at a given proficiency level. In this paper, we aim at addressing this issue by…

Computation and Language · Computer Science 2016-03-30 Ildikó Pilán, Sowmya Vajjala, Elena Volodina