English
Related papers

Related papers: Cognitive Limits Shape Language Statistics

200 papers

Zipf's law has been found in many human-related fields, including language, where the frequency of a word is persistently found as a power law function of its frequency rank, known as Zipf's law. However, there is much dispute whether it is…

Computation and Language · Computer Science 2018-07-06 Shuiyuan Yu , Chunshan Xu , Haitao Liu

Human language, the most powerful communication system in history, is closely associated with cognition. Written text is one of the fundamental manifestations of language, and the study of its universal regularities can give clues about how…

Computation and Language · Computer Science 2009-02-05 M. Angeles Serrano , Alessandro Flammini , Filippo Menczer

The frequencies at which individual words occur across languages follow power law distributions, a pattern of findings known as Zipf's law. A vast literature argues over whether this serves to optimize the efficiency of human communication,…

Computation and Language · Computer Science 2020-01-16 Michael Ramscar

Zipf's law of abbreviation, namely the tendency of more frequent words to be shorter, has been viewed as a manifestation of compression, i.e. the minimization of the length of forms -- a universal principle of natural communication.…

Computation and Language · Computer Science 2026-03-31 Sonia Petrini , Antoni Casas-i-Muñoz , Jordi Cluet-i-Martinell , Mengxue Wang , Christian Bentz , Ramon Ferrer-i-Cancho

Human language, as a typical complex system, its organization and evolution is an attractive topic for both physical and cultural researchers. In this paper, we present the first exhaustive analysis of the text organization of human speech.…

Computation and Language · Computer Science 2015-01-08 Ruokuang Lin , Qianli D. Y. Ma , Chunhua Bian

Here we present a new class of optimality for coding systems. Members of that class are displaced linearly from optimal coding and thus exhibit Zipf's law, namely a power-law distribution of frequency ranks. Within that class, Zipf's law,…

Computation and Language · Computer Science 2025-10-31 Ramon Ferrer-i-Cancho

Here we sketch a new derivation of Zipf's law for word frequencies based on optimal coding. The structure of the derivation is reminiscent of Mandelbrot's random typing model but it has multiple advantages over random typing: (1) it starts…

Computation and Language · Computer Science 2020-09-24 Ramon Ferrer-i-Cancho

The Zipf's law establishes that if the words of a (large) text are ordered by decreasing frequency, the frequency versus the rank decreases as a power law with exponent close to $-1$. Previous work has stressed that this pattern arises from…

Physics and Society · Physics 2019-04-03 Felipe Urbina , Javier Vera

A family of information theoretic models of communication was introduced more than a decade ago to explain the origins of Zipf's law for word frequencies. The family is a based on a combination of two information theoretic principles:…

Physics and Society · Physics 2020-09-24 Ramon Ferrer-i-Cancho

In this study, we investigate whether speech symbols, learned through deep learning, follow Zipf's law, akin to natural language symbols. Zipf's law is an empirical law that delineates the frequency distribution of words, forming…

Computation and Language · Computer Science 2023-09-19 Shinnosuke Takamichi , Hiroki Maeda , Joonyong Park , Daisuke Saito , Hiroshi Saruwatari

Zipf's law of abbreviation, the tendency of more frequent words to be shorter, is one of the most solid candidates for a linguistic universal, in the sense that it has the potential for being exceptionless or with a number of exceptions…

Computation and Language · Computer Science 2023-10-13 Sonia Petrini , Antoni Casas-i-Muñoz , Jordi Cluet-i-Martinell , Mengxue Wang , Chris Bentz , Ramon Ferrer-i-Cancho

We study a deliberately simple, fully non-linguistic model of text: a sequence of independent draws from a finite alphabet of letters plus a single space symbol. A word is defined as a maximal block of non-space symbols. Within this…

Computation and Language · Computer Science 2025-11-25 Vladimir Berman

Zipf's law states that if words of language are ranked in the order of decreasing frequency in texts, the frequency of a word is inversely proportional to its rank. It is very robust as an experimental observation, but to date it escaped…

Computation and Language · Computer Science 2009-01-22 Dmitrii Manin

Zipf's law is a fundamental paradigm in the statistics of written and spoken natural language as well as in other communication systems. We raise the question of the elementary units for which Zipf's law should hold in the most natural way,…

Physics and Society · Physics 2015-07-14 Alvaro Corral , Gemma Boleda , Ramon Ferrer-i-Cancho

We investigate the origin of Zipf's law for words in written texts by means of a stochastic dynamical model for text generation. The model incorporates both features related to the general structure of languages and memory effects inherent…

Statistical Mechanics · Physics 2007-05-23 Damián H. Zanette , Marcelo A. Montemurro

Human language has a distinct systematic structure, where utterances break into individually meaningful words which are combined to form phrases. We show that natural-language-like systematicity arises in codes that are constrained by a…

Computation and Language · Computer Science 2025-11-19 Richard Futrell , Michael Hahn

Natural languages are full of rules and exceptions. One of the most famous quantitative rules is Zipf's law which states that the frequency of occurrence of a word is approximately inversely proportional to its rank. Though this `law' of…

Computation and Language · Computer Science 2015-05-27 Jake Ryland Williams , James P. Bagrow , Christopher M. Danforth , Peter Sheridan Dodds

Quantitative linguistics has provided us with a number of empirical laws that characterise the evolution of languages and competition amongst them. In terms of language usage, one of the most influential results is Zipf's law of word…

Physics and Society · Physics 2009-01-21 Alvaro Corral , Ramon Ferrer-i-Cancho , Gemma Boleda , Albert Diaz-Guilera , .

The problem of compression in standard information theory consists of assigning codes as short as possible to numbers. Here we consider the problem of optimal coding -- under an arbitrary coding scheme -- and show that it predicts Zipf's…

Computation and Language · Computer Science 2020-09-24 Ramon Ferrer-i-Cancho , Christian Bentz , Caio Seguin

This paper studies the limits of language models' statistical learning in the context of Zipf's law. First, we demonstrate that Zipf-law token distribution emerges irrespective of the chosen tokenization. Second, we show that Zipf…

Computation and Language · Computer Science 2022-11-22 Elizaveta Zhemchuzhina , Nikolai Filippov , Ivan P. Yamshchikov
‹ Prev 1 2 3 10 Next ›