English
Related papers

Related papers: Microbial genome as a fluctuating system: Distribu…

200 papers

We show that textual analysis of microbial genomes reveal telling footprints of the early evolution of the genomes. The frequencies of word occurrence of random DNA sequences considered as texts in their four nucleotides are expected to…

Biological Physics · Physics 2007-05-23 Li-Ching Hsieh , Liaofu Luo , HC Lee

The mean length and the variability of coding sequences for 48 genomes of bacteria and archaea were analyzed. It was found that the plotted data can be described by an angular area. This suggests the followings: a) The variability of a…

Genomics · Quantitative Biology 2008-05-12 Vasile V. Morariu

This paper considers three kinds of length sequences of the complete genome. Detrended fluctuation analysis, spectral analysis, and the mean distance spanned within time $L$ are used to discuss the correlation property of these sequences.…

Biological Physics · Physics 2009-11-06 Zu-Guo Yu , V. V. Anh , Bin Wang

Natural languages are full of rules and exceptions. One of the most famous quantitative rules is Zipf's law which states that the frequency of occurrence of a word is approximately inversely proportional to its rank. Though this `law' of…

Computation and Language · Computer Science 2015-05-27 Jake Ryland Williams , James P. Bagrow , Christopher M. Danforth , Peter Sheridan Dodds

Statistical analysis of distributions of occurrence frequencies of short words in 108 microbial complete genomes reveals the existence of a set of universal "root-sequence lengths" shared by all microbial genomes. These lengths and their…

Biological Physics · Physics 2007-05-23 Li-Ching Hsieh , Chang-Heng Chang , Liaofu Luo , Fengmin Ji , Hoong-Chien Lee

Zipf's law has been found in many human-related fields, including language, where the frequency of a word is persistently found as a power law function of its frequency rank, known as Zipf's law. However, there is much dispute whether it is…

Computation and Language · Computer Science 2018-07-06 Shuiyuan Yu , Chunshan Xu , Haitao Liu

In this paper we combine statistical analysis of large text databases and simple stochastic models to explain the appearance of scaling laws in the statistics of word frequencies. Besides the sublinear scaling of the vocabulary size with…

Physics and Society · Physics 2014-11-05 Martin Gerlach , Eduardo G. Altmann

Bacterial genomes and large-scale computer software projects both consist of a large number of components (genes or software packages) connected via a network of mutual dependencies. Components can be easily added or removed from individual…

Molecular Networks · Quantitative Biology 2013-08-12 Tin Yau Pang , Sergei Maslov

Long-range correlations are found in symbolic sequences from human language, music and DNA. Determining the span of correlations in dolphin whistle sequences is crucial for shedding light on their communicative complexity. Dolphin whistles…

Neurons and Cognition · Quantitative Biology 2014-12-03 Ramon Ferrer-i-Cancho , Brenda McCowan

The dependence with text length of the statistical properties of word occurrences has long been considered a severe limitation quantitative linguistics. We propose a simple scaling form for the distribution of absolute word frequencies…

Physics and Society · Physics 2015-06-15 Francesc Font-Clos , Gemma Boleda , Álvaro Corral

Statistical analysis of bacteria genomes texts has been performed on the basis of 20 complete genomes origin from Genebank. It has been revealed that the word ranked distributions are quite well approximated by logarithmic law. Results…

Condensed Matter · Physics 2009-10-31 Olga V. Kirillova

We investigate the propagation of random fluctuations through biochemical networks in which the concentrations of species are large enough so that the unperturbed problem is well-described by ordinary differential equation. We characterize…

Probability · Mathematics 2007-05-23 David Anderson , Jonathan Mattingly , H. Frederik Nijhout , Michael Reed

Languages across the world exhibit Zipf's law of abbreviation, namely more frequent words tend to be shorter. The generalized version of the law - an inverse relationship between the frequency of a unit and its magnitude - holds also for…

Information Theory · Computer Science 2016-05-05 R. Ferrer-i-Cancho , C. Bentz , C. Seguin

The frequency distributions of DNA k-mers are shaped by fundamental biological processes and offer a window into genome structure and evolution. Inspired by analogies to natural language, prior studies have attempted to model genomic k-mer…

Much evolutionary information is stored in the fluctuations of protein length distributions. The genome size and non-coding DNA content can be calculated based only on the protein length distributions. So there is intrinsic relationship…

Genomics · Quantitative Biology 2008-06-03 Dirson Jian Li , Shengli Zhang

We study statistical properties of DNA chains of thirteen microbial complete genomes. We find that the power spectrum of several of the sequences studied flattens off in the low frequency limit. This implies that the correlation length in…

Statistical Mechanics · Physics 2009-10-31 Maria de Sousa Vieira

The binary many-step Markov chain with the step-like memory function is considered as a model for the analysis of rank distributions of words in stochastic symbolic dynamical systems. We prove that the envelope curve for this distribution…

History and Philosophy of Physics · Physics 2007-05-23 K. E. Kechedzhy O. V. Usatenko , V. A. Yampol'skii

In our paper selected linguistic features of genomes to study the statistics of the gene codes are considered. We present the information theory from which it follows that if the system is described by distributions of hyperbolic type it…

Genomics · Quantitative Biology 2014-07-10 Krystyna Lukierska-Walasek , Krzysztof Topolski , Krzysztof Trojanowski

The nature of the quantitative distribution of the 64 DNA codons in the human genome has been an issue of debate for over a decade. Some groups have proposed that the quantitative distribution of the DNA codons ordered as a rank-frequency…

Genomics · Quantitative Biology 2015-01-06 Bohdan B. Khomtchouk

The importance of statistical patterns of language has been debated over decades. Although Zipf's law is perhaps the most popular case, recently, Menzerath's law has begun to be involved. Menzerath's law manifests in language, music and…

‹ Prev 1 2 3 10 Next ›