English
Related papers

Related papers: Coverage statistics for sequence census methods

200 papers

The estimation of coverage probabilities, and in particular of the missing mass, is a classical statistical problem with applications in numerous scientific fields. In this paper, we study this problem in relation to randomized data…

Methodology · Statistics 2022-09-07 Stefano Favaro , Matteo Sesia

At the core of high throughput DNA sequencing platforms lies a bio-physical surface process that results in a random geometry of clusters of homogenous short DNA fragments typically hundreds of base pairs long - bridge amplification. The…

Genomics · Quantitative Biology 2015-08-13 Eliza O'Reilly , Francois Baccelli , Gustavo de Veciana , Haris Vikalo

Motivated by the fundamental problem of measuring species diversity, this paper introduces the concept of a cluster structure to define an exchangeable cluster probability function that governs the joint distribution of a random count and…

Methodology · Statistics 2014-10-14 Mingyuan Zhou , Stephen G Walker

Sequencing by synthesis is used in many next-generation DNA sequencing technologies. Some of the technologies, especially those exploring the principle of single-molecule sequencing, allow incomplete nucleotide incorporation in each cycle.…

Genomics · Quantitative Biology 2024-05-28 Yong Kong

This work focuses on clustering populations with a hierarchical dependency structure that can be described by a tree. A particular example that is the focus of our work is the phylogenetic tree, with nodes often representing biological…

Methodology · Statistics 2023-02-28 Hanxi Sun , Heejung Shim , Vinayak Rao

Motivated by the fundamental problem of modeling the frequency of frequencies (FoF) distribution, this paper introduces the concept of a cluster structure to define a probability function that governs the joint distribution of a random…

Methodology · Statistics 2016-08-02 Mingyuan Zhou , Stefano Favaro , Stephen G Walker

The spectrum and coherency are useful quantities for characterizing the temporal correlations and functional relations within and between point processes. This paper begins with a review of these quantities, their interpretation and how…

Biological Physics · Physics 2007-05-23 M. R. Jarvis , P. P. Mitra

We study spanning trees on Sierpinski graphs (i.e., finite approximations to the Sierpinski gasket) that are chosen uniformly at random. We construct a joint probability space for uniform spanning trees on every finite Sierpinski graph and…

Probability · Mathematics 2015-01-14 Masato Shinoda , Elmar Teufl , Stephan Wagner

We study some stochastic models of physical mapping of genomic sequences. Our starting point is a global construction of the process of the clones and of the process of the anchors which are used to map the sequence. This yields explicit…

Probability · Mathematics 2007-05-23 Didier Piau

This paper describes a compound Poisson-based random effects structure for modeling zero-inflated data. Data with large proportion of zeros are found in many fields of applied statistics, for example in ecology when trying to model and…

Applications · Statistics 2009-07-29 Marie-Pierre Etienne , Eric Parent , Benoit Hugues , Bernier Jacques

We wish to estimate the total number of classes in a population based on sample counts, especially in the presence of high latent diversity. Drawing on probability theory that characterizes distributions on the integers by ratios of…

Methodology · Statistics 2014-12-10 A. Willis , J. Bunge

The main statistical distributions applicable to the analysis of genome architecture and genome tracks are briefly discussed and critically assessed. Although the observed features in distributions of element lengths can be equally well…

Other Quantitative Biology · Quantitative Biology 2015-06-17 V. R. Chechetkin

We discuss a model of protein conformations where the conformations are combinations of short fragments from some small set. For these fragments we consider a distribution of frequencies of occurrence of pairs (sequence of amino acids,…

Biomolecules · Quantitative Biology 2016-07-05 S. V. Kozyrev

We show that textual analysis of microbial genomes reveal telling footprints of the early evolution of the genomes. The frequencies of word occurrence of random DNA sequences considered as texts in their four nucleotides are expected to…

Biological Physics · Physics 2007-05-23 Li-Ching Hsieh , Liaofu Luo , HC Lee

In this paper, we address the question of comparison between populations of trees. We study an statistical test based on the distance between empirical mean trees, as an analog of the two sample z statistic for comparing two means. Despite…

Statistics Theory · Mathematics 2007-08-14 Ana Georgina Flesia , Ricardo Fraiman

We consider invasion percolation on Galton-Watson trees. On almost every Galton-Watson tree, the invasion cluster almost surely contains only one infinite path. This means that for almost every Galton-Watson tree, invasion percolation…

Probability · Mathematics 2018-02-06 Marcus Michelen , Robin Pemantle , Josh Rosenberg

A new family of compound Poisson distribution functions from statistical linguistic is used to study the n-tuples and nucleotide composition features of DNA sequences. The relative frequency distribution of the 6-tuples and 7- tuples…

Statistical Mechanics · Physics 2007-05-23 K. L. Ng , S. P. Li

Large-scale statistical analysis of data sets associated with genome sequences plays an important role in modern biology. A key component of such statistical analyses is the computation of $p$-values and confidence bounds for statistics…

Applications · Statistics 2011-01-06 Peter J. Bickel , Nathan Boley , James B. Brown , Haiyan Huang , Nancy R. Zhang

There is currently a gap in theory for point patterns that lie on the surface of objects, with researchers focusing on patterns that lie in a Euclidean space, typically planar and spatial data. Methodology for planar and spatial data thus…

Statistics Theory · Mathematics 2020-02-11 Scott Ward , Edward A. K. Cohen , Niall Adams

We establish a complete picture of condensation in the inclusion process in the thermodynamic limit with vanishing diffusion, covering all scaling regimes of the diffusion parameter and including large deviation results for the maximum…

Probability · Mathematics 2021-07-21 Watthanan Jatuviriyapornchai , Paul Chleboun , Stefan Grosskinsky
‹ Prev 1 2 3 10 Next ›