数据分析、统计与概率
Life and language are discrete combinatorial systems (DCSs) in which the basic building blocks are finite sets of elementary units: nucleotides or codons in a DNA sequence and letters or words in a language. Different combinations of these…
We introduce the EMC algorithm for reconstructing a particle's 3D diffraction intensity from very many photon shot-noise limited 2D measurements, when the particle orientation in each measurement is unknown. The algorithm combines a…
The RooStatsCms (RSC) software framework allows analysis modelling and combination, statistical studies together with the access to sophisticated graphics routines for results visualisation. The goal of the project is to complement the…
The non-scientific event of a soccer match is analysed on a strictly scientific level. The analysis is based on the recently introduced concept of a team fitness (Eur. Phys. J. B 67, 445, 2009) and requires the use of finite-size scaling. A…
Weighted histograms in Monte Carlo simulations are often used for the estimation of probability density functions. They are obtained as a result of random experiments with random events that have weights. In this paper, the bin contents of…
The visibility algorithm has been recently introduced as a mapping between time series and complex networks. This procedure allows to apply methods of complex network theory for characterizing time series. In this work we present the…
A given set of data-points in some feature space may be associated with a Schrodinger equation whose potential is determined by the data. This is known to lead to good clustering solutions. Here we extend this approach into a full-fledged…
The researchers have drawn much attention about the birth weight of newborn babies in the last three decades. The birth weight is one of the vital roles in the babys health. So many researchers such as (2),(1) and (4) analyzed the birth…
We study the relationship between dynamical properties and interaction patterns in complex oscillator networks in the presence of noise. A striking finding is that noise leads to a general, one-to-one correspondence between the dynamical…
The D0 experiment at Fermilab's Tevatron will record several petabytes of data over the next five years in pursuing the goals of understanding nature and searching for the origin of mass. Computing resources required to analyze these data…
In the last 30 years it was found that many combinatorial systems undergo phase transitions. One of the most important examples of these can be found among the random k-satisfiability problems (often referred to as k-SAT), asking whether…
The effect of solar magnetic activity on the yearly mean average temperature is extracted from the historical record for much of North America. The level of solar activity is derived from the international sunspot number by the renormalized…
The emerging system at the European level can be conceptualized as a pattern of relations among member states that tends to be reproduced despite disturbances in individual trajectories. The Markov property is used as an indicator of…
The well established procedure of constructing phenomenological ensemble from a single long time series is investigated. It is determined that a time series generated by a simple Uhlenbeck-Ornstein Langevin equation is mean ergodic. However…
We study the shrinking Pearson random walk in two dimensions and greater, in which the direction of the Nth is random and its length equals lambda^{N-1}, with lambda<1. As lambda increases past a critical value lambda_c, the endpoint…
An approach to land surface temperature (LST) estimation that relies upon Bayesian inference has been tested against multiband infrared radiometric imagery from the Terra MODIS instrument. Bayesian LST estimators are shown to reproduce…
A C++ class was written for the calculation of frequentist confidence intervals using the profile likelihood method. Seven combinations of Binomial, Gaussian, Poissonian and Binomial uncertainties are implemented. The package provides…
Spectral clustering uses the global information embedded in eigenvectors of an inter-item similarity matrix to correctly identify clusters of irregular shape, an ability lacking in commonly used approaches such as k-means and agglomerative…
We comment on the recent manuscript by Raines et al. [arXiv:0905.0269v2] (now published in Nature, vol. 463, p. 214-217, 2010), which suggests that in certain conditions a single diffraction measurement may be sufficient to reconstruct the…
We present an analytic method for calculating spectral densities of empirical covariance matrices for correlated data. In this approach the data is represented as a rectangular random matrix whose columns correspond to sampled states of the…