数据分析、统计与概率
We report a statistical analysis over more than eight thousand songs. Specifically, we investigate the probability distribution of the normalized sound amplitudes. Our findings seems to suggest a universal form of distribution which…
This article surveys the procedures used for deriving detector transfer functions and normalizing probability densities for the statistical analysis technique known as the "matrix element method" in the context of high energy physics (HEP)…
A detailed presentation of hypothesis testing is given. The "look elsewhere" effect is illustrated, and a treatment of the trials factor is proposed with the introduction of hypothesis hypertests. An example of such a hypertest is…
We have devised a simple numerical technique to treat rugged data points that arise due to the insufficient gain setting error (or quantization error) of a digital instrument. This is a very wide spread problem that all experimentalists…
Many investigations of scientific collaboration are based on statistical analyses of large networks constructed from bibliographic repositories. These investigations often rely on a wealth of bibliographic data, but very little or no other…
Background: Zipf's law and Heaps' law are observed in disparate complex systems. Of particular interests, these two laws often appear together. Many theoretical models and analyses are performed to understand their co-occurrence in real…
These lectures concern two topics that are becoming increasingly important in the analysis of High Energy Physics (HEP) data: Bayesian statistics and multivariate methods. In the Bayesian approach we extend the interpretation of probability…
CIPM published the Supplement I for GUM in 2008 as not only an alternative approach to estimate the uncertainty for a given calibration measurement but also as a proper uncertainty estimation one, whenever any of the conditions imposed in…
In this presentation the experiences of the LHC experiments using grid computing were presented with a focus on experience with distributed analysis. After many years of development, preparation, exercises, and validation the LHC (Large…
In this paper, we develop a novel approach to measuring urban sprawl based on street nodes and naturally defined urban boundaries, both extracted from massive volunteered geographic information OpenStreetMap databases through some…
For Poisson distribution $Pois(n, \lambda)$ with $\lambda \gg 1$, $n \gg 1$ we propose to determine significance as $S = \frac{n_{obs}-\lambda}{\sqrt{\lambda}}$. The significance $S$ coincides up to sign with often used significance. For…
Modern network-like systems are usually coupled in such a way that failures in one network can affect the entire system. In infrastructures, biology, sociology, and economy, systems are interconnected and events taking place in one system…
Two utmost cases of super-extreme event's influence on the velocity autocorrelation function (VAF) were considered. The VAF itself was derived within the hierarchical Weierstrass-Mandelbrot Continuous-Time Random Walk (WM-CTRW) formalism,…
A painting consists of objects which are arranged in specific ways. The art of painting is drawing the objects, which can be considered as known trends, in an expressive manner. Detrended methods are suitable for characterizing the artistic…
We report remarkable similarities in the output signal of two distinct out-of- equilibrium physical systems - earthquakes and the intermittent acoustic noise emitted by crum- pled plastic sheets - Biaxially Oriented Polypropylene (BOPP)…
We propose a new method to test the effectiveness of a spatial point process forecast based on a log-likelihood score for predicted point density and the information gain for events that actually occurred in the test period. The method…
The continuous wavelet transform is adapted to account for signal truncation through renormalization and by modifying the shape of the analyzing window. Comparison is made of the instant and integrated wavelet power with previous…
The HBT-Analyzer is an universal tool for particle correlations analysis under the ROOT environment. It provides an efficient mixing mechanism, a wide range of correlation and monitoring functions, and a set of cuts that are applicable on…
This paper develops an analytical and rigorous formulation of the maximum entropy generation principle. The result is suggested as the Fourth Law of Thermodynamics.
We study the maximum entropy (MaxEnt) approach for analytical continuation of spectral data from imaginary times to real frequencies. The total error is divided in a statistical error, due to the noise in the input data, and a systematic…