Data Analysis, Statistics and Probability
In this letter, we study some evolution networks that grow with linear preferential attachment. Based upon some recent results on the quotient Gamma function, we give a rigorous proof of the asymptotic Mandelbrot law for the degree…
The Posterior distribution of the Likelihood Ratio (PLR) was proposed by Dempster in 1974 for significance testing in the simple vs. composite hypothesis case. In this hypothesis-testing case, classical frequentist and Bayesian hypothesis tests…
We investigate the longest-path attacks on complex networks. Specifically, we remove approximately the longest simple path from a network iteratively until there are no paths left in the network. We propose two algorithms, the random…
The histogram is an analysis tool in widespread use within many sciences, with high energy physics as a prime example. However, there exists an inherent bias in the choice of binning for the histogram, with different choices potentially…
Lectures presented at the 1st CERN Asia-Europe-Pacific School of High-Energy Physics, Fukuoka, Japan, 14-27 October 2012. A pedagogical selection of topics in probability and statistics is presented. Choice and emphasis are driven by the…
In this paper, we propose to mix the approach underlying Bandt-Pompe permutation entropy with Lempel-Ziv complexity, to design what we call Lempel-Ziv permutation complexity. The principle consists of two steps: (i) transformation of a…
Oscillation probability calculations are becoming increasingly CPU intensive in modern neutrino oscillation analyses. The independence of reweighting individual events in a Monte Carlo sample lends itself to parallel implementation on a…
Suppose that in a multiple choice examination the leading digits of the correct options follow Benford's Law, while the leading digits of the distractors are uniform. Consider a strategy for guessing at answers that selects the option…
Self-exciting point processes describe the manner in which every event facilitates the occurrence of succeeding events. By increasing excitability, the event occurrences start to exhibit bursts even in the absence of external stimuli. We…
The systematic biases seen in people's probability judgments are typically taken as evidence that people do not reason about probability using the rules of probability theory, but instead use heuristics which sometimes yield reasonable…
Data assimilation consists of combining all available pieces of information about a system to obtain optimal estimates of initial states. The different sources of information are weighted according to their accuracy by means of…
We describe an iterative unfolding method for experimental data, making use of a regularization function. The use of this function allows one to build an improved normalization procedure for Monte Carlo spectra, unbiased by the presence of…
The use of correlation matrices to evaluate the number of uncorrelated stirrer positions of a reverberation chamber has widespread applications in electromagnetic compatibility. We present a comparative study of recent techniques based on…
We have suggested a complexity-measure-based method for studying the dependence of measured 222Rn concentration time series on indoor air temperature and humidity. This method is based on the Kolmogorov complexity (KL). We have introduced…
The analysis of networks characterized by links with heterogeneous intensity or weight suffers from two long-standing problems of arbitrariness. On one hand, the definitions of topological properties introduced for binary graphs can be…
Spectral unmixing is a crucial processing step when analyzing hyperspectral data. In such analysis, most of the work in the literature relies on the widely acknowledged linear mixing model to describe the observed pixels. Unfortunately,…
This study investigates flood avalanches in a dense reservoir network in semiarid north-eastern Brazil. The population living in this area depends strongly on the availability of water from this network. Water is stored during…
Many man-made and natural phenomena, including the intensity of earthquakes, population of cities and size of international wars, are believed to follow power-law distributions. The accurate identification of power-law patterns has…
We review ideas on temporal dependences and recurrences in discrete time series from several areas of natural and social sciences. We revisit existing studies and redefine the relevant observables in the language of copulas (joint laws of…
We discuss the effect of large positive correlations in the combinations of several measurements of a single physical quantity using the Best Linear Unbiased Estimate (BLUE) method. We suggest a new approach for comparing the relative…