数据分析、统计与概率
We describe here the general mathematical approach to constructing likelihoods for fitting observed spectra in one or more dimensions with multiple sources, including the effects of systematic uncertainties represented as nuisance…
Connections created from a node-edge matrix have been traditionally difficult to visualize and analyze because of the number of flows to be rendered in a limited feature or cartographic space. Because analyzing connectivity patterns is…
Application of root density estimator to problems of statistical data analysis is demonstrated. Four sets of basis functions based on Chebyshev-Hermite, Laguerre, Kravchuk and Charlier polynomials are considered. The sets may be used for…
We use multifractal detrended fluctuation analysis (MF-DFA), to See query 1 study sunspot number fluctuations. The result of the MF-DFA shows that there are three crossover timescales in the fluctuation function. We discuss how the…
Concerning systematic effects, the recommendation given in the GUM is to correct for them, but unfortunately no detailed information is available, how to do this. This publication will show, how systematic measurement deviations can be…
We introduce a general method to infer the directional information flow between populations whose elements are described by n-dimensional vectors of symbolic attributes. The method is based on the Jensen-Shannon divergence and on the…
Characterisations of the long-term behaviour of heart rate variability in humans have emerged in the last few years as promising candidates to became clinically significant tools. We present two different statistical analyses of long time…
A tetra of sets which elements are time series of interbeats has been obtained from the databank Physionet-MIT-BIH, corresponding to the following failures at the humans' heart: Obstructive Sleep Apnea, Congestive Heart Failure, and Atrial…
Transfer entropy provides a general tool for analyzing the magnitudes and directions---but not the \emph{kinds}---of information transfer in a system. We extend transfer entropy in two complementary ways. First, we distinguish…
This note discusses an interpolation technique for univariate distributions. In other words, the question is how to obtain a good approximation for f(x|a) if a0 < a < a1 is a control variable and f(x|a0) and f(x|a1) are known. The technique…
RooStats is a project to create advanced statistical tools required for the analysis of LHC data, with emphasis on discoveries, confidence intervals, and combined measurements. The idea is to provide the major statistical techniques as a…
We apply a data assimilation techniques, inspired from meteorological applications, to perform an optimal reconstruction of the neutronic activity field in a nuclear core. Both measurements, and information coming from a numerical model,…
This study theoretically considers the motion of N identical inelastic particles between two oscillating walls. The particles' average energy increases abruptly at certain critical filling fractions, wherein the system changes into a…
Many methods have been developed for data clustering, such as k-means, expectation maximization and algorithms based on graph theory. In this latter case, graphs are generally constructed by taking into account the Euclidian distance as a…
This paper compares the performance of artificial neural networks and boosted decision trees, with and without cascade training, for tagging b-jets in a collider experiment. It is shown, using a Monte Carlo simulation of $WH \to l\nu…
An algorithm for the detection of overlapping natural communities in networks was proposed by Lancichinetti, Fortunato, and Kertesz (LFK) last year. The LFK algorithm constructs natural communities of (in principle) all nodes of a graph by…
Today, the human brain can be studied as a whole. Electroencephalography, magnetoencephalography, or functional magnetic resonance imaging techniques provide functional connectivity patterns between different brain areas, and during…
We analyze the connectivity structure of weighted brain networks extracted from spontaneous magnetoencephalographic (MEG) signals of healthy subjects and epileptic patients (suffering from absence seizures) recorded at rest. We find that,…
We propose new probabilistic models for the spatial distribution of supply and demand and use the models to determine how the trip length distribution is affected by the relative shortage or excess of supply, the spatial clustering of…
This paper presents a hierarchical Bayesian model to reconstruct sparse images when the observations are obtained from linear transformations and corrupted by an additive white Gaussian noise. Our hierarchical Bayes model is well suited to…