数据分析、统计与概率
The paper elucidates, with an analytic example, a subtle mistake in the application of the extended likelihood method to the problem of determining the fractions of pure samples in a mixed sample from the shape of the distribution of a…
With modern data acquisition devices that work fast and very precise, scientists often face the task of dealing with huge amounts of data. These need to be rapidly processed and stored onto a hard disk. We present a LabVIEW program which…
Most real-world complex systems can be modelled by coupled networks with multiple layers. How and to what extent the pattern of couplings between network layers may influence the interlaced structure and function of coupled networks are not…
Two new surrogate methods, the Small Shuffle Surrogate (SSS) and the Truncated Fourier Transform Surrogate (TFTS), have been proposed to study whether there are some kind of dynamics in irregular fluctuations and if so whether these…
In this paper, we develop the idea to partition the edges of a weighted graph in order to uncover overlapping communities of its nodes. Our approach is based on the construction of different types of weighted line graphs, i.e. graphs whose…
A method enabling automatic detection and tracking of large amounts of individual dust particles in plasmas is presented. Individual trajectories can be found with a good spatiotemporal resolution, even without applying any external light…
From the integration of non-symmetrical hyperboles, a one-parameter generalization of the logarithmic function is obtained. Inverting this function, one obtains the generalized exponential function. We show that functions characterizing…
Often the result of a scientific experiment is given by the difference of measurements in two configurations, denoted by A and B. Since the measurements are not obtained simultaneously, drift of the zero-point can bias the result. In…
In this note I go through the `proof' of frequentistic confidence intervals and show what it logically implies concerning the value of a physical quantity given an experimental observation (nothing).
This paper reviews the basic ideas behind a Bayesian unfolding published some years ago and improves their implementation. In particular, uncertainties are now treated at all levels by probability density functions and their propagation is…
We propose a similarity-based method, using the similarity between nodes, to address the problem of classification in partially labeled networks. The basic assumption is that two nodes are more likely to be categorized into the same class…
From a time series whose data are embedded in heavy noise, we construct an Hilbert space operator (J-operator) whose discrete spectrum represents the signal while the essential spectrum located on the unit circle, is associated with the…
There is substantial interest in the effect of human mobility patterns on opportunistic communications. Inspired by recent work revisiting some of the early evidence for a L\'evy flight foraging strategy in animals, we analyse datasets on…
Determining if two histograms are consistent, whether they have been drawn from the same underlying distribution or not, is a common problem in physics. Existing approaches are not only limited in power but also inapplicable to histograms…
This paper studies a fully Bayesian algorithm for endmember extraction and abundance estimation for hyperspectral imagery. Each pixel of the hyperspectral image is decomposed as a linear combination of pure endmember spectra following the…
We consider the evolution of a network of neurons, focusing on the asymptotic behavior of spikes dynamics instead of membrane potential dynamics. The spike response is not sought as a deterministic response in this context, but as a…
Huang's Empirical Mode Decomposition (EMD) is an algorithm for analyzing nonstationary data that provides a localized time-frequency representation by decomposing the data into adaptively defined modes. EMD can be used to estimate a…
We propose an interpolation expression using the difference moment (Kolmogorov transient structural function) of the second order as the average characteristic of displacements for identifying the anomalous diffusion in complex processes…
We investigate the community structure of physics subfields in the citation network of all Physical Review publications between 1893 and August 2007. We focus on well-cited publications (those receiving more than 100 citations), and apply…
We present a method which tests whether or not two datasets (one of which could be Monte Carlo generated) might come from the same distribution. Our method works in arbitrarily high dimensions.