数据分析、统计与概率
Speckle based imaging consists of forming a super-resolved reconstruction of an unknown sample from low-resolution images obtained under random inhomogeneous illuminations (speckles). In a blind context where the illuminations are unknown,…
Statistical distances quantifies the difference between two statistical constructs. In this article, we describe reference values for a distance between samples derived from the Kolmogorov-Smirnov statistic $D_{F,F'}$. Each measure of the…
The CERN IT provides a set of Hadoop clusters featuring more than 5 PBytes of raw storage with different open-source, user-level tools available for analytical purposes. The CMS experiment started collecting a large set of computing…
A novel single-lead f-wave extraction algorithm based on the modern diffusion geometry data analysis framework is proposed. The algorithm is essentially an averaged beat subtraction algorithm, where the ventricular activity template is…
We develop an algorithm for model selection which allows for the consideration of a combinatorially large number of candidate models governing a dynamical system. The innovation circumvents a disadvantage of standard model selection which…
Motivation: Structural heterogeneity in single-particle cryo-electron microscopy (cryo-EM) data represents a major challenge for high-resolution structure determination. Unsupervised classification may serve as the first step in the…
Currently, we are overwhelmed by a deluge of experimental data, and network physics has the potential to become an invaluable method to increase our understanding of large interacting datasets. However, this potential is often unrealized…
The identification of abnormal behaviour in mechanical systems is key to anticipate and avoid their potential failure. Thus wind turbine health is commonly assessed monitoring series of $10$-minute SCADA and high frequency data from…
Complex networks are usually characterized in terms of their topological, spatial, or information-theoretic properties and combinations of the associated metrics are used to discriminate networks into different classes or categories.…
The Liquids Reflectometer at Oak Ridge National Laboratory provides neutron reflectivity capability for an average of about 30 experiments each year. In recent years, there has been a large effort to streamline the data processing and…
We study two different forms of fluctuation-dissipation processes generating anomalous relaxations to equilibrium of an initial out of equilibrium condition, the former being based on a stationary although very slow correlation function and…
We provide a unified description of efficiency correction and error estimation for moments of conserved quantifies in heavy-ion collisions. Moments and cumulants are expressed in terms of the factorial moments, which can be easily corrected…
We explore a combinatorial framework which efficiently quantifies the asymmetries between minima and maxima in local fluctuations of time series. We firstly showcase its performance by applying it to a battery of synthetic cases. We find…
A hierarchical scheme for clustering data is presented which applies to spaces with a high number of dimension ($N_{_{D}}>3$). The data set is first reduced to a smaller set of partitions (multi-dimensional bins). Multiple clustering…
The estimation of phase errors from digital-holography data is critical for applications such as imaging or wave-front sensing. Conventional techniques require multiple i.i.d. data and perform poorly in the presence of high noise or large…
Discovery of causal relations is fundamental for understanding the dynamics of complex systems. While causal interactions are well defined for acyclic systems that can be separated into causally effective subsystems, a mathematical…
Data unfolding is a common analysis technique used in HEP data analysis. Inspired by the deconvolution technique in the digital signal processing, a new unfolding technique based on the SVD technique and the well-known Wiener filter is…
The measurement of a reaction cross section from a pulse height spectrum is a ubiquitous problem in experimental nuclear physics. In $\gamma$-ray spectroscopy, this is accomplished frequently by measuring the intensity of full-energy…
Multiscale fuzzy entropy (MFE) has been a prevalent tool to quantify the complexity of time series. However, it is extremely sensitive to the predetermined parameters and length of time series and it may yield an inaccurate estimation of…
Complex network formalism allows to explain the behavior of systems composed by interacting units. Several prototypical network models have been proposed thus far. The small-world model has been introduced to mimic two important features…