数据分析、统计与概率
Many measurements in the physical sciences can be cast as counting experiments, where the number of occurrences of a physical phenomenon informs the prevalence of the phenomenon's source. Often, detection of the physical phenomenon (termed…
Periodogram methods are widely used for the estimation of power- and cross-spectra, of which Welch's method is the most popular. Previous studies have analyzed the variance of the power spectra estimates and developed analytical probability…
The use of machine learning algorithms is an attractive way to produce very fast detector simulations for scattering reactions that can otherwise be computationally expensive. Here we develop a factorised approach where we deal with each…
Modeling non-Markovian time series is a recent topic of research in many fields such as climate modeling, biophysics, molecular dynamics, or finance. The generalized Langevin equation (GLE), given naturally by the Mori-Zwanzig projection…
Data-driven modeling of non-Markovian dynamics is a recent topic of research with applications in many fields such as climate research, molecular dynamics, biophysics, or wind power modeling. In the frequently used standard Langevin…
Multifractal Detrended Fluctuation Analysis stands out as one of the most reliable methods for unveiling multifractal properties, specially when real-world time series are under analysis. However, little is known about how several aspects,…
The development of an LHC physics analysis involves numerous investigations that require the repeated processing of terabytes of data. Thus, a rapid completion of each of these analysis cycles is central to mastering the science project. We…
When performing Monte-Carlo simulations, distributions are sometimes determined only for sub-intervals of the desired total range. In such cases, a frequent problem is to connect, or glue, individual distributions to obtain the final…
We investigate the performance of entropy estimation methods, based either on block entropies or compression approaches, in the case of bidimensional sequences. We introduce a validation dataset made of images produced by a large number of…
This article addresses the measurement of the power spectrum of red noise processes at the lowest frequencies, where the minimum acquisition time is so long that it is impossible to average on a sequence of data record. Therefore, averaging…
In this paper, a novel surrogate model based on the Grassmannian diffusion maps (GDMaps) and utilizing geometric harmonics is developed for predicting the response of engineering systems and complex physical phenomena. The method utilizes…
Anomalous diffusion or, more generally, anomalous transport, with nonlinear dependence of the mean-squared displacement on the measurement time, is ubiquitous in nature. It has been observed in processes ranging from microscopic movement of…
Boosted decision trees are a very powerful machine learning technique. After introducing specific concepts of machine learning in the high-energy physics context and describing ways to quantify the performance and training quality of…
We propose an extended genetic algorithm (GA) with different local environmental conditions. Genetic entities, or configurations, are put on nodes in a ring structure, and location-dependent environmental conditions are applied for each…
In this paper, we employ the persistent homology (PH) technique to examine the topological properties of fractional Gaussian noise (fGn). We develop the weighted natural visibility graph algorithm, and the associated simplicial complexes…
Monte Carlo (MC) simulations are extensively used for various purposes in modern high-energy physics (HEP) experiments. Precision measurements of established Standard Model processes or searches for new physics often require the collection…
In this article we describe the implementation of Artificial Intelligence models in track reconstruction software for the CLAS12 detector at Jefferson Lab. The Artificial Intelligence based approach resulted in improved track reconstruction…
In Bayesian estimation theory, the estimator ${\hat \theta} = E[\theta|l]$ attains the minimum mean squared error (MMSE) for estimating a scalar parameter of interest $\theta$ from the observation of $l$ through a noisy channel…
Thomson scattering measurements in High Energy Density experiments are often recorded using optical streak cameras. In the low-signal regime, noise introduced by the streak camera can become an important and sometimes the dominate source of…
In 2002, in a seminal article, Christoph Bandt and Bernd Pompe proposed a new methodology for the analysis of complex time series, now known as Ordinal Analysis. The ordinal methodology is based on the computation of symbols (known as…