数据分析、统计与概率
There have been a number of recent proposals to enhance the performance of machine learning strategies for collider physics by combining many distinct events into a single ensemble feature. To evaluate the efficacy of these proposals, we…
To model reliably behavioral systems with complex bio-social interactions, accounting for uncertainty quantification, is critical for many application areas. However, in terms of the mathematical formulation of the corresponding problems,…
Chronometric dating is becoming increasingly important in areas such as the Origin and evolution of Life on Earth and other planets, Origin and evolution of the Earth and the Solar System... Electron Spin Resonance (ESR) dating is based on…
Diffusion processes are important in several physical, chemical, biological and human phenomena. Examples include molecular encounters in reactions, cellular signalling, the foraging of animals, the spread of diseases, as well as trends in…
We propose a time-varying optimal window width (TVOWW) selection scheme to optimize the performance of several nonlinear-type time-frequency analyses, including the reassignment method, and the synchrosqueezing transform (SST) and its…
Time-frequency (TF) analysis is a powerful tool for exploring ultrafast dynamics in atoms and molecules. While some TF methods have demonstrated their usefulness and potential in several of quantum systems, a systematic comparison among…
This study introduces a new adaptive time-frequency (TF) analysis technique, synchrosqueezing transform (SST), to explore the dynamics of a laser-driven hydrogen atom at an {\it ab initio} level, upon which we have demonstrated its…
Amplitude analysis is a powerful technique to study hadron decays. A significant complication in these analyses is the treatment of instrumental effects, such as background and selection efficiency variations, in the multidimensional…
Signal estimation in the presence of background noise is a common problem in several scientific disciplines. An 'On/Off' measurement is performed when the background itself is not known, being estimated from a background control sample. The…
The unsupervised search for overdense regions in high-dimensional feature spaces, where locally high population densities may be associated with anomalous contaminations to an otherwise more uniform population, is of relevance to…
Since Bandt and Pompe's seminal work, permutation entropy has been used in several applications and is now an essential tool for time series analysis. Beyond becoming a popular and successful technique, permutation entropy inspired a…
In general-purpose particle detectors, the particle-flow algorithm may be used to reconstruct a comprehensive particle-level view of the event by combining information from the calorimeters and the trackers, significantly improving the…
Physicists are starting to work in areas where noisy signal analysis is required. In these fields, such as Economics, Neuroscience, and Physics, the notion of causality should be interpreted as a statistical measure. We introduce to the lay…
Bayesian modeling techniques enable sensitivity analyses that incorporate detailed expectations regarding future experiments. A model-based approach also allows one to evaluate inferences and predicted outcomes, by calibrating (or…
Th\^eo1 is a frequency stability statistic which is similar to the Allan variance but can provide stability estimates at longer averaging factors and with higher confidence. However, the calculation of Th\^eo1 is significantly slower than…
Single-shot wide-angle diffraction imaging is a widely used method to investigate the structure of non-crystallizing objects such as nanoclusters, large proteins or even viruses. Its main advantage is that information about the…
Technical analysis is considered the oldest, currently omnipresent, method for financial markets analysis, which uses past prices aiming at the possible short-term forecast of future prices. In the frame of complex systems, methods used to…
We propose a new method for Unsupervised clustering in particle physics named UCluster, where information in the embedding space created by a neural network is used to categorise collision events into different clusters that share similar…
Attaining reliable profile gradients is of utmost relevance for many physical systems. In most situations, the estimation of gradient can be inaccurate due to noise. It is common practice to first estimate the underlying system and then…
The creativity and emergence of biological and psychological behavior are nonlinear. However, that does not necessarily mean only that the measurements of the behaviors are curvilinear. Furthermore, the linear model might fail to reduce…