Data Analysis, Statistics and Probability
Predicting extreme events in chaotic systems, characterized by rare but intensely fluctuating properties, is of great importance due to their impact on the performance and reliability of a wide range of systems. Some examples include…
The PandaX-4T experiment is designed for multiple purposes, including searches for solar neutrinos, weakly interacting massive particles, and rare double beta decays of xenon isotopes. The experiment produces a huge amount of raw data that…
In High Energy Physics, simulations play a crucial role in unraveling the complexities of particle collision experiments within CERN's Large Hadron Collider. Machine learning simulation methods have garnered attention as promising…
Scan line levelling, a ubiquitous and often necessary step in AFM data processing, can cause a severe bias on measured roughness parameters such as mean square roughness or correlation length. Although bias estimates have been formulated,…
High-energy physics experiments rely heavily on precise measurements of energy and momentum, yet face significant challenges due to detector limitations, calibration errors, and the intrinsic nature of particle interactions. Traditional…
We present Link Density (LD), computed from the Recurrence Network (RN) of a time series, as an effective measure that can detect dynamical transitions in a system. We illustrate its use with time series from the standard Rössler system…
Metal additive manufacturing is gaining broad interest and increased use in the industrial and academic fields. However, the quantification and commercialization of standard parts usually require extensive experiments and expensive…
In this article, we review the interdisciplinary techniques (borrowed from physics, mathematics, statistics, machine-learning, etc.) and methodological framework that we have used to understand climate systems, which serve as examples of…
The pebble bed reactor (PBR), which relies on TRISO-fueled pebbles, is one of the most promising Gen-IV reactor designs because of its intrinsic safety and thermal efficiency. Fuel pebbles flow through the PBR's core, and the identification of individual…
This study focuses on the novel application of a normalizing flow as a method of domain adaptation. Normalizing flows offer a way to transform data points between two different distributions. The present study investigates a method of…
In particle physics, workflow management systems are primarily used as tailored solutions in dedicated areas such as Monte Carlo production. However, physicists performing data analyses are usually required to steer their individual,…
Forman-Ricci curvature (FRC) is a powerful tool for analysing empirical networks, as the distribution of the curvature values can identify structural information that is not readily detected by other geometrical methods.…
Distinguishing power-law distributions from other heavy-tailed distributions is challenging, and this task is often further complicated by subsampling effects. In this work, we evaluate the performance of two commonly used methods for…
Any continuous curve in a higher dimensional space can be considered a trajectory that can be parameterized by a single variable, usually taken as time. It is well known that a continuous curve can have a fractional dimensionality, which…
The reconstruction of photon conversions is important in order to improve the reconstruction efficiency of the physics measurements involving photons. However, there is a significant number of conversions in which only one of the two tracks…
Memory effects emerge as a fundamental consequence of dimensionality reduction when low-dimensional observables are used to describe the dynamics of complex many-body systems. In the context of molecular dynamics (MD) data analysis,…
This is a writeup, with some elaboration, of the talks by the two authors (a physicist and a statistician) at the first PHYSTAT Informal review on January 24, 2024. We discuss Bayesian and frequentist approaches to dealing with nuisance…
A new method recovers the phase difference of interfering wavefronts from a pattern of interference fringes, avoiding the discontinuity problem. The continuous phase is a solution of the first-order differential equation of the interferogram…
We propose an algorithm to detect mini-jet clusters in high-energy nuclear collisions, by selecting a high-transverse-momentum ($p_T$) particle as a seed and assigning a clustering radius ($R$) in the pseudorapidity and azimuthal-angle…
Quantifying relationships between components of a complex system is critical to understanding the rich network of interactions that characterize the behavior of the system. Traditional methods for detecting pairwise dependence of time…