Data Analysis, Statistics and Probability
The functions of complex networks are usually determined by a small set of vital nodes. Finding the best set of vital nodes (eigenshield nodes) is critical to the network's robustness against rumor spreading and cascading failures, which…
This document describes the conceptual design for the Offline Software and Computing for the Deep Underground Neutrino Experiment (DUNE). The goals of the experiment include 1) studying neutrino oscillations using a beam of neutrinos sent…
Equation-of-state (EOS) models underpin numerical simulations at the core of research in high energy density physics, inertial confinement fusion, laboratory astrophysics, and elsewhere. In these applications EOS models are needed that span…
Couplings in complex real-world systems are often nonlinear and scale-dependent. In many cases, it is crucial to consider a multitude of interlinked variables and the strengths of their correlations to adequately fathom the dynamics of a…
Deep learning methods have gained popularity in high energy physics for fast modeling of particle showers in detectors. Detailed simulation frameworks such as the gold standard Geant4 are computationally intensive, and current deep…
In this paper, we present a method that combines information-theoretical and statistical approaches to infer connectivity in complex networks using time-series data. The method is based on estimations of the Mutual Information Rate for…
The dynamics of the Earth's magnetosphere exhibits strongly fluctuating patterns as well as non-stationary and non-linear interactions, more pronounced during magnetospheric substorms and magnetic storms. This complex dynamics comprises…
Using a graph-based approach, we propose a multiscale permutation entropy to explore the complexity of multivariate time series over multiple time scales. This multivariate multiscale permutation entropy (MPEG) incorporates the interaction…
The identification of jets and their constituents is a key and challenging problem in heavy-ion experiments such as those at RHIC and the LHC. The presence of a huge background of soft particles poses a serious obstacle for jet finding…
Many simple natural phenomena are characterized by complex motion that appears random at first glance, but that often displays underlying patterns and behavior that can be clustered in groups. The movement of small pieces of paper falling…
The histogram is a key method for visualizing data and estimating the underlying probability distribution. Over- or under-binning leads to incorrect conclusions about the data. A new method based on the Shannon entropy of the histogram…
We propose a curvelet-based model for the generation of anisotropic fractional Brownian fields, which are suited to modeling systems with orientation-dependent self-similar properties. The synthesis procedure consists of generating coefficients…
LHAASO KM2A consists of 5915 scintillation detectors and 1188 muon detectors; the muon detectors, spaced 30 m apart, cover 4% of the area of the whole array. The muon number of very-high-energy air shower events is investigated with the…
In many hypothesis testing applications, we have mixed priors, with well-motivated informative priors for some parameters but not for others. The Bayesian methodology uses the Bayes factor and is helpful for the informative priors, as it…
We propose two different approaches for introducing the information temperature of binary N-th order Markov chains. The first approach is based on comparing the Markov sequences with equilibrium Ising chains at given temperatures.…
Ergodicity breaking is a challenge for biological and psychological sciences. Ergodicity is a necessary condition for linear causal modeling. Long-range correlations and non-Gaussianity characterizing various biological and psychological…
Searches for new physics often face unknown backgrounds, causing false detections or weakened upper limits. This paper introduces the deficit hawk technique, which mitigates unknown backgrounds by testing multiple options for data cuts,…
The estimation of parameters from data is a common problem in many areas of the physical sciences, and frequently used algorithms rely on sets of simulated data which are fit to data. In this article, an analytic solution for…
The mutual incompatibility of distinct spectroscopic systems is among the most limiting factors in Laser-Induced Breakdown Spectroscopy (LIBS). The cost related to setting up a new LIBS system is increased, as its extensive calibration is…
Transition path theory provides a statistical description of the dynamics of a reaction in terms of local spatial quantities. In its original formulation, it is limited to reactions that consist of trajectories flowing from a reactant set A…