数据分析、统计与概率
HMCF "Hamiltonian Monte Carlo for Fields" is a software add-on for the NIFTy "Numerical Information Field Theory" framework implementing Hamiltonian Monte Carlo (HMC) sampling in Python. HMCF as well as NIFTy are designed to address…
In the strong-constraint formulation of the history-matching problem, we assume that all the model errors relate to a selection of uncertain model input parameters. One does not account for additional model errors that could result from,…
An agent choosing between various actions tends to take the one with the lowest cost. But this choice is arguably too rigid (not adaptive) to be useful in complex situations, e.g., where exploration-exploitation trade-off is relevant in…
The method of maximum likelihood estimation (MLE) is a widely used statistical approach for estimating the values of one or more unknown parameters of a probabilistic model based on observed data. In this tutorial, I briefly review the…
We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks…
The three-cornered hat method and the Groslambert Covariance are very often used to estimate the frequency stability of each individual oscillator in a set of three oscillators by comparing them in pairs. However, no rigorous method to…
The online data reduction service reductus transforms measurements in experimental science from laboratory coordinates into physically meaningful quantities with accurate estimation of uncertainties based on instrumental settings and…
We propose a novel method of detecting directed interactions of a general dynamic network from measured data. By repeating random state variable resetting of a target node and appropriately averaging over the measurable data, the pairwise…
A new data cleaning procedure for electron cyclotron emission imaging (ECEI) of EAST tokamak is constructed. Machine learning techniques, including SVM and Decision tree, are applied to identifying saturated, zero, and weak signals of ECEI…
Two popular boosted decsion tree (BDT) methods, Adaptive BDT (AdaBDT) and Gradient BDT (GradBDT) are studied in the classification problem of separating signal from background assuming all trees are weak learners. The following results are…
A method to perform unfolding with Gaussian processes (GPs) is presented. Using Bayesian regression, we define an estimator for the underlying truth distribution as the mode of the posterior. We show that in the case where the bin contents…
We formulate, using the discrete nonlinear Schroedinger equation (DNLS), a general approach to encode and process information based on reservoir computing. Reservoir computing is a promising avenue for realizing neuromorphic computing…
We present a new statistical test that examines the consistency of the tails of two empirical distributions at multiple thresholds. Such distributions are often encountered in counting experiments, in physics and elsewhere, where the…
This paper deals with modelling and reconstruction of strain fields, relying upon data generated from neutron Bragg-edge measurements. We propose a probabilistic approach in which the strain field is modelled as a Gaussian process, assigned…
B-decay data from the Belle experiment at the KEKB collider have a substantial background from $e^{+}e^{-}\to q \bar{q}$ events. To suppress this we employ deep neural network algorithms. These provide improved signal from background…
A fundamental problem in geophysical modeling is related to the identification and approximation of causal structures among physical processes. However, resolving the bidirectional mappings between physical parameters and model state…
The impedance/admittance measurements of a piezoelectric transducer bonded to or embedded in a host structure can be used as damage indicator. When a credible model of the healthy structure, such as the finite element model, is available,…
In this note, we investigate possible relationships between the bivariate Hurst exponent $H_{xy}$ and an average of the separate Hurst exponents $\frac{1}{2}(H_x+H_y)$. We show that two cases are well theoretically founded. These are the…
Recent statistical evaluations for High-Energy Physics measurements, in particular those at the Large Hadron Collider, require careful evaluation of many sources of systematic uncertainties at the same time. While the fundamental aspects of…
How might a smooth probability distribution be estimated, with accurately quantified uncertainty, from a limited amount of sampled data? Here we describe a field-theoretic approach that addresses this problem remarkably well in one…