数据分析、统计与概率
RooFit is a toolkit for statistical modelling and fitting, and together with RooStats it is used for measurements and statistical tests by most experiments in particle physics. Since one year, RooFit is being modernised. In this talk,…
The HIBEAM and NNBAR experiments are a proposed fundamental science experiments at the European Spallation Source, performing high precision searches for neutron conversions in several baryon number violating (BNV) channels. For simulations…
Deep learning is a standard tool in the field of high-energy physics, facilitating considerable sensitivity enhancements for numerous analysis strategies. In particular, in identification of physics objects, such as jet flavor tagging,…
Currently, the engineering of miniature spectrometers mainly faces three problems: the mismatch between the number of filters at the front end of the detector and the spectral reconstruction accuracy; the lack of a stable spectral…
We constructed a frequently updated, near-real-time global power generation dataset: Carbon Monitor-Power since January, 2016 at national levels with near-global coverage and hourly-to-daily time resolution. The data presented here are…
A heuristic formula for 5-point approximation of the first derivative of an unknown function whose values are measured with an error at unequally spaced points is proposed. The derivative at a given point is calculated using the effective…
We review the alternative proposals introduced recently in the literature to update the standard formula to estimate the uncertainty on the mean of repeated measurements, and we compare their performances on synthetic examples with normal…
There is a growing use of neural network classifiers as unbinned, high-dimensional (and variable-dimensional) reweighting functions. To date, the focus has been on marginal reweighting, where a subset of features are used for reweighting…
Real-time forecasting is important to the society. It uses continuous data streams to update forecasts for sustained accuracy. But the data source is vulnerable to attacks or accidents and the dynamics of forecasting failure and recovery…
Background: In neurophysiological data, latency refers to a global shift of spikes from one spike train to the next, either caused by response onset fluctuations or by finite propagation speed. Such systematic shifts in spike timing lead to…
We investigate the asymptotic distribution of the profile likelihood ratio (PLR) when constraining effective field theories (EFTs) and show that Wilks' theorem is often violated, meaning that we should not assume the PLR to follow a…
A critical step in data analysis for many different types of experiments is the identification of features with theoretically defined shapes in N-dimensional datasets; examples of this process include finding peaks in multi-dimensional…
Particle symbols and Feynman diagrams often used in particle physics can be generated by a word document processor. Standard office packages have their built in symbols such as solid-line, dashed-line, arrows, etc. which are also used in…
Extracting governing physics from data is a key challenge in many areas of science and technology. The existing techniques for equations discovery are dependent on both input and state measurements; however, in practice, we only have access…
Mentoring is a key component of scientific achievements, contributing to overall measures of career success for mentees and mentors. A common success metric in the scientific enterprise is acquiring a large research group, which is believed…
Using the Fisher information (FI), the design of neutron reflectometry experiments can be optimised, leading to greater confidence in parameters of interest and better use of experimental time [Durant, Wilkins, Butler, & Cooper (2021). J.…
Getting the most from power-law-type data can be challenging. James Sethna points out some of the pitfalls in studying power laws arising from emergent scale invariance, as well as important opportunities.
End-to-end analyses of data from high-energy physics experiments using machine and deep learning techniques have emerged in recent years. These analyses use deep learning algorithms to go directly from low-level detector information…
We propose a new statistical test to identify non-stationary frequency-modulated stochastic processes from time series data. Our method uses the instantaneous phase as a discriminatory statistics with reliable critical values derived from…
Experiments aiming at high sensitivities usually demand for a very high statistics in order to reach more precise measurements. However, for those exploiting Low Temperature Detectors (LTDs), a high source activity may represent a drawback,…