数据分析、统计与概率
This manuscript describes the software package SCOUT, which analyzes, characterizes, and corrects one-dimensional signals. Specifically, it allows to check and correct for stationarity, detect spurious samples, check for normality, check…
Timepix and Timepix3 are hybrid pixel detectors ($256\times 256$ pixels), capable of tracking ionizing particles as isolated clusters of pixels. To efficiently analyze such clusters at potentially high rates, we introduce multiple…
In contrast to the symmetries of translation in space, rotation in space, and translation in time, the known laws of physics are not universally invariant under transformation of scale. However, the action can be invariant under change of…
For a better understanding of granular flow problems such as silo blockage, avalanche triggering, mixing and segregation, it is essential to have a `microscopic' view of individual particles. In order to cope with the difficulty arising…
In this Chapter we show that by considering eye movements, and in particular, the resulting sequence of gaze shifts, a stochastic process, a wide variety of tools become available for analyses and modelling beyond conventional statistical…
Scatterometry is a fast, indirect and nondestructive optical method for the quality control in the production of lithography masks. Geometry parameters of line gratings are obtained from diffracted light intensities by solving an inverse…
In the majority of molecular optimization tasks, predictive machine learning (ML) models are limited due to the unavailability and cost of generating big experimental datasets on the specific task. To circumvent this limitation, ML models…
I describe a five-dimensional, polarised, Bethe-Heitler event generator of $\gamma$-ray conversions to $\mu^+\mu^-$, based on a generator for conversion to $e^+e^-$ developed in the past. Verifications are performed from close-to-threshold…
Bayesian Gaussian Process Optimization can be considered as a method of the determination of the model parameters, based on the experimental data. In the range of soft QCD physics, the processes of hadron and nuclear interactions require…
This paper provides a detailed theoretical analysis of methods to approximate the solutions of high-dimensional (>10^6) linear Bayesian problems. An optimal low-rank projection that maximizes the information content of the Bayesian…
The critical heat flux (CHF) corresponding to the departure from nucleate boiling (DNB) crisis is essential to the design and safety of a two-phase flow boiling system. Despite the abundance of predictive tools available to the thermal…
RECAST is an analysis reinterpretation framework; since analyses are often sensitive to a range of models, RECAST can be used to constrain the plethora of theoretical models without the significant investment required for a new analysis.…
Removing noise from piecewise constant (PWC) signals, is a challenging signal processing problem arising in many practical contexts. For example, in exploration geosciences, noisy drill hole records need separating into stratigraphic zones,…
At high energy physics experiments, processing billions of records of structured numerical data from collider events to a few statistical summaries is a common task. The data processing is typically more complex than standard query…
This paper describes the methodology we have developed to define a sampling strategy adapted to operational constraints in order to characterize the dihydrogen flow rate of 2714 nuclear waste drums produced by radiolysis reaction of organic…
During oscillations of cosmology inflation around the minimum of a cuspy potential after inflation, the existence of extra high frequency gravitational waves (HFGWs) (GHz) has been proven effectively recently. Based on the electromagnetic…
New heterogeneous computing paradigms on dedicated hardware with increased parallelization, such as Field Programmable Gate Arrays (FPGAs), offer exciting solutions with large potential gains. The growing applications of machine learning…
Approaches for mapping time series to networks have become essential tools for dealing with the increasing challenges of characterizing data from complex systems. Among the different algorithms, the recently proposed ordinal networks stand…
For a broad class of input-output maps, arguments based on the coding theorem from algorithmic information theory (AIT) predict that simple (low Kolmogorov complexity) outputs are exponentially more likely to occur upon uniform random…
In these proceedings, we present a library allowing for straightforward calls in C++ to jet grooming algorithms trained with deep reinforcement learning. The RL agent is trained with a reward function constructed to optimize the groomed jet…