Data Analysis, Statistics and Probability
Resampling is costly in both computation time and accuracy. It regularizes irregular sampling, replacing N data points with N periodic estimates. This stage can be suppressed, using formulas built with incoming data and completed by…
Complex systems are described with high-dimensional data that is hard to visualise. Inselberg's parallel coordinates are one representation technique for visualising high-dimensional data. Here we generalise Inselberg's approach, and use it…
Most real-world complex networks, such as the Internet, the World Wide Web and collaboration networks, are huge, and inferring their structure and dynamics requires handling large connectivity (adjacency) matrices. Also, to find out the…
A new physically based method to estimate hemispheric-directional reflectance factor (HDRF) from lightweight multispectral cameras that have a downwelling irradiance sensor is presented. It combines radiometry with photogrammetric computer…
Analysis of daily streamflow variability in space and time is important for water resources planning, development, and management. The natural variability of streamflow is being complicated by anthropogenic influences and climate change,…
The level structure of nuclei offers a large amount and variety of information to improve our knowledge of the strong interaction and of mesoscopic quantum systems. Gamma spectroscopy is a powerful tool to perform such studies: modern gamma…
The availability of large datasets requires an improved view on statistical laws in complex systems, such as Zipf's law of word frequencies, the Gutenberg-Richter law of earthquake magnitudes, or scale-free degree distribution in networks.…
We propose a simplified procedure for the experimental application of the efficiency correction on higher order cumulants in heavy-ion collisions. By using the track-by-track efficiency, we can eliminate possible bias arising from the…
X-ray diffraction (XRD) data acquisition and analysis is among the most time-consuming steps in the development cycle of novel thin-film materials. We propose a machine-learning-enabled approach to predict crystallographic dimensionality…
Bayesian statistical inference has become increasingly important for the analysis of observations from the Advanced LIGO and Advanced Virgo gravitational-wave detectors. To this end, iterative simulation techniques, in particular nested…
In this communication I reconcile the kinematic method illustrated by some authors~\cite{yang2018,vitali2018} in studying the impact of system and suspension parameters on acoustophoretic separations with the statistical method formerly…
The explosion of activity in finding interactions in complex systems is driven by the availability of copious observations of complex natural systems. However, such systems, e.g. the human brain, are rarely completely observable. Interaction…
We introduce a wide family of stochastic processes that are obtained as sums of self-similar localized "waveforms" with multiplicative intensity in the spirit of the Richardson cascade picture of turbulence. We establish the convergence and…
We derive a Bayesian framework for incorporating selection effects into population analyses. We allow for both measurement uncertainty in individual measurements and, crucially, for selection biases on the population of measurements, and…
A new boosting decision tree (BDT) method, QBDT, is proposed for the classification problem in the field of high energy physics (HEP). In many HEP studies, great effort is made to increase the signal significance in the presence of…
Intuitively, a scientist might assume that a more complex regression model will necessarily yield a better predictive model of experimental data. Herein, we disprove this notion in the context of extracting the proton charge radius from…
Timing systems based on Analog-to-Digital Converters are widely used in the design of previous high energy physics detectors. In this paper, we propose a new method based on deep learning to extract the time information from a finite set of…
The author considers a hypothesis of neutron lifetime splitting in beta-decay and shows that the beta-decay of neutrons could be described by the triad of lifetimes tau_{Left}, tau_{Mean}, tau_{Right}. The lifetime tau_{Left} is the…
The 1-bit compressed sensing framework enables the recovery of a sparse vector x from the sign information of each entry of its linear transformation. Discarding the amplitude information can significantly reduce the amount of data, which…
Materials science is undergoing profound changes due to advances in characterization instrumentation that have resulted in an explosion of data in terms of volume, velocity, variety and complexity. Harnessing these data for scientific…