应用统计
Age-specific probabilities of death provide a snapshot of population mortality at the country level at a given point in time. Due to the high dimensionality of the data, summarising mortality information is essential for various analyses,…
We propose modeling absorption spectrum measurements as mixtures of Gaussian process experts. This enables us to construct a flexible statistical model for interpolating and extrapolating measurements, facilitating statistical integration…
We propose a semi-structured discrete-time multi-state model to analyse mortgage delinquency transitions. This model combines an easy-to-understand structured additive predictor, which includes linear effects and smooth functions of time…
This study aimed to adapt the Dual-Dimensional Scale of Instrumental and Relational Dependencies on Large Language Models (LLM-D12) into Turkish and evaluate its psychometric properties among regular LLM users. A sample of 387 participants…
The Global Database of Events, Language and Tone (GDELT) provides geolocated event records that can be aggregated into weekly spatiotemporal panels of event counts across regions, actors, and event types. These panels are typically sparse,…
Timely and accurate conflict event data are essential for real-time monitoring, forecasting, and policy response. Yet near-real-time conflict datasets such as the Armed Conflict Location \& Event Data Project (ACLED) are subject to…
The validity of assessments, from large-scale AI benchmarks to human classrooms, depends on the quality of individual items, yet modern evaluation instruments often contain thousands of items with minimal psychometric vetting. We introduce…
This study presents an Initial Data Analysis (IDA) of the German Transplantation Registry (TxReg) data for a better data understanding and to inform future data analyses. The IDA is focusing on data on first-time kidney-only…
While conducting probabilistic surveys is the gold standard for assessing vaccine coverage, implementing these surveys poses challenges for global health. There is a need for more convenient option that is more affordable and practical.…
This article focuses on the use of Geographically Weighted Regression (GWR) method to correct air quality low-cost sensors measurements. Those sensors are of major interest in the current era of high-resolution air quality monitoring at…
Urban air quality is a major concern today. Concentrations of pollutants, such as nitrogen dioxide, must be monitored to ensure that they do not exceed hazardous thresholds. For this reason, scarse reference stations, which are generally…
Single-cell trajectory analysis aims to reconstruct the biological developmental processes of cells as they evolve over time, leveraging temporal correlations in gene expression. During cellular development, gene expression patterns…
In a dataset of 423 patients who had had radical prostatectomy for localised prostate cancer we estimated the apparent Shannon information (ASI) about time to biochemical recurrence in various subsets of the available pre-op variables using…
Households represent a key unit of interest in infectious disease epidemiology, in both empirical studies and mathematical modelling. The within-household transmission potential of a disease is often summarised by a secondary attack ratio…
Accurate inference on population dynamics, such as migration and changes in population size, is essential for policymaking, resource allocation and demographic research. Traditional censuses are expensive, infrequent and not timely, leading…
Detection of occult hemorrhage (i.e., internal bleeding) in patients in intensive care units (ICUs) can pose significant challenges for critical care workers. Because blood loss may not always be clinically apparent, clinicians rely on…
Forecasting infectious disease outbreaks is hard. Forecasting emerging infectious diseases with limited historical data is even harder. In this paper, we investigate ways to improve emerging infectious disease forecasting under operational…
Understanding how animals move through heterogeneous landscapes is central to ecology and conservation. In this context, step selection functions (SSFs) have emerged as the main statistical framework to analyze how biotic and abiotic…
In the aftermath of the COVID-19 pandemic, empirical data have revealed that large-scale health crises not only cause immediate disruptions in mortality dynamics but also have persistent effects that may last for several years. Existing…
Understanding wafer-level spatial variations from in-situ process signals is essential for advanced plasma etching process monitoring. While most data-driven approaches focus on scalar indicators such as average etch rate, actual process…