Applied Statistics
Incomplete reporting of diagnostic accuracy data remains a persistent problem in medical research. In many studies, only part of the 2x2 diagnostic table is reported, leaving denominators for diseased and non-diseased groups unknown and…
Extreme weather events during peak winter periods drive resource adequacy risk in Great Britain (GB), with weather sensitivity of the supply-demand balance increasing through additional electric heating and wind generation. This work…
This report describes the SHARELIFE-MI project, which aims to generate multiple imputations for missing values in the life-course data collected in SHARELIFE Waves 3 and 7. The SHARELIFE study reconstructs individual life histories through…
Football fans frequently exhibit pronounced emotional and physiological reactions during high-stakes matches. However, the temporal dynamics of this football fever are rarely modeled as a latent process. Using intensive longitudinal data…
Infectious disease dynamics operate across multiple biological scales, with within-host viral dynamics being a key driver of between-host transmission. However, while models that explicitly link these scales exist, none have been developed…
Accurate forecasting of electric vehicle (EV) charging demand is critical for grid management and infrastructure planning. Yet the field continues to rely on legacy benchmarks, such as the Palo Alto (2020) dataset, that fail to reflect the…
Clustering mixed-type data to uncover clinically meaningful subgroups within heterogeneous patient populations remains a major challenge in biomedical research. Most existing clustering methods impose restrictive assumptions like local…
Modeling precipitation and its accumulation over time and space is essential for flood risk assessment. In this paper, we analyze rainfall data collected over several years through a micro-scale precipitation sensor network in Montpellier,…
Wave impact loads on maritime structures can cause casualties, damage, pollution and operational delays. Consequently, their extreme values should be accounted for in the design of these structures. However, this is challenging, as wave…
More than a billion people around the world experience intermittence in their water supply, posing challenges for urban households in Global South cities. An intermittent water supply (IWS) system prompts water users to adapt to service…
The impact of a management intervention on the soil organic carbon (SOC) stored in a given volume of soil is moderated by features that determine that soil's sequestration potential under that intervention. To maximize total SOC…
We seek to identify genes involved in Parkinson's Disease (PD) by combining information across different experiment types. Each experiment, taken individually, may contain too little information to distinguish some important genes from…
Early identification of at-risk students in higher education depends on predictive models that maintain accuracy across successive cohorts, a requirement that single-cohort modeling approaches fail to meet. This study evaluates Bayesian…
The 1965 Nobel Prize in Literature was awarded to Mikhail Sholokhov (1905-1984) for the epic novel Tikhij Don, about Cossack life and the birth of a new Soviet society (And Quiet Flows the Don, or The Quiet Don, in different translations).…
Air pollution is a worldwide public health threat that can cause or exacerbate many illnesses, including respiratory disease, cardiovascular disease, and some cancers. However, epidemiological studies and public health decision-making are…
Understanding and mapping extreme heat is critical for risk management and public health planning, particularly in regions with complex terrain and heterogeneous climate. We present a case study of extreme heat in the Four Corners region of…
Air quality monitoring in Italy relies on sparse, irregular, ground-based stations that provide high-quality but incomplete measurements of pollution. Chemical transport models (CTMs) offer full spatial and temporal coverage but smooth over…
Spatially resolved transcriptomics is a fast-developing set of technologies that enables the measurement of localized gene expression across spatial locations in a sample. Detecting spatially varying genes is critical for analyzing such…
The transition of end-of-life care to palliative care (PC) sparks intense debate: does it provide economic relief or shift unremunerated labor costs onto families? Evaluating this is hindered by causal inference challenges and skewed…
Neurophysiologists can now record from a large number of extracellular electrodes and extract, from the raw data, the sequences of action potentials, or spikes, generated by many neurons. Unfortunately, these "many neurons"…