Applied Statistics
Microbiome interventions provide valuable data about microbial ecosystem structure and dynamics. Despite their ubiquity in microbiome research, few rigorous data analysis approaches are available. In this study, we extend transfer…
Humans have recorded the arrival dates of migratory birds for millennia, searching for trends and patterns. As the first arrival among individuals in a species is the realized tail of the probability distribution of arrivals, the…
In fluid team sports such as soccer and basketball, analyzing team formation is one of the most intuitive ways to understand tactics from domain participants' point of view. However, existing approaches either assume that team formation is…
Our understanding of the structure of the brain and its relationships with human traits is largely determined by how we represent the structural connectome. Standard practice divides the brain into regions of interest (ROIs) and represents…
Epidemic disease can spread during mass gatherings. We assessed the impact on the local-area trajectory of the COVID-19 epidemic of a type of mass gathering about which comprehensive data were available. Here, we examined five types of…
Improving road safety is hugely important, as the number of deaths on the world's roads remains unacceptably high: an estimated 1.35 million people die each year as a result of road traffic collisions (WHO, 2020). Current practice for…
The refurbishment of an escalator is usually linked with its design life as recommended by the manufacturer. However, the actual useful life of an escalator should be determined by its operating condition which is affected by the runtime,…
Aggregated curves are common structures in economics and finance, and the most prominent examples are supply and demand curves. In this study, we exploit the fact that all aggregated curves have an intrinsic hierarchical structure, and thus…
The Health Index is a value (in terms of color or score) which describes the technical condition of an asset. Using the Health Index of various assets, the so-called aggregated Health Index of a system can be calculated. For electric…
Binary spatio-temporal data are common in many application areas. Such data can be considered from many perspectives, including via deterministic or stochastic cellular automata, where local rules govern the transition probabilities that…
Bivariate count models having one marginal and the other conditionals being of the Poisson form are called pseudo-Poisson distributions. Such models have simple flexible dependence structures, possess fast computation algorithms and generate…
Agricultural workers are essential to the supply chain for our daily food, and yet many face harmful work conditions, including garnished wages and other labor violations. Workers on H-2A visas are particularly vulnerable due to the…
A new method for clustering functional data is proposed via information maximization. The proposed method learns a probabilistic classifier in an unsupervised manner so that mutual information (or squared loss mutual information) between…
We propose a Bayesian stochastic cellular automata modeling approach to model the spread of wildfires with uncertainty quantification. The model considers a dynamic neighborhood structure that allows neighbor states to inform transition…
Financial institutions manage operational risk (OpRisk) by carrying out activities required by regulation, such as collecting loss data, calculating capital requirements, and reporting. For this purpose, for each OpRisk event, loss amounts,…
Partisan gerrymandering, i.e., manipulation of electoral district boundaries for political advantage, is one of the major challenges to election integrity in modern-day democracies. Yet most of the existing methods for detecting partisan…
California experienced an increase in violent criminality during the last decade, largely driven by a surge in aggravated assaults. To address this challenge, accurate and timely forecasts of criminal activity may help state authorities…
Exposure to fine particulate matter ($PM_{2.5}$) poses significant health risks and accurately determining the shape of the relationship between $PM_{2.5}$ and health outcomes has crucial policy ramifications. While various statistical…
Objective: Breathing pattern variability (BPV), as a universal physiological feature, encodes rich health information. We aim to show that a high-quality automatic sleep stage scoring based on a proper quantification of BPV extracted from…
This paper introduces a comprehensive and original database on West Nile virus (WNV) outbreaks that have occurred in Italy from September 2012 to November 2022. We have digitized bulletins published by the Italian National Institute of…