Applied Statistics
The U.S. Census Longitudinal Business Database (LBD) product contains employment and payroll information of all U.S. establishments and firms dating back to 1976 and is an invaluable resource for economic research. However, the sensitive…
Prognostic Health Management aims to predict the Remaining Useful Life (RUL) of degrading components/systems utilizing monitoring data. These RUL predictions form the basis for optimizing maintenance planning in a Predictive Maintenance…
Purpose: This study aimed to investigate the correlation between air pollution and astigmatism, considering the detrimental effects of air pollution on respiratory, cardiovascular, and eye health. Methods: A longitudinal study was conducted…
With increasing interest in adaptive clinical trial designs, drug supply chain management faces challenges that may offset the benefits of those designs. Thus, it is necessary to develop an optimization tool to facilitate the…
Marine mammals are increasingly vulnerable to human disturbance and climate change. Their diving behavior leads to limited visual access during data collection, making studying the abundance and distribution of marine mammals challenging.…
The recent shift to remote learning and work has aggravated long-standing problems, such as monitoring the mental health of individuals and the progress of students towards learning targets. We introduce a novel latent…
Automated vehicle (AV) shuttles are emerging mobility technologies that have been widely piloted and deployed. Public attitude is critical to the deployment progress and the overall social benefits of AV technologies.…
The COVID-19 pandemic has had a substantial impact on hospital services, as many institutions have observed a surge in healthcare-associated infections (HAIs) despite heightened adherence to isolation protocols and hand hygiene. According…
Due to severe societal and environmental impacts, wildfire prediction using multi-modal sensing data has become a highly sought-after data-analytical tool by various stakeholders (such as state governments and power utility companies) to…
Dengue is a vector-borne disease transmitted by Aedes mosquitoes. The worldwide spread of these mosquitoes and the increasing disease burden have emphasized the need for a spatio-temporal risk map capable of assessing dengue outbreak…
Sociodemographic inequalities in student achievement are a persistent concern for education systems and are increasingly recognized to be intersectional. Intersectionality considers the multidimensional nature of disadvantage, appreciating…
Sample size calculations play a central role in study design because sample size affects study interpretability, costs, hospital resources, and staff time. For most veterinary orthopaedic risk-factor studies, either the sample size…
Background: Bayesian Networks (BNs) are probabilistic graphical models that leverage Bayes' theorem to portray dependencies and cause-and-effect relationships between variables. These networks have gained prominence in the field of health…
Despite a large body of literature on trip inference using call detail record (CDR) data, a fundamental understanding of their limitations is lacking. In particular, because of the sparse nature of CDR data, users may travel to a location…
Wheelchair basketball, regulated by the International Wheelchair Basketball Federation, is a sport designed for individuals with physical disabilities. This paper presents a data-driven tool that effectively determines optimal team line-ups…
Combining strengths from deep learning and extreme value theory can help describe complex relationships between variables where extreme events have significant impacts (e.g., environmental or financial applications). Neural networks learn…
An election audit is risk-limiting if the audit limits (to a pre-specified threshold) the chance that an erroneous electoral outcome will be certified. Extant methods for auditing instant-runoff voting (IRV) elections are either not…
VanderWeele and Knol define biological interaction as an instance wherein "two exposures physically interact to bring about the outcome." A hallmark of biological interaction is that the total effect, produced when factors act together,…
Laparoscopy is an operation carried out in the abdomen or pelvis through small incisions with external visual control by a camera. This technique needs the abdomen to be insufflated with carbon dioxide to obtain a working space for surgical…
When a predictive model is in production, it must be monitored in real time to ensure that its performance does not suffer due to drift or abrupt changes to data. Ideally, this is done long before learning that the performance of the model…