Applied Statistics
The digital twin approach has gained recognition as a promising solution to the challenges faced by the Architecture, Engineering, Construction, Operations, and Management (AECOM) industries. However, its broader application across AECOM…
The Housing and Land Survey (HLS) of Japan provides municipality-level grouped data on household incomes. Although these data can be used for effective local policymaking, their analyses are hindered by several challenges, such as limited…
Delhi's air quality has recently become severely degraded. This study explores the trends of probable contributors to Delhi's deteriorating air quality by analyzing data from 2014 to 2024 -- a period that has not been the central focus of…
Here, we outline how Rothman diagrams provide a geometric perspective that can help epidemiologists understand the relationships between effect measure modification (which we call association measure modification), collapsibility, and…
The complexity of experimental setups in the field of cyber-physical energy systems has motivated the development of the Holistic Test Description (HTD), a well-adopted approach for documenting and communicating test designs. Uncertainty,…
Frailty assessment is crucial for stratifying populations and addressing healthcare challenges associated with ageing. This study proposes a Frailty Index based on administrative health data, with the aim of facilitating informed…
This study explores the potential of large language models (LLMs) to enhance expert forecasting through ensemble learning. Leveraging the European Central Bank's Survey of Professional Forecasters (SPF) dataset, we propose a comprehensive…
The global energy landscape is experiencing a transformative shift, with an increasing emphasis on sustainable and clean energy sources. Hydrogen remains a promising candidate for decarbonization, energy storage, and as an alternative fuel.…
The spatial transcriptomics (ST) data produced by recent biotechnologies, such as CosMx and Xenium, contain a huge amount of information about cancer tissue samples, which has great potential for cancer research via detection of community: a…
Pressing is a fundamental defensive strategy in football, characterized by applying pressure on the team in possession of the ball to regain possession. Despite its significance, existing metrics for measuring pressing often lack precision or…
We address the problem of identifying functional interactions among stochastic neurons with variable-length memory from their spiking activity. The neuronal network is modeled by a stochastic system of interacting point processes with…
One Day International (ODI) cricket has undergone significant transformations over the past four decades. This study examines the evolution of ODI cricket from 1987 to 2023, analyzing about 4000…
We propose an approach utilizing gamma-distributed random variables, coupled with log-Gaussian modeling, to generate synthetic datasets suitable for training neural networks. This addresses the challenge of limited real observations in…
We propose a statistical model for narrowing line shapes in spectroscopy that are well approximated as linear combinations of Lorentzian or Voigt functions. We introduce a log-Gaussian Cox process to represent the peak locations, thereby…
In Major League Baseball, every ballpark is different, with different dimensions and climates. These differences make some ballparks more conducive to hitting home runs than others. Several factors conspire to make estimation of these…
Dengue is an infectious disease that imposes a significant socioeconomic and disease burden in many tropical and subtropical regions of the world. This work aims to provide additional insight into the association between dengue and climate in…
Drought is a significant natural phenomenon with profound environmental, economic, and societal impacts. Effective monitoring of drought characteristics -- such as intensity, magnitude, and duration -- is crucial for resilience and…
Despite a global decline in motor vehicle crash fatalities due to improved research and road safety policies, road traffic injuries remain a significant public health concern. The World Health Organization 2023 report highlights that road…
Anomaly detection is the task of identifying rarely occurring (i.e., abnormal or anomalous) samples that differ from almost all other samples in a dataset. As the patterns of abnormal samples are usually not known a priori, this task is highly…
We revisit a foundational question in golf analytics: how important are the core components of performance--driving, approach play, and putting--in explaining success on the PGA Tour? Building on Mark Broadie's strokes gained analyses, we…