应用统计
Carpooling has the potential to transform itself into a mass transportation mode by abandoning its adherence to deterministic passenger-driver matching for door-to-door journeys, and by adopting instead stochastic matching on a network of…
Carpooling is an integral component in smart carbon-neutral cities, in particular to facilitate homework commuting. We study an innovative carpooling service developed by the start-up Ecov which specialises in homework commutes in…
P-values are widely used in both the social and natural sciences to quantify the statistical significance of observed results. The recent surge of big data research has made the p-value an even more popular tool to test the significance of…
This paper describes the use of survival analysis and simulation to model the lifetime of high voltage instrument transformers in the Dutch transmission sys-tem. To represent asset aging, the non-parametric Kaplan-Meier method is used to…
Vehicular air pollution has created an ongoing air quality and public health crisis. Despite growing knowledge of racial injustice in exposure levels, less is known about the relationship between the production of and exposure to such…
The technical capacity to monitor patients with a mobile device has drastically expanded, but data produced from this approach are often difficult to interpret. We present a solution to produce a meaningful representation of patient status…
Principal component analysis is a long-standing go-to method for exploring multivariate data. The principal components are linear combinations of the original variables, ordered by descending variance. The first few components typically…
Although most models for rainfall extremes focus on point-wise values, it is aggregated precipitation over areas up to river catchment scale that is of the most interest. To capture the joint behaviour of precipitation aggregates evaluated…
Studies of vaccine efficacy often record both the incidence of vaccine-targeted virus strains (primary outcome) and the incidence of non-targeted strains (secondary outcome). However, standard estimates of vaccine efficacy on targeted…
It is said that we live in the age of data, and that data is ubiquitous and readily available if one has the tools to harness it. That may well be true, but so is the opposite. It is ever more common to try to start a data science project…
Income inequality distribution between social groups has been a global challenge. The focus of this study is to investigate the potential impact of female income on family size and purchasing power. Using statistical methods such as simple…
In an era where external data and computational capabilities far exceed statistical agencies' own resources and capabilities, they face the renewed challenge of protecting the confidentiality of underlying microdata when publishing…
This paper addresses a common problem with hierarchical time series. Time series analysis demands the series for a model to be the sum of multiple series at corresponding sub-levels. Hierarchical Time Series presents a two-fold problem.…
The uniformly optimal search plan is a cornerstone of the optimal search theory. It is well-known that when the target distribution is circular normal and the detection function is exponential, the uniformly search plan has several…
Sex difference in allele frequency is an emerging topic that is critical to our understanding of ascertainment bias, as well as data quality particularly of the largely overlooked X chromosome. To detect sex difference in allele frequency…
We discuss some philosophical, methodological and practical problems concerning the use of the test-negative design for COVID-19 vaccines. These problems limit the use of this design considerably.
Predictions of hydrological models should be probabilistic in nature. Our aim is to introduce a method that estimates directly the uncertainty of hydrological simulations using expectiles, thus complementing previous quantile-based direct…
Characterizing the sleep-wake cycle in adolescents is an important prerequisite to better understand the association of abnormal sleep patterns with subsequent clinical and behavioral outcomes. The aim of this research was to develop hidden…
Huge amounts of money are invested every year by football clubs on transfers. For both growth and survival, it is crucial for recruiting departments to make smart choices when targeting players. Therefore, it is very important to identify…
The aim of this study was to improve previous zonal approaches to expected possession value (EPV) models in low data availability sports by introducing a Bayesian Mixture Model approach to an EPV model in rugby league. 99,966 observations…