Related papers: Preferential Sampling for Bivariate Spatial Data

A Look into the Problem of Preferential Sampling from the Lens of Survey Statistics

An evolving problem in the field of spatial and ecological statistics is that of preferential sampling, where biases may be present due to a relationship between sample data locations and a response of interest. This field of research bears…

Methodology · Statistics 2022-03-11 Daniel Vedensky , Paul A. Parker , Scott H. Holan

Optimal Design in Geostatistics under Preferential Sampling

This paper analyses the effect of preferential sampling in Geostatistics when the choice of new sampling locations is the main interest of the researcher. A Bayesian criterion based on maximizing utility functions is used. Simulated studies…

Statistics Theory · Mathematics 2015-09-23 Gustavo da Silva Ferreira , Dani Gamerman

Preferential sampling for presence/absence data and for fusion of presence/absence data with presence-only data

Presence/absence data and presence-only data are the two customary sources for learning about species distributions over a region. We illuminate the fundamental modeling differences between the two types of data. Most simply, locations are…

Methodology · Statistics 2019-04-04 Alan. E. Gelfand , Shinichiro Shirota

Reducing estimation bias in adaptively changing monitoring networks with preferential site selection

This paper explores the topic of preferential sampling, specifically situations where monitoring sites in environmental networks are preferentially located by the designers. This means the data arising from such networks may not accurately…

Applications · Statistics 2014-12-04 James V. Zidek , Gavin Shaddick , Carolyn G. Taylor

Exact Bayesian Inference for Geostatistical Models under Preferential Sampling

Preferential sampling is a common feature in geostatistics and occurs when the locations to be sampled are chosen based on information about the phenomena under study. In this case, point pattern models are commonly used as the probability…

Methodology · Statistics 2022-10-27 Douglas Mateus da Silva , Dani Gamerman

Spatial causal inference in the presence of preferential sampling to study the impacts of marine protected areas

Marine Protected Areas (MPAs) have been established globally to conserve marine resources. Given their maintenance costs and impact on commercial fishing, it is critical to evaluate their effectiveness to support future conservation. In…

Methodology · Statistics 2026-05-11 Dongjae Son , Brian J. Reich , Erin M. Schliep , Shu Yang , David A. Gill

On Ignorability of Preferential Sampling in Geostatistics

Preferential sampling has attracted considerable attention in geostatistics since the pioneering work of Diggle et al. (2010). A variety of likelihood-based approaches have been developed to correct estimation bias by explicitly modelling…

Methodology · Statistics 2025-11-06 Changqing Lu , Ganggang Xu , Junho Yang , Yongtao Guan

How to perform modeling with independent and preferential data jointly?

Continuous space species distribution models (SDMs) have a long-standing history as a valuable tool in ecological statistical analysis. Geostatistical and preferential models are both common models in ecology. Geostatistical models are…

Applications · Statistics 2023-07-17 Mario Figueira , David Conesa , Antonio López-Quílez , Iosu Paradinas

Goal-oriented adaptive sampling under random field modelling of response probability distributions

In the study of natural and artificial complex systems, responses that are not completely determined by the considered decision variables are commonly modelled probabilistically, resulting in response distributions varying across decision…

Methodology · Statistics 2021-10-07 Athénaïs Gautier , David Ginsbourger , Guillaume Pirot

Taking advantage of sampling deisgns in Bayesian spatial small ares survey studies

Spatial small area estimation models have become very popular in some contexts, such as disease mapping. Data in disease mapping studies are exhaustive, that is, the available data are supposed to be a complete register of all the…

Methodology · Statistics 2021-12-13 Carlos Vergara-Hernández , Marc Marí-DellOlmo , Laura Oliveras , Miguel A. Martinez-Beneito

Model-assisted survey sampling with Bayesian optimization

Survey sampling plays an important role in the efficient allocation and management of resources. The essence of survey sampling lies in acquiring a sample of data points from a population and subsequently using this sample to estimate the…

Methodology · Statistics 2024-01-29 Jonne Pohjankukka , Sakari Tuominen , Jukka Heikkonen

Quantifying and mitigating the effect of preferential sampling on phylodynamic inference

Phylodynamics seeks to estimate effective population size fluctuations from molecular sequences of individuals sampled from a population of interest. One way to accomplish this task formulates an observed sequence data likelihood exploiting…

Methodology · Statistics 2016-04-27 Michael D. Karcher , Julia A. Palacios , Trevor Bedford , Marc A. Suchard , Vladimir N. Minin

On the Origins of Sampling Bias: Implications on Fairness Measurement and Mitigation

Accurately measuring discrimination is crucial to faithfully assessing fairness of trained machine learning (ML) models. Any bias in measuring discrimination leads to either amplification or underestimation of the existing disparity.…

Machine Learning · Computer Science 2025-03-25 Sami Zhioua , Ruta Binkyte , Ayoub Ouni , Farah Barika Ktata

A general theory for preferential sampling in environmental networks

This paper presents a general model framework for detecting the preferential sampling of environmental monitors recording an environmental process across space and/or time. This is achieved by considering the joint distribution of an…

Methodology · Statistics 2019-04-09 Joe Watson , James V. Zidek , Gavin Shaddick

High-dimensional Multivariate Geostatistics: A Bayesian Matrix-Normal Approach

Joint modeling of spatially-oriented dependent variables is commonplace in the environmental sciences, where scientists seek to estimate the relationships among a set of environmental outcomes accounting for dependence among these outcomes…

Methodology · Statistics 2021-03-22 Lu Zhang , Sudipto Banerjee , Andrew O. Finley

Model-specific Data Subsampling with Influence Functions

Model selection requires repeatedly evaluating models on a given dataset and measuring their relative performances. In modern applications of machine learning, the models being considered are increasingly more expensive to evaluate and the…

Machine Learning · Computer Science 2020-10-21 Anant Raj , Cameron Musco , Lester Mackey , Nicolo Fusi

Demystifying Spatial Confounding

Spatial confounding is a fundamental issue in spatial regression models which arises because spatial random effects, included to approximate unmeasured spatial variation, are typically not independent of covariates in the model. This can…

Methodology · Statistics 2025-07-15 Emiko Dupont , Isa Marques , Thomas Kneib

Modelling ocean temperatures from bio-probes under preferential sampling

In the last 25 years there has been an important increase in the amount of data collected from animal-mounted sensors (bio-probes), which are often used to study the animals' behaviour or environment. We focus here on an example of the…

Applications · Statistics 2019-06-20 Daniel Dinsdale , Matias Salibian-Barrera

Generalized Linear Models for Longitudinal Data with Biased Sampling Designs: A Sequential Offsetted Regressions Approach

Biased sampling designs can be highly efficient when studying rare (binary) or low variability (continuous) endpoints. We consider longitudinal data settings in which the probability of being sampled depends on a repeatedly measured…

Methodology · Statistics 2020-01-14 Lee S. McDaniel , Jonathan S. Schildcrout , Enrique F. Schisterman , Paul J. Rathouz

A Primer on Domain Adaptation

Standard supervised machine learning assumes that the distribution of the source samples used to train an algorithm is the same as the one of the target samples on which it is supposed to make predictions. However, as any data scientist…

Machine Learning · Computer Science 2020-02-12 Pirmin Lemberger , Ivan Panico