Related papers: Endogenous post-stratification in surveys: classif…

Automated Selection of Post-Strata using a Model-Assisted Regression Tree Estimator

Auxiliary information can increase the efficiency of survey estimators through an assisting model when the model captures some of the relationship between the auxiliary data and the study variables. Despite their superior properties,…

Methodology · Statistics 2017-12-18 Kelly S. McConville , Daniell Toth

Stratified Sampling for Model-Assisted Estimation with Surrogate Outcomes

In many randomized trials, outcomes such as essays or open-ended responses must be manually scored as a preliminary step to impact analysis, a process that is costly and limiting. Model-assisted estimation offers a way to combine surrogate…

Methodology · Statistics 2026-02-16 Reagan Mozer , Nicole E. Pashley , Luke Miratrix

Bridging Stratification and Regression Adjustment: Batch-Adaptive Stratification with Post-Design Adjustment in Randomized Experiments

To increase statistical efficiency in a randomized experiment, researchers often use stratification (i.e., blocking) in the design stage. However, conventional practices of stratification fail to exploit valuable information about the…

Methodology · Statistics 2025-10-28 Zikai Li

Prevalence and trend estimation from observational data with highly variable post-stratification weights

In observational surveys, post-stratification is used to reduce bias resulting from differences between the survey population and the population under investigation. However, this can lead to inflated post-stratification weights and,…

Applications · Statistics 2016-06-24 Yannick Vandendijck , Christel Faes , Niel Hens

Improving optimal subsampling through stratification

Recent works have proposed optimal subsampling algorithms to improve computational efficiency in large datasets and to design validation studies in the presence of measurement error. Existing approaches generally fall into two categories:…

Methodology · Statistics 2025-12-25 Jasper B. Yang , Thomas Lumley , Bryan E. Shepherd , Pamela A. Shaw

Principal Stratification with Continuous Post-Treatment Variables: Nonparametric Identification and Semiparametric Estimation

Post-treatment variables often complicate causal inference. They appear in many scientific problems, including noncompliance, truncation by death, mediation, and surrogate endpoint evaluation. Principal stratification is a strategy to…

Methodology · Statistics 2024-04-04 Sizhu Lu , Zhichao Jiang , Peng Ding

Estimation of entropy measures for categorical variables with spatial correlation

Entropy is a measure of heterogeneity widely used in applied sciences, often when data are collected over space. Recently, a number of approaches has been proposed to include spatial information in entropy. The aim of entropy is to…

Statistics Theory · Mathematics 2019-11-12 Linda Altieri , Daniela Cocchi , Giulia Roli

Efficient Treatment Effect Estimation with Out-of-bag Post-stratification

Post-stratification is often used to estimate treatment effects with higher efficiency. However, the majority of existing post-stratification frameworks depend on prior knowledge of the distributions of covariates and assume that the units…

Methodology · Statistics 2023-09-13 Taebin Kim , Lili Wang , Randy Lai , Sangho Yoon

Improving multilevel regression and poststratification with structured priors

A central theme in the field of survey statistics is estimating population-level quantities through data coming from potentially non-representative samples of the population. Multilevel Regression and Poststratification (MRP), a model-based…

Methodology · Statistics 2020-07-17 Yuxiang Gao , Lauren Kennedy , Daniel Simpson , Andrew Gelman

Stratified Random Sampling for Dependent Inputs

A new approach of obtaining stratified random samples from statistically dependent random variables is described. The proposed method can be used to obtain samples from the input space of a computer forward model in estimating expectations…

Methodology · Statistics 2019-11-25 Anirban Mondal , Abhijit Mandal

Cost Issue in Estimation of Proportion in a Finite Population Divided Among Two Strata

The problem of estimation of the proportion of units with a given attribute in a~finite population is considered. From the population a sample is drawn due to the simple random sampling without replacement. There are limited funds for…

Statistics Theory · Mathematics 2019-03-26 Dominik Sieradzki , Wojciech Zieliński

Near Optimal Stratified Sampling

The performance of a machine learning system is usually evaluated by using i.i.d.\ observations with true labels. However, acquiring ground truth labels is expensive, while obtaining unlabeled samples may be cheaper. Stratified sampling can…

Machine Learning · Computer Science 2019-07-29 Tiancheng Yu , Xiyu Zhai , Suvrit Sra

Analysis of Ordinal Populations from Judgment Post-Stratification

In surveys requiring cost efficiency, such as medical research, measuring the variable of interest (e.g., disease status) is expensive and/or time-consuming; However, we often have access to easily attainable characteristics about sampling…

Methodology · Statistics 2023-02-23 Amirhossein Alvandi , Armin Hatefi

Post-sampling crowdsourced data to allow reliable statistical inference: the case of food price indices in Nigeria

Sound policy and decision making in developing countries is often limited by the lack of timely and reliable data. Crowdsourced data may provide a valuable alternative for data collection and analysis, e. g. in remote and insecure areas or…

Methodology · Statistics 2020-03-30 Giuseppe Arbia , Gloria Solano-Hermosilla , Fabio Micale , Vincenzo Nardelli , Giampiero Genovese

Improving instrumental variable estimators with post-stratification

Experiments studying get-out-the-vote (GOTV) efforts estimate the causal effect of various mobilization efforts on voter turnout. However, there is often substantial noncompliance in these studies. A usual approach is to use an instrumental…

Methodology · Statistics 2024-07-02 Nicole E. Pashley , Luke Keele , Luke W. Miratrix

A New Design-Based Variance Estimator for Finely Stratified Experiments

This paper considers the problem of design-based inference for the average treatment effect in finely stratified experiments. Here, by "design-based'' we mean that the only source of uncertainty stems from the randomness in treatment…

Econometrics · Economics 2025-05-08 Yuehao Bai , Xun Huang , Joseph P. Romano , Azeem M. Shaikh , Max Tabord-Meehan

From Samples to Persistent Stratified Homotopy Types

The natural occurrence of singular spaces in applications has led to recent investigations on performing topological data analysis (TDA) in a stratified framework. In many applications, there is no a priori information on what points should…

Algebraic Topology · Mathematics 2023-12-12 Tim Mäder , Lukas Waas

Small Area Shrinkage Estimation

The need for small area estimates is increasingly felt in both the public and private sectors in order to formulate their strategic plans. It is now widely recognized that direct small area survey estimates are highly unreliable owing to…

Methodology · Statistics 2012-03-26 G. Datta , M. Ghosh

Post-Estimation Smoothing: A Simple Baseline for Learning with Side Information

Observational data are often accompanied by natural structural indices, such as time stamps or geographic locations, which are meaningful to prediction tasks but are often discarded. We leverage semantically meaningful indexing data while…

Machine Learning · Computer Science 2020-03-16 Esther Rolf , Michael I. Jordan , Benjamin Recht

stratamatch: Prognostic ScoreStratification using a Pilot Design

Optimal propensity score matching has emerged as one of the most ubiquitous approaches for causal inference studies on observational data; However, outstanding critiques of the statistical properties of propensity score matching have cast…

Computation · Statistics 2021-03-01 Rachael C. Aikens , Joseph Rigdon , Justin Lee , Michael Baiocchi , Andrew B. Goldstone , Peter Chiu , Y. Joseph Woo , Jonathan H. Chen