English
Related papers

Related papers: Endogenous post-stratification in surveys: classif…

200 papers

Auxiliary information can increase the efficiency of survey estimators through an assisting model when the model captures some of the relationship between the auxiliary data and the study variables. Despite their superior properties,…

Methodology · Statistics 2017-12-18 Kelly S. McConville , Daniell Toth

In many randomized trials, outcomes such as essays or open-ended responses must be manually scored as a preliminary step to impact analysis, a process that is costly and limiting. Model-assisted estimation offers a way to combine surrogate…

Methodology · Statistics 2026-02-16 Reagan Mozer , Nicole E. Pashley , Luke Miratrix

To increase statistical efficiency in a randomized experiment, researchers often use stratification (i.e., blocking) in the design stage. However, conventional practices of stratification fail to exploit valuable information about the…

Methodology · Statistics 2025-10-28 Zikai Li

In observational surveys, post-stratification is used to reduce bias resulting from differences between the survey population and the population under investigation. However, this can lead to inflated post-stratification weights and,…

Applications · Statistics 2016-06-24 Yannick Vandendijck , Christel Faes , Niel Hens

Recent works have proposed optimal subsampling algorithms to improve computational efficiency in large datasets and to design validation studies in the presence of measurement error. Existing approaches generally fall into two categories:…

Methodology · Statistics 2025-12-25 Jasper B. Yang , Thomas Lumley , Bryan E. Shepherd , Pamela A. Shaw

Post-treatment variables often complicate causal inference. They appear in many scientific problems, including noncompliance, truncation by death, mediation, and surrogate endpoint evaluation. Principal stratification is a strategy to…

Methodology · Statistics 2024-04-04 Sizhu Lu , Zhichao Jiang , Peng Ding

Entropy is a measure of heterogeneity widely used in applied sciences, often when data are collected over space. Recently, a number of approaches has been proposed to include spatial information in entropy. The aim of entropy is to…

Statistics Theory · Mathematics 2019-11-12 Linda Altieri , Daniela Cocchi , Giulia Roli

Post-stratification is often used to estimate treatment effects with higher efficiency. However, the majority of existing post-stratification frameworks depend on prior knowledge of the distributions of covariates and assume that the units…

Methodology · Statistics 2023-09-13 Taebin Kim , Lili Wang , Randy Lai , Sangho Yoon

A central theme in the field of survey statistics is estimating population-level quantities through data coming from potentially non-representative samples of the population. Multilevel Regression and Poststratification (MRP), a model-based…

Methodology · Statistics 2020-07-17 Yuxiang Gao , Lauren Kennedy , Daniel Simpson , Andrew Gelman

A new approach of obtaining stratified random samples from statistically dependent random variables is described. The proposed method can be used to obtain samples from the input space of a computer forward model in estimating expectations…

Methodology · Statistics 2019-11-25 Anirban Mondal , Abhijit Mandal

The problem of estimation of the proportion of units with a given attribute in a~finite population is considered. From the population a sample is drawn due to the simple random sampling without replacement. There are limited funds for…

Statistics Theory · Mathematics 2019-03-26 Dominik Sieradzki , Wojciech Zieliński

The performance of a machine learning system is usually evaluated by using i.i.d.\ observations with true labels. However, acquiring ground truth labels is expensive, while obtaining unlabeled samples may be cheaper. Stratified sampling can…

Machine Learning · Computer Science 2019-07-29 Tiancheng Yu , Xiyu Zhai , Suvrit Sra

In surveys requiring cost efficiency, such as medical research, measuring the variable of interest (e.g., disease status) is expensive and/or time-consuming; However, we often have access to easily attainable characteristics about sampling…

Methodology · Statistics 2023-02-23 Amirhossein Alvandi , Armin Hatefi

Sound policy and decision making in developing countries is often limited by the lack of timely and reliable data. Crowdsourced data may provide a valuable alternative for data collection and analysis, e. g. in remote and insecure areas or…

Experiments studying get-out-the-vote (GOTV) efforts estimate the causal effect of various mobilization efforts on voter turnout. However, there is often substantial noncompliance in these studies. A usual approach is to use an instrumental…

Methodology · Statistics 2024-07-02 Nicole E. Pashley , Luke Keele , Luke W. Miratrix

This paper considers the problem of design-based inference for the average treatment effect in finely stratified experiments. Here, by "design-based'' we mean that the only source of uncertainty stems from the randomness in treatment…

Econometrics · Economics 2025-05-08 Yuehao Bai , Xun Huang , Joseph P. Romano , Azeem M. Shaikh , Max Tabord-Meehan

The natural occurrence of singular spaces in applications has led to recent investigations on performing topological data analysis (TDA) in a stratified framework. In many applications, there is no a priori information on what points should…

Algebraic Topology · Mathematics 2023-12-12 Tim Mäder , Lukas Waas

The need for small area estimates is increasingly felt in both the public and private sectors in order to formulate their strategic plans. It is now widely recognized that direct small area survey estimates are highly unreliable owing to…

Methodology · Statistics 2012-03-26 G. Datta , M. Ghosh

Observational data are often accompanied by natural structural indices, such as time stamps or geographic locations, which are meaningful to prediction tasks but are often discarded. We leverage semantically meaningful indexing data while…

Machine Learning · Computer Science 2020-03-16 Esther Rolf , Michael I. Jordan , Benjamin Recht

Optimal propensity score matching has emerged as one of the most ubiquitous approaches for causal inference studies on observational data; However, outstanding critiques of the statistical properties of propensity score matching have cast…

‹ Prev 1 2 3 10 Next ›