Related papers: Evaluating Binary Outcome Classifiers Estimated fr…

Moving toward best practice when using propensity score weighting in survey observational studies

Propensity score weighting is a common method for estimating treatment effects with survey data. The method is applied to minimize confounding using measured covariates that are often different between individuals in treatment and control.…

Methodology · Statistics 2026-02-06 Yukang Zeng , Fan Li , Guangyu Tong

Estimation of logistic regression parameters for complex survey data: a real data based simulation study

In complex survey data, each sampled observation has assigned a sampling weight, indicating the number of units that it represents in the population. Whether sampling weights should or not be considered in the estimation process of model…

Methodology · Statistics 2024-09-20 Amaia Iparragirre , Irantzu Barrio , Jorge Aramendi , Inmaculada Arostegui

Generalizing Randomized Trial Findings to a Target Population using Complex Survey Population Data

Randomized trials are considered the gold standard for estimating causal effects. Trial findings are often used to inform policy and programming efforts, yet their results may not generalize well to a relevant target population due to…

Methodology · Statistics 2020-04-06 Benjamin Ackerman , Catherine R. Lesko , Juned Siddique , Ryoko Susukida , Elizabeth A. Stuart

Worth Weighting? How to Think About and Use Weights in Survey Experiments

The popularity of online surveys has increased the prominence of using weights that capture units' probabilities of inclusion for claims of representativeness. Yet, much uncertainty remains regarding how these weights should be employed in…

Methodology · Statistics 2017-08-16 Luke W. Miratrix , Jasjeet S. Sekhon , Alexander G. Theodoridis , Luis F. Campos

Sensitivity Analysis for Survey Weights

Survey weighting allows researchers to account for bias in survey samples, due to unit nonresponse or convenience sampling, using measured demographic covariates. Unfortunately, in practice, it is impossible to know whether the estimated…

Methodology · Statistics 2023-03-07 Erin Hartman , Melody Huang

Robust Estimation of Loss-Based Measures of Model Performance under Covariate Shift

We present methods for estimating loss-based measures of the performance of a prediction model in a target population that differs from the source population in which the model was developed, in settings where outcome and covariate data are…

Methodology · Statistics 2022-10-06 Samantha Morrison , Constantine Gatsonis , Issa J. Dahabreh , Bing Li , Jon A. Steingrimsson

Improving prediction models by incorporating external data with weights based on similarity

In clinical settings, we often face the challenge of building prediction models based on small observational data sets. For example, such a data set might be from a medical center in a multi-center study. Differences between centers might…

Methodology · Statistics 2024-05-29 Max Behrens , Maryam Farhadizadeh , Angelika Rohde , Alexander Rühle , Nils H. Nicolay , Harald Binder , Daniela Zöller

Weighted Mean Difference Statistics for Paired Data in Presence of Missing Values

Missing data is a common issue in many biomedical studies. Under a paired design, some subjects may have missing values in either one or both of the conditions due to loss of follow-up, insufficient biological samples, etc. Such partially…

Methodology · Statistics 2021-10-26 Yuntong Li , Brent J. Shelton , William St Clair , Heidi L. Weiss , John L. Villano , Arnold J. Stromberg , Chi Wang , Li Chen

Correcting Performance Estimation Bias in Imbalanced Classification with Minority Subconcepts

Class-level evaluation can conceal substantial performance disparities across subconcepts within the same class, causing models that perform well on average to fail on specific subpopulations. Prior work has shown that common evaluation…

Machine Learning · Computer Science 2026-04-30 Taylor Maxson , Roberto Corizzo , Yaning Wu , Nathalie Japkowicz , Colin Bellinger

Using Model-Assisted Calibration Methods to Improve Efficiency of Regression Analyses with Two-Phase Samples under Complex Survey Designs

Two-phase sampling designs are frequently employed in epidemiological studies and large-scale health surveys. In such designs, certain variables are exclusively collected within a second-phase random subsample of the initial first-phase…

Methodology · Statistics 2024-03-25 Lingxiao Wang

On the estimation of complex statistics combining different surveys

The importance of exploring a potential integration among surveys has been acknowledged in order to enhance effectiveness and minimize expenses. In this work, we employ the alignment method to combine information from two different surveys…

Methodology · Statistics 2024-04-09 Vasilis Chasiotis , Dimitris Karlis

Using Balancing Weights to Target the Treatment Effect on the Treated when Overlap is Poor

Inverse probability weights are commonly used in epidemiology to estimate causal effects in observational studies. Researchers can typically focus on either the average treatment effect or the average treatment effect on the treated with…

Methodology · Statistics 2022-10-05 Eli Ben-Michael , Luke Keele

Evolved Sample Weights for Bias Mitigation: Effectiveness Depends on the Fairness Objective

Machine learning models trained on real-world data may inadvertently make biased predictions that negatively impact marginalized communities. Reweighting, which assigns a weight to each data point used during model training, can mitigate…

Machine Learning · Computer Science 2026-03-20 Anil K. Saini , Jose Guadalupe Hernandez , Emily F. Wong , Debanshi Misra , Tiffani J. Bright , Jason H. Moore

Optimization of Survey Weights under a Large Number of Conflicting Constraints

In the analysis of survey data, sampling weights are needed for consistent estimation of the population. However, the original inverse probability weights from the survey sample design are typically modified to account for non-response, to…

Computation · Statistics 2025-08-19 Matthew R. Williams , Terrance D. Savitsky

The Power of Prognosis: Improving Covariate Balance Tests with Outcome Information

Scholars frequently use covariate balance tests to test the validity of natural experiments and related designs. Unfortunately, when measured covariates are unrelated to potential outcomes, balance is uninformative about key identification…

Methodology · Statistics 2025-10-15 Clara Bicalho , Adam Bouyamourn , Thad Dunning

Investigating an Alternative for Estimation from a Nonprobability Sample: Matching plus Calibration

Matching a nonprobability sample to a probability sample is one strategy both for selecting the nonprobability units and for weighting them. This approach has been employed in the past to select subsamples of persons from a large panel of…

Methodology · Statistics 2021-12-03 Zhan Liu , Richard Valliant

Choosing good subsamples for regression modelling

A common problem in health research is that we have a large database with many variables measured on a large number of individuals. We are interested in measuring additional variables on a subsample; these measurements may be newly…

Methodology · Statistics 2022-03-22 Thomas Lumley , Tong Chen

Robust Estimation of Propensity Score Weights via Subclassification

Weighting estimators based on propensity scores are widely used for causal estimation in a variety of contexts, such as observational studies, marginal structural models and interference. They enjoy appealing theoretical properties such as…

Methodology · Statistics 2021-10-06 Linbo Wang , Yuexia Zhang , Thomas S. Richardson , Xiao-Hua Zhou

Improving the estimation of the odds-ratio using auxiliary information

The odds ratio measure is used in health and social surveys where the odds of a certain event is to be compared between two populations. It is defined using logistic regression, and requires that data from surveys are accompanied by their…

Methodology · Statistics 2014-07-01 C. Goga , A Ruiz-Gazen

Model-based metrics: Sample-efficient estimates of predictive model subpopulation performance

Machine learning models $-$ now commonly developed to screen, diagnose, or predict health conditions $-$ are evaluated with a variety of performance metrics. An important first step in assessing the practical utility of a model is to…

Machine Learning · Statistics 2021-04-27 Andrew C. Miller , Leon A. Gatys , Joseph Futoma , Emily B. Fox