English
Related papers

Related papers: Semiparametric Efficient Data Integration Using th…

200 papers

This paper considers an empirical likelihood inference for parameters defined by general estimating equations, when data are missing at random. The efficiency of existing estimators depends critically on correctly specifying the conditional…

Methodology · Statistics 2016-12-06 Tianqing Liu , Xiaohui Yuan , Zhaohai Li , Aiyi Liu

Nonresponse after probability sampling is a universal challenge in survey sampling, often necessitating adjustments to mitigate sampling and selection bias simultaneously. This study explored the removal of bias and effective utilization of…

Methodology · Statistics 2025-11-13 Kosuke Morikawa , Kenji Beppu , Wataru Aida

Many statistical estimands of interest (e.g., in regression or causality) are functions of the joint distribution of multiple random variables. But in some applications, data is not available that measures all random variables on each…

Methodology · Statistics 2025-02-11 Yicong Jiang , Lucas Janson

During the past few decades, missing-data problems have been studied extensively, with a focus on the ignorable missing case, where the missing probability depends only on observable quantities. By contrast, research into non-ignorable…

Methodology · Statistics 2019-08-06 Yukun Liu , Pengfei Li , Jing Qin

We study semiparametric efficiency bounds and efficient estimation of parameters defined through general moment restrictions with missing data. Identification relies on auxiliary data containing information about the distribution of the…

Statistics Theory · Mathematics 2008-04-04 Xiaohong Chen , Han Hong , Alessandro Tarozzi

Sample selection is pervasive in applied economic studies. This paper develops semiparametric selection models that achieve point identification without relying on exclusion restrictions, an assumption long believed necessary for…

Econometrics · Economics 2025-02-11 Dongwoo Kim , Young Jun Lee

The aim of survey statistics is to produce estimates with a minimal bias and a corresponding acceptable variance given a specific budget, preferable with a minor response burden for the participants. In recent years, considerable efforts…

Methodology · Statistics 2026-04-02 Martin Hyllienmark , Gustaf Strandell

In this paper we study predictive mean matching mass imputation estimators to integrate data from probability and non-probability samples. We consider two approaches: matching predicted to predicted ($\hat{y}-\hat{y}$~matching; PMM A) and…

Methodology · Statistics 2024-06-18 Piotr Chlebicki , Łukasz Chrostowski , Maciej Beręsewicz

We consider statistical inference for a finite-dimensional parameter in a regular semiparametric model under a distributed setting with blockwise missingness, where entire blocks of variables are unavailable at certain sites and sharing…

Methodology · Statistics 2025-08-26 Jingyue Huang , Huiyuan Wang , Yuqing Lei , Yong Chen

In this review we cover the basics of efficient nonparametric parameter estimation (also called functional estimation), with a focus on parameters that arise in causal inference problems. We review both efficiency bounds (i.e., what is the…

Methodology · Statistics 2023-01-27 Edward H. Kennedy

We consider the efficient estimation of the semiparametric additive transformation model with current status data. A wide range of survival models and econometric models can be incorporated into this general transformation framework. We…

Statistics Theory · Mathematics 2011-05-09 Guang Cheng , Xiao Wang

Valid statistical inference is challenging when the sample is subject to unknown selection bias. Data integration can be used to correct for selection bias when we have a parallel probability sample from the same population with some common…

Methodology · Statistics 2023-07-24 Zhonglei Wang , Shu Yang , Jae Kwang Kim

We consider statistical inference under a semi-supervised setting where we have access to both a labeled dataset consisting of pairs $\{X_i, Y_i \}_{i=1}^n$ and an unlabeled dataset $\{ X_i \}_{i=n+1}^{n+N}$. We ask the question: under what…

Statistics Theory · Mathematics 2025-03-20 Zichun Xu , Daniela Witten , Ali Shojaie

Data analysis based on information from several sources is common in economic and biomedical studies. This setting is often referred to as the data fusion problem, which differs from traditional missing data problems since no complete data…

Methodology · Statistics 2022-04-07 Wei Li , Shanshan Luo , Wangli Xu

We study the identification and estimation of statistical functionals of multivariate data missing non-monotonically and not-at-random, taking a semiparametric approach. Specifically, we assume that the missingness mechanism satisfies what…

Methodology · Statistics 2022-12-26 Daniel Malinsky , Ilya Shpitser , Eric J Tchetgen Tchetgen

Suppose we have individual data from an internal study and various summary statistics from relevant external studies. External summary statistics have the potential to improve statistical inference for the internal population; however, it…

Methodology · Statistics 2026-02-06 Wenjie Hu , Ruoyu Wang , Wei Li , Wang Miao

Combining information from multiple samples is often needed in biomedical and economic studies, but the differences between these samples must be appropriately taken into account in the analysis of the combined data. We study estimation for…

Methodology · Statistics 2018-08-14 Heng Shu , Zhiqiang Tan

We propose a semiparametric data fusion framework for efficient inference on survival probabilities by integrating right-censored and current status data. Existing data fusion methods focus largely on fusing right-censored data only, while…

Methodology · Statistics 2025-09-15 Xiudi Li , Sijia Li

Integrating non-probability samples into finite-population inference typically requires modeling unknown selection probabilities under a missing-at-random (MAR) assumption that is difficult to verify. We propose a design-based alternative…

Methodology · Statistics 2026-05-08 Andrius Čiginas , Ieva Burakauskaitė , Jae Kwang Kim

We propose nonparametric identification and semiparametric estimation of joint potential outcome distributions in the presence of confounding. First, in settings with observed confounding, we derive tighter, covariate-informed bounds on the…

Methodology · Statistics 2026-02-19 Jianle Sun , Kun Zhang
‹ Prev 1 2 3 10 Next ›