Related papers: Single-Index Model-Assisted Estimation In Survey S…

Nonparametric Additive Model-assisted Estimation for Survey Data

An additive model-assisted nonparametric method is investigated to estimate the finite population totals of massive survey data with the aid of auxiliary information. A class of estimators is proposed to improve the precision of the well…

Methodology · Statistics 2019-03-19 Li Wang , Suojin Wang

Semiparametric adaptive estimation under informative sampling

In survey sampling, survey data do not necessarily represent the target population, and the samples are often biased. However, information on the survey weights aids in the elimination of selection bias. The Horvitz-Thompson estimator is a…

Methodology · Statistics 2024-04-05 Kosuke Morikawa , Yoshikazu Terada , Jae Kwang Kim

Semi-supervised Inference: General Theory and Estimation of Means

We propose a general semi-supervised inference framework focused on the estimation of the population mean. As usual in semi-supervised settings, there exists an unlabeled sample of covariate vectors and a labeled sample consisting of…

Methodology · Statistics 2018-08-15 Anru Zhang , Lawrence D. Brown , T. Tony Cai

A step towards the integration of machine learning and classic model-based survey methods

The usage of machine learning methods in traditional surveys including official statistics, is still very limited. Therefore, we propose a predictor supported by these algorithms, which can be used to predict any population or subpopulation…

Methodology · Statistics 2025-07-14 Tomasz Żądło , Adam Chwila

Semiparametric Penalized Spline Regression

In this paper, we propose a new semiparametric regression estimator by using a hybrid technique of a parametric approach and a nonparametric penalized spline method. The overall shape of the true regression function is captured by the…

Statistics Theory · Mathematics 2012-02-17 Takuma Yoshida , Kanta Naito

Spline Single-Index Prediction Model

For the past two decades, single-index model, a special case of projection pursuit regression, has proven to be an efficient way of coping with the high dimensional problem in nonparametric regression. In this paper, based on weakly…

Statistics Theory · Mathematics 2007-05-23 Li Wang , Lijian Yang

A semiparametric single-index estimator for a class of estimating equation models

We propose a two-step pseudo-maximum likelihood procedure for semiparametric single-index regression models where the conditional variance is a known function of the regression and an additional parameter. The Poisson single-index…

Statistics Theory · Mathematics 2017-04-27 Marian Hristache , Weiyu Li , Valentin Patilea

Small Area Quantile Estimation

Sample surveys are widely used to obtain information about totals, means, medians, and other parameters of finite populations. In many applications, similar information is desired for subpopulations such as individuals in specific…

Methodology · Statistics 2017-05-30 Jiahua Chen , Yukun Liu

Optimal Sub-sampling with Influence Functions

Sub-sampling is a common and often effective method to deal with the computational challenges of large datasets. However, for most statistical models, there is no well-motivated approach for drawing a non-uniform subsample. We show that the…

Machine Learning · Statistics 2017-09-07 Daniel Ting , Eric Brochu

Efficient Estimation of Nonlinear Finite Population Parameters Using Nonparametrics

Currently, the high-precision estimation of nonlinear parameters such as Gini indices, low-income proportions or other measures of inequality is particularly crucial. In the present paper, we propose a general class of estimators for such…

Methodology · Statistics 2014-07-01 Camelia Goga , Anne Ruiz-Gazen

Semiparametric Mixtures of Regressions with Single-index for Model Based Clustering

In this article, we propose two classes of semiparametric mixture regression models with single-index for model based clustering. Unlike many semiparametric/nonparametric mixture regression models that can only be applied to low dimensional…

Methodology · Statistics 2017-08-15 Sijia Xiang , Weixin Yao

Survey Design and Estimating Equations when Combining Big Data with Probability Samples

The use of big data in official statistics and the applied sciences is accelerating, but statistics computed using only big data often suffer from substantial selection bias. This leads to inaccurate estimation and invalid statistical…

Methodology · Statistics 2023-08-11 Ryan Covey , Lucca Buonamano

A spline-assisted semiparametric approach to non-parametric measurement error models

It is well known that the minimax rates of convergence of nonparametric density and regression function estimation of a random variable measured with error is much slower than the rate in the error free case. Surprisingly, we show that if…

Statistics Theory · Mathematics 2019-08-21 Fei Jiang , Yanyuan Ma , Raymond J. Carroll

Active sampling: A machine-learning-assisted framework for finite population inference with optimal subsamples

Data subsampling has become widely recognized as a tool to overcome computational and economic bottlenecks in analyzing massive datasets. We contribute to the development of adaptive design for estimation of finite population…

Methodology · Statistics 2024-07-08 Henrik Imberg , Xiaomi Yang , Carol Flannagan , Jonas Bärgman

Optimal subsampling algorithm for the marginal model with large longitudinal data

Big data is ubiquitous in practices, and it has also led to heavy computation burden. To reduce the calculation cost and ensure the effectiveness of parameter estimators, an optimal subset sampling method is proposed to estimate the…

Methodology · Statistics 2023-11-16 Haohui Han , Liya Fu

Integration of survey data and big observational data for finite population inference using mass imputation

Multiple data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example in finite-population inference, we consider an imputation approach to combining a probability sample with big…

Methodology · Statistics 2018-07-10 Shu Yang , Jae Kwang Kim

Estimation for a Partial-Linear Single-Index Model

In this paper, we study the estimation for a partial-linear single-index model. A two-stage estimation procedure is proposed to estimate the link function for the single index and the parameters in the single index, as well as the…

Methodology · Statistics 2009-05-14 Jane-Ling Wang , Liugen Xue , Lixing Zhu , Yun Sam Chong

Mixture of Regression Models with Single-Index

In this article, we propose a class of semiparametric mixture regression models with single-index. We argue that many recently proposed semiparametric/nonparametric mixture regression models can be considered special cases of the proposed…

Methodology · Statistics 2016-10-04 Sijia Xiang , Weixin Yao

Boosting Test Performance with Importance Sampling--a Subpopulation Perspective

Despite empirical risk minimization (ERM) is widely applied in the machine learning community, its performance is limited on data with spurious correlation or subpopulation that is introduced by hidden attributes. Existing literature…

Machine Learning · Computer Science 2024-12-18 Hongyu Shen , Zhizhen Zhao

Efficient semiparametric estimation in generalized partially linear additive models for longitudinal/clustered data

We consider efficient estimation of the Euclidean parameters in a generalized partially linear additive models for longitudinal/clustered data when multiple covariates need to be modeled nonparametrically, and propose an estimation…

Statistics Theory · Mathematics 2014-02-05 Guang Cheng , Lan Zhou , Jianhua Z. Huang