Related papers: Poisson Regression with Survey Data

An Investigation of Methods for Handling Missing Data with Penalized Regression

We investigate methods for penalized regression in the presence of missing observations. This paper introduces a method for estimating the parameters which compensates for the missing observations. We first, derive an unbiased estimator of…

Applications · Statistics 2013-10-09 Yunjin Choi , Robert Tibshirani

Measurement Errors in Semiparametric Generalized Regression Models

Regression models that ignore measurement error in predictors may produce highly biased estimates leading to erroneous inferences. It is well known that it is extremely difficult to take measurement error into account in Gaussian…

Methodology · Statistics 2023-02-03 Mohammad W. Hattab , David Ruppert

Regression adjustment in completely randomized experiments with a diverging number of covariates

Randomized experiments have become important tools in empirical research. In a completely randomized treatment-control experiment, the simple difference in means of the outcome is unbiased for the average treatment effect, and covariate…

Statistics Theory · Mathematics 2021-01-01 Lihua Lei , Peng Ding

Linear Regression with Sparsely Permuted Data

In regression analysis of multivariate data, it is tacitly assumed that response and predictor variables in each observed response-predictor pair correspond to the same entity or unit. In this paper, we consider the situation of "permuted…

Statistics Theory · Mathematics 2017-11-17 Martin Slawski , Emanuel Ben-David

Marginal Analysis of Count Time Series in the Presence of Missing Observations

Time series in real-world applications often have missing observations, making typical analytical methods unsuitable. One method for dealing with missing data is the concept of amplitude modulation. While this principle works with any data,…

Methodology · Statistics 2024-04-19 Simon Nik

Regression-based imputation of explanatory discrete missing data

Imputation of missing values is a strategy for handling non-responses in surveys or data loss in measurement processes, which may be more effective than ignoring them. When the variable represents a count, the literature dealing with this…

Applications · Statistics 2020-07-31 Gilma Hernández-Herrera , Albert Navarro , David Moriña

Nonparametric modal regression with missing response observations

Modal regression has emerged as a flexible alternative to classical regression models when the conditional mean or median are unable to adequately capture the underlying relation between a response and a predictor variable. This approach is…

Methodology · Statistics 2025-04-08 Ana Pérez-González , Tomás R. Cotos-Yáñez , Rosa M. Crujeiras

Poisson Regression in one Covariate on Massive Data

The goal of subsampling is to select an informative subset of all observations, when using the full data for statistical analysis is not viable. We construct locally $ D $-optimal subsampling designs under a Poisson regression model with a…

Statistics Theory · Mathematics 2024-03-28 Torsten Reuter , Rainer Schwabe

On regression adjustments in experiments with several treatments

Regression adjustments are often made to experimental data. Since randomization does not justify the models, bias is likely; nor are the usual variance calculations to be trusted. Here, we evaluate regression adjustments using Neyman's…

Applications · Statistics 2008-12-18 David A. Freedman

Inference for biased models: a quasi-instrumental variable approach

For linear regression models who are not exactly sparse in the sense that the coefficients of the insignificant variables are not exactly zero, the working models obtained by a variable selection are often biased. Even in sparse cases,…

Methodology · Statistics 2014-07-17 Lu Lin , Lixing Zhu , Yujie Gai

Coping with Selection Effects: A Primer on Regression with Truncated Data

The finite sensitivity of instruments or detection methods means that data sets in many areas of astronomy, for example cosmological or exoplanet surveys, are necessarily systematically incomplete. Such data sets, where the population being…

Instrumentation and Methods for Astrophysics · Physics 2020-10-14 Adam B. Mantz

Estimation of the shift parameter in regression models with unknown distribution of the observations

This paper is devoted to the estimation of the shift parameter in a semiparametric regression model when the distribution of the observation times is unknown. Hence, we propose to use a stochastic algorithm which takes into account the…

Statistics Theory · Mathematics 2013-12-23 Philippe Fraysse

Dealing with missing data using attention and latent space regularization

Most practical data science problems encounter missing data. A wide variety of solutions exist, each with strengths and weaknesses that depend upon the missingness-generating process. Here we develop a theoretical framework for training and…

Machine Learning · Computer Science 2022-11-15 Jahan C. Penny-Dimri , Christoph Bergmeir , Julian Smith

Evidence synthesis for count distributions based on heterogeneous and incomplete aggregated data

The analysis of count data is commonly done using Poisson models. Negative binomial models are a straightforward and readily motivated generalization for the case of overdispersed data, i.e., when the observed variance is greater than…

Methodology · Statistics 2016-01-06 Christian Röver , Stefan Andreas , Tim Friede

Regression with Sensor Data Containing Incomplete Observations

This paper addresses a regression problem in which output label values are the results of sensing the magnitude of a phenomenon. A low value of such labels can mean either that the actual magnitude of the phenomenon was low or that the…

Machine Learning · Computer Science 2023-06-01 Takayuki Katsuki , Takayuki Osogami

Targeted Principal Components Regression

We propose a principal components regression method based on maximizing a joint pseudo-likelihood for responses and predictors. Our method uses both responses and predictors to select linear combinations of the predictors relevant for the…

Methodology · Statistics 2021-08-10 Karl Oskar Ekvall

Regression adjustment in covariate-adaptive randomized experiments with missing covariates

Covariate-adaptive randomization is widely used in clinical trials to balance prognostic factors, and regression adjustments are often adopted to further enhance the estimation and inference efficiency. In practice, the covariates may…

Methodology · Statistics 2025-08-15 Wanjia Fu , Yingying Ma , Hanzhong Liu

Reparametrization of COM-Poisson Regression Models with Applications in the Analysis of Experimental Data

In the analysis of count data often the equidispersion assumption is not suitable, hence the Poisson regression model is inappropriate. As a generalization of the Poisson distribution, the COM-Poisson distribution can deal with under-,…

Applications · Statistics 2018-01-31 Eduardo E. Ribeiro , Walmes M. Zeviani , Wagner H. Bonat , Clarice G. B. Demétrio , John Hinde

Handling Sparse Non-negative Data in Finance

We show that Poisson regression, though often recommended over log-linear regression for modeling count and other non-negative variables in finance and economics, can be far from optimal when heteroskedasticity and sparsity -- two common…

Econometrics · Economics 2025-09-03 Agostino Capponi , Zhaonan Qu

Exact balanced random imputation for sample survey data

Surveys usually suffer from non-response, which decreases the effective sample size. Item non-response is typically handled by means of some form of random imputation if we wish to preserve the distribution of the imputed variable. This…

Methodology · Statistics 2017-08-04 Guillaume Chauvet , Wilfried Do Paco