English
Related papers

Related papers: Survey Data Integration for Distribution Function …

200 papers

Finite population inference is a central goal in survey sampling. Probability sampling is the main statistical approach to finite population inference. Challenges arise due to high cost and increasing non-response rates. Data integration…

Methodology · Statistics 2020-01-13 Shu Yang , Jae Kwang Kim

In the age of big data, nonprobability surveys are becoming increasingly abundant. Data integration techniques involving both probability and nonprobability surveys are being extensively used for providing improved estimates for finite…

Applications · Statistics 2025-10-17 Aditi Sen , Partha Lahiri

In survey analysis, the estimation of the cumulative distribution function (cdf) is of great interest: it allows for instance to derive quantiles estimators or other non linear parameters derived from the cdf. We consider the case where the…

Methodology · Statistics 2014-04-14 Sandrine Casanova , Eve Leconte

A reduced-bias nonparametric estimator of the cumulative distribution function (CDF) and the survival function is proposed using infinite-order kernels. Fourier transform theory on generalized functions is utilized to obtain the improved…

Methodology · Statistics 2009-03-18 Arthur Berg , Dimitris N. Politis

The cumulative distribution function (CDF) is fundamental for characterizing random variables, making it essential in applications that require privacy-preserving data analysis. This paper introduces a novel framework for constructing…

Cryptography and Security · Computer Science 2026-03-13 Ye Tao , Anand D. Sarwate

Multiple data sources are becoming increasingly available for statistical analyses in the era of big data. As an important example in finite-population inference, we consider an imputation approach to combining a probability sample with big…

Methodology · Statistics 2018-07-10 Shu Yang , Jae Kwang Kim

Forward simulation-based uncertainty quantification that studies the distribution of quantities of interest (QoI) is a crucial component for computationally robust engineering design and prediction. There is a large body of literature…

Computation · Statistics 2023-07-07 Ruijian Han , Boris Kramer , Dongjin Lee , Akil Narayan , Yiming Xu

The estimation of cumulative distribution functions (CDF) is an important learning task with a great variety of downstream applications, such as risk assessments in predictions and decision making. In this paper, we study functional…

Machine Learning · Computer Science 2024-03-11 Qian Zhang , Anuran Makur , Kamyar Azizzadenesheli

We study nonparametric estimation of univariate cumulative distribution functions (CDFs) pertaining to data missing at random. The proposed estimators smooth the inverse probability weighted (IPW) empirical CDF with the Bernstein operator,…

Statistics Theory · Mathematics 2026-03-30 Rihab Gharbi , Wissem Jedidi , Salah Khardani , Frédéric Ouimet

Normalizing flows model a complex target distribution in terms of a bijective transform operating on a simple base distribution. As such, they enable tractable computation of a number of important statistical quantities, particularly…

Machine Learning · Computer Science 2022-09-01 Chandramouli Shama Sastry , Andreas Lehrmann , Marcus Brubaker , Alexander Radovic

The use of big data in official statistics and the applied sciences is accelerating, but statistics computed using only big data often suffer from substantial selection bias. This leads to inaccurate estimation and invalid statistical…

Methodology · Statistics 2023-08-11 Ryan Covey , Lucca Buonamano

Learning the multivariate distribution of data is a core challenge in statistics and machine learning. Traditional methods aim for the probability density function (PDF) and are limited by the curse of dimensionality. Modern neural methods…

Machine Learning · Statistics 2022-10-14 Magda Amiridi , Nicholas D. Sidiropoulos

This paper considers the problem of estimating the cumulative distribution function and probability density function of a random variable using data quantized by uniform and non-uniform quantizers. A simple estimator is proposed based on…

Signal Processing · Electrical Eng. & Systems 2018-05-03 Paolo Carbone , Johan Schoukens , István Kollár , Antonio Moschitta

In this paper we study predictive mean matching mass imputation estimators to integrate data from probability and non-probability samples. We consider two approaches: matching predicted to predicted ($\hat{y}-\hat{y}$~matching; PMM A) and…

Methodology · Statistics 2024-06-18 Piotr Chlebicki , Łukasz Chrostowski , Maciej Beręsewicz

A quantile is defined as a value below which random draws from a given distribution falls with a given probability. In a centralized setting where the cumulative distribution function (CDF) is unknown, the empirical CDF (ECDF) can be used…

Systems and Control · Computer Science 2018-05-02 Jongmin Lee , Cihan Tepedelenlioglu , Andreas Spanias

The statistical challenges in using big data for making valid statistical inference in the finite population have been well documented in literature. These challenges are due primarily to statistical bias arising from under-coverage in the…

Methodology · Statistics 2020-06-19 Jae-kwang Kim , Siu-Ming Tam

We propose a method for finding a cumulative distribution function (cdf) that minimizes the distance to a given cdf, while belonging to an ambiguity set constructed relative to another cdf and, possibly, incorporating soft information. Our…

Optimization and Control · Mathematics 2024-08-23 Julio Deride , Johannes O. Royset , Fernanda Urrea

We consider the problem of evaluating the cumulative distribution function (CDF) of the sum of order statistics, which serves to compute outage probability (OP) values at the output of generalized selection combining receivers. Generally,…

Computation · Statistics 2017-11-15 Nadhir Ben Rached , Zdravko Botev , Abla Kammoun , Mohamed-Slim Alouini , Raul Tempone

Doubly robust estimators combine an inverse probability weighting estimator and a mass imputation estimator. Several doubly robust estimators for estimating the population mean (or prevalence) of an outcome have been proposed for…

Methodology · Statistics 2025-08-11 Shaun R Seaman , Tommy Nyberg , Anne M Presanis

The aim of survey statistics is to produce estimates with a minimal bias and a corresponding acceptable variance given a specific budget, preferable with a minor response burden for the participants. In recent years, considerable efforts…

Methodology · Statistics 2026-04-02 Martin Hyllienmark , Gustaf Strandell
‹ Prev 1 2 3 10 Next ›