Related papers: Sequential Unequal Probability Sampling For Stream…

Stream Sampling with Immediate Decision

The manuscript introduces a method to select a random sample from a stream by deciding on each sampling unit immediately after observing it. The process could be applied to unequal as well as equal probability sampling. The implementation…

Data Structures and Algorithms · Computer Science 2021-11-19 Bardia Panahbehagh , Raphaël Jauslin , Yves Tillé

Sequential Spatially Balanced Sampling

Sequential sampling occurs when the entire population is not known in advance and data are obtained one at a time or in groups of units. This manuscript proposes a new algorithm to sequentially select a balanced sample. The algorithm…

Methodology · Statistics 2023-01-04 Raphaël Jauslin , Bardia Panahbehagh , Yves Tillé

Estimating Coverage in Streams via a Modified CVM Method

When individuals in a population can be classified in classes or categories, the coverage of a sample, $C$, is defined as the probability that a randomly selected individual from the population belongs to a class represented in the sample.…

Computation · Statistics 2025-04-08 Carlos Hernandez-Suarez

Parallel Streaming Random Sampling

This paper investigates parallel random sampling from a potentially-unending data stream whose elements are revealed in a series of element sequences (minibatches). While sampling from a stream was extensively studied sequentially, not much…

Data Structures and Algorithms · Computer Science 2019-06-11 Kanat Tangwongsan , Srikanta Tirthapura

Weighted Reservoir Sampling With Replacement from Data Streams

In this work, we present a new random sampling method for data streams where the probability of an element's inclusion in the sample is proportional to a weight associated with that element. Our method is based on sampling with replacement,…

Data Structures and Algorithms · Computer Science 2026-03-18 Adriano Meligrana , Adriano Fazzone

Quasi-Systematic Sampling From a Continuous Population

A specific family of point processes are introduced that allow to select samples for the purpose of estimating the mean or the integral of a function of a real variable. These processes, called quasi-systematic processes, depend on a tuning…

Methodology · Statistics 2016-07-19 Matthieu Wilhelm , Yves Tillé , Lionel Qualité

Sampling Designs on Finite Populations with Spreading Control Parameters

We present new sampling methods in finite population that allow to control the joint inclusion probabilities of units and especially the spreading of sampled units in the population. They are based on the use of renewal chains and…

Methodology · Statistics 2017-04-12 Yves Tillé , Lionel Qualité , Matthieu Wilhelm

Stratified Random Sampling for Dependent Inputs

A new approach of obtaining stratified random samples from statistically dependent random variables is described. The proposed method can be used to obtain samples from the input space of a computer forward model in estimating expectations…

Methodology · Statistics 2019-11-25 Anirban Mondal , Abhijit Mandal

A method to find an efficient and robust sampling strategy under model uncertainty

We consider the problem of deciding on sampling strategy, in particular sampling design. We propose a risk measure, whose minimizing value guides the choice. The method makes use of a superpopulation model and takes into account uncertainty…

Methodology · Statistics 2020-07-06 Edgar Bueno , Dan Hedlin

Consistent Sampling with Replacement

We describe a very simple method for `consistent sampling' that allows for sampling with replacement. The method extends previous approaches to consistent sampling, which assign a pseudorandom real number to each element, and sample those…

Data Structures and Algorithms · Computer Science 2018-08-31 Ronald L. Rivest

Compound Sequential Change-point Detection in Parallel Data Streams

We consider sequential change-point detection in parallel data streams, where each stream has its own change point. Once a change is detected in a data stream, this stream is deactivated permanently. The goal is to maximize the normal…

Statistics Theory · Mathematics 2021-07-15 Yunxiao Chen , Xiaoou Li

A Decision Method for Elementary Stream Calculus

The main result is a doubly exponential decision procedure for the first-order equality theory of streams with both arithmetic and control-oriented stream operations. This stream logic is expressive for elementary problems of stream…

Logic in Computer Science · Computer Science 2024-01-05 Harald Ruess

Nested Sampling for Uncertainty Quantification and Rare Event Estimation

Nested Sampling is a method for computing the Bayesian evidence, also called the marginal likelihood, which is the integral of the likelihood with respect to the prior. More generally, it is a numerical probabilistic quadrature rule. The…

Computation · Statistics 2023-10-09 Jonas Latz , Doris Schneider , Philipp Wacker

A New Sampling Method Base on Sequential Tests with Fixed Sample Size Upper Limit

Sequential inspection is a technique employed to monitor product quality during the production process. For smaller batch sizes, the Acceptable Quality Limit(AQL) inspection theory is typically applied, whereas for larger batch sizes, the…

Methodology · Statistics 2024-12-17 Dihong Huang

Distinct Sampling on Streaming Data with Near-Duplicates

In this paper we study how to perform distinct sampling in the streaming model where data contain near-duplicates. The goal of distinct sampling is to return a distinct element uniformly at random from the universe of elements, given that…

Data Structures and Algorithms · Computer Science 2018-10-31 Jiecao Chen , Qin Zhang

Active Sampling of Multiple Sources for Sequential Estimation

Consider $K$ processes, each generating a sequence of identical and independent random variables. The probability measures of these processes have random parameters that must be estimated. Specifically, they share a parameter $\theta$…

Machine Learning · Computer Science 2022-10-12 Arpan Mukherjee , Ali Tajer , Pin-Yu Chen , Payel Das

Resampling: an improvement of Importance Sampling in varying population size models

Sequential importance sampling algorithms have been defined to estimate likelihoods in models of ancestral population processes. However, these algorithms are based on features of the models with constant population size, and become…

Statistics Theory · Mathematics 2016-03-24 Coralie Merle , Raphaël Leblois , François Rousset , Pierre Pudlo

A Case Study on Sampling Strategies for Evaluating Neural Sequential Item Recommendation Models

At the present time, sequential item recommendation models are compared by calculating metrics on a small item subset (target set) to speed up computation. The target set contains the relevant item and a set of negative items that are…

Information Retrieval · Computer Science 2021-07-29 Alexander Dallmann , Daniel Zoller , Andreas Hotho

A Bayesian Approach to the Partitioning of Workflows

When partitioning workflows in realistic scenarios, the knowledge of the processing units is often vague or unknown. A naive approach to addressing this issue is to perform many controlled experiments for different workloads, each…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-11-03 Freddy C. Chua , Bernardo A. Huberman

A subsampling approach for large data sets when the Generalised Linear Model is potentially misspecified

Subsampling is a computationally efficient and scalable method to draw inference in large data settings based on a subset of the data rather than needing to consider the whole dataset. When employing subsampling techniques, a crucial…

Methodology · Statistics 2025-10-08 Amalan Mahendran , Helen Thompson , James M. McGree