Related papers: Incremental Sampling Without Replacement for Seque…

Intelligent sampling for surrogate modeling, hyperparameter optimization, and data analysis

Sampling techniques are used in many fields, including design of experiments, image processing, and graphics. The techniques in each field are designed to meet the constraints specific to that field such as uniform coverage of the range of…

Machine Learning · Computer Science 2023-06-08 Chandrika Kamath

Accelerating Machine Learning Algorithms with Adaptive Sampling

The era of huge data necessitates highly efficient machine learning algorithms. Many common machine learning algorithms, however, rely on computationally intensive subroutines that are prohibitively expensive on large datasets. Oftentimes,…

Machine Learning · Computer Science 2023-09-26 Mo Tiwari

Incremental Without Replacement Sampling in Nonconvex Optimization

Minibatch decomposition methods for empirical risk minimization are commonly analysed in a stochastic approximation setting, also known as sampling with replacement. On the other hands modern implementations of such techniques are…

Machine Learning · Computer Science 2023-01-09 Edouard Pauwels

Consistent Sampling with Replacement

We describe a very simple method for `consistent sampling' that allows for sampling with replacement. The method extends previous approaches to consistent sampling, which assign a pseudorandom real number to each element, and sample those…

Data Structures and Algorithms · Computer Science 2018-08-31 Ronald L. Rivest

Simple, Optimal Algorithms for Random Sampling Without Replacement

Consider the fundamental problem of drawing a simple random sample of size k without replacement from [n] := {1, . . . , n}. Although a number of classical algorithms exist for this problem, we construct algorithms that are even simpler,…

Data Structures and Algorithms · Computer Science 2021-04-13 Daniel Ting

Particle Guidance: non-I.I.D. Diverse Sampling with Diffusion Models

In light of the widespread success of generative models, a significant amount of research has gone into speeding up their sampling time. However, generative models are often sampled multiple times to obtain a diverse set incurring a cost…

Machine Learning · Computer Science 2023-11-27 Gabriele Corso , Yilun Xu , Valentin de Bortoli , Regina Barzilay , Tommi Jaakkola

Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement

The well-known Gumbel-Max trick for sampling from a categorical distribution can be extended to sample $k$ elements without replacement. We show how to implicitly apply this 'Gumbel-Top-$k$' trick on a factorized distribution over…

Machine Learning · Computer Science 2019-05-31 Wouter Kool , Herke van Hoof , Max Welling

Automatic Recall Machines: Internal Replay, Continual Learning and the Brain

Replay in neural networks involves training on sequential data with memorized samples, which counteracts forgetting of previous behavior caused by non-stationarity. We present a method where these auxiliary samples are generated on the fly,…

Machine Learning · Computer Science 2020-12-15 Xu Ji , Joao Henriques , Tinne Tuytelaars , Andrea Vedaldi

A Scalable and Efficient Iterative Method for Copying Machine Learning Classifiers

Differential replication through copying refers to the process of replicating the decision behavior of a machine learning model using another model that possesses enhanced features and attributes. This process is relevant when external…

Machine Learning · Computer Science 2023-02-08 Nahuel Statuto , Irene Unceta , Jordi Nin , Oriol Pujol

Speculative Sampling for Parametric Temporal Point Processes

Temporal point processes are powerful generative models for event sequences that capture complex dependencies in time-series data. They are commonly specified using autoregressive models that learn the distribution of the next event from…

Machine Learning · Computer Science 2025-10-24 Marin Biloš , Anderson Schneider , Yuriy Nevmyvaka

On the Batch Size Selection in Stochastic Gradient Methods Using No-Replacement Sampling

Recent stochastic gradient methods that have appeared in the literature base their efficiency and global convergence properties on a suitable control of the variance of the gradient batch estimate. This control is typically achieved by…

Optimization and Control · Mathematics 2025-06-11 Marco Boresta , Alberto De Santis , Stefano Lucidi

Priority Sampling of Large Language Models for Compilers

Large language models show great potential in generating and optimizing code. Widely used sampling methods such as Nucleus Sampling increase the diversity of generation but often produce repeated samples for low temperatures and incoherent…

Machine Learning · Computer Science 2024-03-01 Dejan Grubisic , Chris Cummins , Volker Seeker , Hugh Leather

Automatic Reparameterisation of Probabilistic Programs

Probabilistic programming has emerged as a powerful paradigm in statistics, applied science, and machine learning: by decoupling modelling from inference, it promises to allow modellers to directly reason about the processes generating…

Machine Learning · Statistics 2019-06-10 Maria I. Gorinova , Dave Moore , Matthew D. Hoffman

CCDP: Composition of Conditional Diffusion Policies with Guided Sampling

Imitation Learning offers a promising approach to learn directly from data without requiring explicit models, simulations, or detailed task definitions. During inference, actions are sampled from the learned distribution and executed on the…

Robotics · Computer Science 2025-10-28 Amirreza Razmjoo , Sylvain Calinon , Michael Gienger , Fan Zhang

WOR and $p$'s: Sketches for $\ell_p$-Sampling Without Replacement

Weighted sampling is a fundamental tool in data analysis and machine learning pipelines. Samples are used for efficient estimation of statistics or as sparse representations of the data. When weight distributions are skewed, as is often the…

Machine Learning · Computer Science 2020-08-18 Edith Cohen , Rasmus Pagh , David P. Woodruff

Sequential Spatially Balanced Sampling

Sequential sampling occurs when the entire population is not known in advance and data are obtained one at a time or in groups of units. This manuscript proposes a new algorithm to sequentially select a balanced sample. The algorithm…

Methodology · Statistics 2023-01-04 Raphaël Jauslin , Bardia Panahbehagh , Yves Tillé

You May Not Need Order in Time Series Forecasting

Time series forecasting with limited data is a challenging yet critical task. While transformers have achieved outstanding performances in time series forecasting, they often require many training samples due to the large number of…

Machine Learning · Computer Science 2019-10-23 Yunkai Zhang , Qiao Jiang , Shurui Li , Xiaoyong Jin , Xueying Ma , Xifeng Yan

Optimal Sub-sampling with Influence Functions

Sub-sampling is a common and often effective method to deal with the computational challenges of large datasets. However, for most statistical models, there is no well-motivated approach for drawing a non-uniform subsample. We show that the…

Machine Learning · Statistics 2017-09-07 Daniel Ting , Eric Brochu

DiverseFlow: Sample-Efficient Diverse Mode Coverage in Flows

Many real-world applications of flow-based generative models desire a diverse set of samples that cover multiple modes of the target distribution. However, the predominant approach for obtaining diverse sets is not sample-efficient, as it…

Machine Learning · Computer Science 2025-04-11 Mashrur M. Morshed , Vishnu Boddeti

Importance Sampling via Score-based Generative Models

Importance sampling, which involves sampling from a probability density function (PDF) proportional to the product of an importance weight function and a base PDF, is a powerful technique with applications in variance reduction, biased or…

Machine Learning · Computer Science 2025-02-10 Heasung Kim , Taekyun Lee , Hyeji Kim , Gustavo de Veciana