Related papers: Optimal Approximate Sampling from Discrete Probabi…

200 papers

We study the problem of generating a random variate $X$ from a finite discrete probability distribution $P$ using an entropy source of independent fair coin flips. A classic result from Knuth and Yao shows that the optimal expected number…

Data Structures and Algorithms · Computer Science 2026-04-28 Thomas L. Draper, Feras A. Saad

An algorithm for sampling exactly from the normal distribution is given. The algorithm reads some number of uniformly distributed random digits in a given base and generates an initial portion of the representation of a normal deviate in…

Computational Physics · Physics 2016-02-01 Charles F. F. Karney

In this paper we study the problem of computing max-entropy distributions over a discrete set of objects subject to observed marginals. Interest in such distributions arises due to their applicability in areas such as statistical physics,…

Data Structures and Algorithms · Computer Science 2013-05-02 Mohit Singh, Nisheeth K. Vishnoi

We consider the problem of hypothesis testing for discrete distributions. In the standard model, where we have sample access to an underlying distribution $p$, extensive research has established optimal bounds for uniformity testing,…

Machine Learning · Computer Science 2024-12-03 Maryam Aliakbarpour, Piotr Indyk, Ronitt Rubinfeld, Sandeep Silwal

A technique introduced by Indyk and Woodruff [STOC 2005] has inspired several recent advances in data-stream algorithms. We show that a number of these results follow easily from the application of a single probabilistic method called…

Data Structures and Algorithms · Computer Science 2011-04-26 Alexandr Andoni, Robert Krauthgamer, Krzysztof Onak

We consider the problem of learning a discrete distribution in the presence of an $\epsilon$ fraction of malicious data sources. Specifically, we consider the setting where there is some underlying distribution, $p$, and each data source…

Machine Learning · Computer Science 2017-11-23 Mingda Qiao, Gregory Valiant

Herding is a deterministic algorithm used to generate data points that can be regarded as random samples satisfying input moment conditions. The algorithm is based on the complex behavior of a high-dimensional dynamical system and is…

Machine Learning · Statistics 2023-05-10 Hiroshi Yamashita, Hideyuki Suzuki, Kazuyuki Aihara

This article introduces an algorithm to draw random discrete uniform variables within a given range of size n from a source of random bits. The algorithm aims to be simple to implement and optimal both with regards to the amount of random…

Data Structures and Algorithms · Computer Science 2013-04-09 Jérémie Lumbroso

We initiate an investigation of private sampling from distributions. Given a dataset with $n$ independent observations from an unknown distribution $P$, a sampling algorithm must output a single observation from a distribution that is close…

Machine Learning · Computer Science 2022-11-16 Sofya Raskhodnikova, Satchit Sivakumar, Adam Smith, Marika Swanberg

This article studies the fundamental problem of using i.i.d. coin tosses from an entropy source to efficiently generate random variables $X_i \sim P_i$ $(i \ge 1)$, where $(P_1, P_2, \dots)$ is a random sequence of rational discrete…

Data Structures and Algorithms · Computer Science 2026-05-08 Thomas L. Draper, Feras A. Saad

The maximum entropy principle is a powerful tool for solving underdetermined inverse problems. This paper considers the problem of discretizing a continuous distribution, which arises in various applied fields. We obtain the approximating…

Numerical Analysis · Mathematics 2020-08-05 Ken'ichiro Tanaka, Alexis Akira Toda

In 2016, Karney proposed an exact sampling algorithm for the standard normal distribution. In this paper, we study the computational complexity of this algorithm under the random deviate model. Specifically, Karney's algorithm requires the…

Data Structures and Algorithms · Computer Science 2020-08-11 Yusong Du, Baoying Fan, Baodian Wei

Given a dataset of $n$ i.i.d. samples from an unknown distribution $P$, we consider the problem of generating a sample from a distribution that is close to $P$ in total variation distance, under the constraint of differential privacy (DP).…

Data Structures and Algorithms · Computer Science 2023-06-23 Badih Ghazi, Xiao Hu, Ravi Kumar, Pasin Manurangsi

Fast distributed algorithms that output a feasible solution for constraint satisfaction problems, such as maximal independent sets, have been heavily studied. There has been much less research on distributed sampling problems, where one…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-03-07 Sriram V. Pemmaraju, Joshua Z. Sobel

Given a collection of probability distributions $p_{1},\ldots,p_{m}$, the minimum entropy coupling is the coupling $X_{1},\ldots,X_{m}$ ($X_{i}\sim p_{i}$) with the smallest entropy $H(X_{1},\ldots,X_{m})$. While this problem is known to be…

Information Theory · Computer Science 2021-09-21 Cheuk Ting Li

This paper introduces a new algorithm for the fundamental problem of generating a random integer from a discrete probability distribution using a source of independent and unbiased random coin flips. We prove that this algorithm, which we…

Computation · Statistics 2020-07-03 Feras A. Saad, Cameron E. Freer, Martin C. Rinard, Vikash K. Mansinghka

The approximation of a discrete probability distribution $\mathbf{t}$ by an $M$-type distribution $\mathbf{p}$ is considered. The approximation error is measured by the informational divergence $\mathbb{D}(\mathbf{t}\Vert\mathbf{p})$, which…

Information Theory · Computer Science 2016-07-28 Bernhard C. Geiger, Georg Böcherer

We study the fundamental problem of sampling independent events, called subset sampling. Specifically, consider a set of $n$ events $S=\{x_1, \ldots, x_n\}$, where each event $x_i$ has an associated probability $p(x_i)$. The subset sampling…

Data Structures and Algorithms · Computer Science 2023-09-22 Lu Yi, Hanzhi Wang, Zhewei Wei

We study the problem of testing discrete distributions with a focus on the high probability regime. Specifically, given samples from one or more discrete distributions, a property $\mathcal{P}$, and parameters $0< \epsilon, \delta <1$, we…

Data Structures and Algorithms · Computer Science 2020-09-15 Ilias Diakonikolas, Themis Gouleakis, Daniel M. Kane, John Peebles, Eric Price

Scientific modeling applications often require estimating a distribution of parameters consistent with a dataset of observations - an inference task also known as source distribution estimation. This problem can be ill-posed, however, since…

Machine Learning · Computer Science 2024-12-02 Julius Vetter, Guy Moss, Cornelius Schröder, Richard Gao, Jakob H. Macke