English
Related papers

Related papers: Learning Poisson Binomial Distributions

200 papers

We give an algorithm for properly learning Poisson binomial distributions. A Poisson binomial distribution (PBD) of order $n$ is the discrete probability distribution of the sum of $n$ mutually independent Bernoulli random variables. Given…

Data Structures and Algorithms · Computer Science 2015-11-13 Ilias Diakonikolas , Daniel M. Kane , Alistair Stewart

We introduce the problem of simultaneously learning all powers of a Poisson Binomial Distribution (PBD). A PBD of order $n$ is the distribution of a sum of $n$ mutually independent Bernoulli random variables $X_i$, where $\mathbb{E}[X_i] =…

Data Structures and Algorithms · Computer Science 2017-07-19 Dimitris Fotakis , Vasilis Kontonis , Piotr Krysta , Paul Spirakis

A Poisson Binomial distribution over $n$ variables is the distribution of the sum of $n$ independent Bernoullis. We provide a sample near-optimal algorithm for testing whether a distribution $P$ supported on $\{0,...,n\}$ to which we have…

Data Structures and Algorithms · Computer Science 2014-10-15 Jayadev Acharya , Constantinos Daskalakis

We consider the problem of learning an unknown product distribution $X$ over $\{0,1\}^n$ using samples $f(X)$ where $f$ is a \emph{known} transformation function. Each choice of a transformation function $f$ specifies a learning problem in…

Machine Learning · Computer Science 2011-03-04 Constantinos Daskalakis , Ilias Diakonikolas , Rocco A. Servedio

The Poisson-binomial distribution is useful in many applied problems in engineering, actuarial science, and data mining. The Poisson-binomial distribution models the distribution of the sum of independent but not identically distributed…

Computation · Statistics 2017-02-07 Man Zhang , Yili Hong , Narayanaswamy Balakrishnan

An $(n, k)$-Poisson Multinomial Distribution (PMD) is a random variable of the form $X = \sum_{i=1}^n X_i$, where the $X_i$'s are independent random vectors supported on the set of standard basis vectors in $\mathbb{R}^k.$ In this paper, we…

Data Structures and Algorithms · Computer Science 2016-06-23 Ilias Diakonikolas , Daniel M. Kane , Alistair Stewart

The Poisson multinomial distribution (PMD) describes the distribution of the sum of $n$ independent but non-identically distributed random vectors, in which each random vector is of length $m$ with 0/1 valued elements and only one of its…

Computation · Statistics 2022-01-13 Zhengzhi Lin , Yueyao Wang , Yili Hong

A $k$-modal probability distribution over the discrete domain $\{1,...,n\}$ is one whose histogram has at most $k$ "peaks" and "valleys." Such distributions are natural generalizations of monotone ($k=0$) and unimodal ($k=1$) probability…

Data Structures and Algorithms · Computer Science 2014-09-16 Constantinos Daskalakis , Ilias Diakonikolas , Rocco A. Servedio

Given i.i.d.~samples from an unknown distribution $P$, the goal of distribution learning is to recover the parameters of a distribution that is close to $P$. When $P$ belongs to the class of product distributions on the Boolean hypercube…

Machine Learning · Computer Science 2025-11-14 Arnab Bhattacharyya , Davin Choo , Philips George John , Themis Gouleakis

We study the problem, introduced by Qiao and Valiant, of learning from untrusted batches. Here, we assume $m$ users, all of whom have samples from some underlying distribution $p$ over $1, \ldots, n$. Each user sends a batch of $k$ i.i.d.…

Data Structures and Algorithms · Computer Science 2019-11-07 Sitan Chen , Jerry Li , Ankur Moitra

We revisit the problem of distribution learning within the framework of learning-augmented algorithms. In this setting, we explore the scenario where a probability distribution is provided as potentially inaccurate advice on the true,…

Machine Learning · Computer Science 2025-02-03 Arnab Bhattacharyya , Davin Choo , Philips George John , Themis Gouleakis

An $(n,k)$-Poisson Multinomial Distribution (PMD) is the distribution of the sum of $n$ independent random vectors supported on the set ${\cal B}_k=\{e_1,\ldots,e_k\}$ of standard basis vectors in $\mathbb{R}^k$. We prove a structural…

Data Structures and Algorithms · Computer Science 2015-11-24 Constantinos Daskalakis , Gautam Kamath , Christos Tzamos

The question of polynomial learnability of probability distributions, particularly Gaussian mixture distributions, has recently received significant attention in theoretical computer science and machine learning. However, despite major…

Machine Learning · Computer Science 2010-05-13 Mikhail Belkin , Kaushik Sinha

We consider the problem of PAC-learning decision trees, i.e., learning a decision tree over the n-dimensional hypercube from independent random labeled examples. Despite significant effort, no polynomial-time algorithm is known for learning…

Machine Learning · Computer Science 2008-12-05 Adam Tauman Kalai , Shang-Hua Teng

We study the problem of learning from unlabeled samples very general statistical mixture models on large finite sets. Specifically, the model to be learned, $\vartheta$, is a probability distribution over probability distributions $p$,…

Machine Learning · Computer Science 2015-04-13 Jian Li , Yuval Rabani , Leonard J. Schulman , Chaitanya Swamy

We consider the following basic learning task: given independent draws from an unknown distribution over a discrete support, output an approximation of the distribution that is as accurate as possible in $\ell_1$ distance (i.e. total…

Machine Learning · Computer Science 2015-11-12 Gregory Valiant , Paul Valiant

We study the general problem of testing whether an unknown distribution belongs to a specified family of distributions. More specifically, given a distribution family $\mathcal{P}$ and sample access to an unknown discrete distribution…

Data Structures and Algorithms · Computer Science 2017-08-09 Clément L. Canonne , Ilias Diakonikolas , Alistair Stewart

The Wasserstein distance has emerged as a key metric to quantify distances between probability distributions, with applications in various fields, including machine learning, control theory, decision theory, and biological systems.…

Machine Learning · Computer Science 2026-02-10 Eduardo Figueiredo , Steven Adams , Luca Laurenti

We give a highly efficient "semi-agnostic" algorithm for learning univariate probability distributions that are well approximated by piecewise polynomial density functions. Let $p$ be an arbitrary distribution over an interval $I$ which is…

Machine Learning · Computer Science 2013-05-15 Siu-On Chan , Ilias Diakonikolas , Rocco A. Servedio , Xiaorui Sun

We give an algorithm for learning a mixture of {\em unstructured} distributions. This problem arises in various unsupervised learning scenarios, for example in learning {\em topic models} from a corpus of documents spanning several topics.…

Machine Learning · Computer Science 2013-09-19 Yuval Rabani , Leonard Schulman , Chaitanya Swamy
‹ Prev 1 2 3 10 Next ›