Related papers: Estimating a probability mass function with unknow…

Estimating Maximally Probable Constrained Relations by Mathematical Programming

Estimating a constrained relation is a fundamental problem in machine learning. Special cases are classification (the problem of estimating a map from a set of to-be-classified elements to a set of labels), clustering (the problem of…

Machine Learning · Computer Science 2014-08-06 Lizhen Qu, Bjoern Andres

Estimating the Missing Mass, Partition Function or Evidence for a Case of Sampling from a Discrete Set

We consider the problem of estimating the missing mass, partition function or evidence and its probability distribution in the case that for each sample point in the discrete sample space its (unnormalized) probability mass is revealed.…

Statistics Theory · Mathematics 2026-03-16 Bastiaan J. Braams

Nonparametric Divergence Estimation with Applications to Machine Learning on Distributions

Low-dimensional embedding, manifold learning, clustering, classification, and anomaly detection are among the most important problems in machine learning. The existing methods usually consider the case when each instance has a fixed,…

Machine Learning · Computer Science 2012-02-20 Barnabas Poczos, Liang Xiong, Jeff Schneider

Instability, Computational Efficiency and Statistical Accuracy

Many statistical estimators are defined as the fixed point of a data-dependent operator, with estimators based on minimizing a cost function being an important special case. The limiting performance of such estimators depends on the…

Machine Learning · Computer Science 2022-03-22 Nhat Ho, Koulik Khamaru, Raaz Dwivedi, Martin J. Wainwright, Michael I. Jordan, Bin Yu

Nonparametric maximum likelihood estimation under a likelihood ratio order

Comparison of two univariate distributions based on independent samples from them is a fundamental problem in statistics, with applications in a wide variety of scientific disciplines. In many situations, we might hypothesize that the two…

Methodology · Statistics 2021-07-08 Ted Westling, Kevin J. Downes, Dylan S. Small

A Targeted Approach to Confounder Selection for High-Dimensional Data

We consider the problem of selecting confounders for adjustment from a potentially large set of covariates, when estimating a causal effect. Recently, the high-dimensional Propensity Score (hdPS) method was developed for this task; hdPS…

Methodology · Statistics 2021-12-17 Asad Haris, Robert Platt

Practical and Consistent Estimation of f-Divergences

The estimation of an f-divergence between two probability distributions based on samples is a fundamental problem in statistics and machine learning. Most works study this problem under very weak assumptions, in which case it is provably…

Machine Learning · Statistics 2019-10-25 Paul K. Rubenstein, Olivier Bousquet, Josip Djolonga, Carlos Riquelme, Ilya Tolstikhin

Parameter Identification in a Probabilistic Setting

Parameter identification problems are formulated in a probabilistic language, where the randomness reflects the uncertainty about the knowledge of the true values. This setting allows conceptually easily to incorporate new information, e.g.…

Numerical Analysis · Computer Science 2013-03-19 Bojana V. Rosić, Anna Kučerová, Jan Sýkora, Oliver Pajonk, Alexander Litvinenko, Hermann G. Matthies

Progressively Sampled Equality-Constrained Optimization

An algorithm is proposed, analyzed, and tested for solving continuous nonlinear-equality-constrained optimization problems where the objective and constraint functions are defined by expectations or averages over large, finite numbers of…

Optimization and Control · Mathematics 2026-05-14 Frank E. Curtis, Lingjun Guo, Daniel P. Robinson

Adaptive Multinomial Matrix Completion

The task of estimating a matrix given a sample of observed entries is known as the matrix completion problem. Most works on matrix completion have focused on recovering an unknown real-valued low-rank matrix from a random sample of…

Statistics Theory · Mathematics 2014-08-27 Olga Klopp, Jean Lafond, Eric Moulines, Joseph Salmon

Bounding the Test Log-Likelihood of Generative Models

Several interesting generative learning algorithms involve a complex probability distribution over many random variables, involving intractable normalization constants or latent variable normalization. Some of them may even not have an…

Machine Learning · Computer Science 2014-05-13 Yoshua Bengio, Li Yao, Kyunghyun Cho

Nonparametric Score Estimators

Estimating the score, i.e., the gradient of log density function, from a set of samples generated by an unknown distribution is a fundamental task in inference and learning of probabilistic models that involve flexible yet intractable…

Machine Learning · Statistics 2020-07-01 Yuhao Zhou, Jiaxin Shi, Jun Zhu

Phase Transition Unbiased Estimation in High Dimensional Settings

An important challenge in statistical analysis concerns the control of the finite sample bias of estimators. For example, the maximum likelihood estimator has a bias that can result in a significant inferential loss. This problem is…

Statistics Theory · Mathematics 2019-11-04 Stéphane Guerrier, Mucyo Karemera, Samuel Orso, Maria-Pia Victoria-Feser

Pivotal Estimation via Self-Normalization for High-Dimensional Linear Models with Error in Variables

We propose a new estimator for the high-dimensional linear regression model with observation error in the design where the number of coefficients is potentially larger than the sample size. The main novelty of our procedure is that the…

Methodology · Statistics 2019-09-09 Alexandre Belloni, Abhishek Kaul, Mathieu Rosenbaum

Estimating the unseen from multiple populations

Given samples from a distribution, how many new elements should we expect to find if we continue sampling this distribution? This is an important and actively studied problem, with many applications ranging from unseen species estimation to…

Machine Learning · Computer Science 2017-07-14 Aditi Raghunathan, Greg Valiant, James Zou

Unbiased Parameter Estimation for Bayesian Inverse Problems

In this paper we consider the estimation of unknown parameters in Bayesian inverse problems. In most cases of practical interest, there are several barriers to performing such estimation. This includes a numerical approximation of a…

Methodology · Statistics 2025-02-07 Neil K. Chada, Ajay Jasra, Mohamed Maama, Raul Tempone

Deep Probability Estimation

Reliable probability estimation is of crucial importance in many real-world applications where there is inherent (aleatoric) uncertainty. Probability-estimation models are trained on observed outcomes (e.g. whether it has rained or not, or…

Machine Learning · Computer Science 2023-02-09 Sheng Liu, Aakash Kaku, Weicheng Zhu, Matan Leibovich, Sreyas Mohan, Boyang Yu, Haoxiang Huang, Laure Zanna, Narges Razavian, Jonathan Niles-Weed, Carlos Fernandez-Granda

Nonparametric Inference on Unlabeled Histograms

Statistical inference on histograms and frequency counts plays a central role in categorical data analysis. Moving beyond classical methods that directly analyze labeled frequencies, we introduce a framework that models the multiset of…

Statistics Theory · Mathematics 2025-11-10 Yun Ma, Pengkun Yang

A review of predictive uncertainty estimation with machine learning

Predictions and forecasts of machine learning models should take the form of probability distributions, aiming to increase the quantity of information communicated to end users. Although applications of probabilistic prediction and…

Machine Learning · Statistics 2024-03-19 Hristos Tyralis, Georgia Papacharalampous

A Family of Computationally Efficient and Simple Estimators for Unnormalized Statistical Models

We introduce a new family of estimators for unnormalized statistical models. Our family of estimators is parameterized by two nonlinear functions and uses a single sample from an auxiliary distribution, generalizing Maximum Likelihood Monte…

Machine Learning · Computer Science 2012-03-19 Miika Pihlaja, Michael Gutmann, Aapo Hyvarinen