English
Related papers

Related papers: Estimating the number of classes

200 papers

Point estimation of class prevalences in the presence of data set shift has been a popular research topic for more than two decades. Less attention has been paid to the construction of confidence and prediction intervals for estimates of…

Machine Learning · Statistics 2019-07-23 Dirk Tasche

We wish to estimate the total number of classes in a population based on sample counts, especially in the presence of high latent diversity. Drawing on probability theory that characterizes distributions on the integers by ratios of…

Methodology · Statistics 2014-12-10 A. Willis , J. Bunge

Estimating the size of an elusive target population is of prominent interest in many areas in the life and social sciences. Our aim is to provide an efficient and workable method to estimate the unknown population size, given the frequency…

Applications · Statistics 2011-07-28 Irene Rocchetti , John Bunge , Dankmar Böhning

In cases of uncertainty, a multi-class classifier preferably returns a set of candidate classes instead of predicting a single class label with little guarantee. More precisely, the classifier should strive for an optimal balance between…

Machine Learning · Computer Science 2020-05-28 Thomas Mortier , Marek Wydmuch , Krzysztof Dembczyński , Eyke Hüllermeier , Willem Waegeman

We investigate a Poisson sampling design in the presence of unknown selection probabilities when applied to a population of unknown size for multiple sampling occasions. The fixed-population model is adopted and extended upon for inference.…

Methodology · Statistics 2020-01-30 Kyle Vincent , Saman Muthukumarana

We study the frequentist properties of Bayesian statistical inference for the stochastic block model, with an unknown number of classes of varying sizes. We equip the space of vertex labellings with a prior on the number of classes and,…

Statistics Theory · Mathematics 2020-05-05 J. van Waaij , B. J. K. Kleijn

Probabilities in the multiverse can be calculated by assuming that we are typical representatives in a given reference class. But is this class well defined? What should be included in the ensemble in which we are supposed to be typical?…

High Energy Physics - Theory · Physics 2008-11-26 Jaume Garriga , Alexander Vilenkin

When the cost of misclassifying a sample is high, it is useful to have an accurate estimate of uncertainty in the prediction for that sample. There are also multiple types of uncertainty which are best estimated in different ways, for…

Machine Learning · Computer Science 2019-03-18 Richard Harang , Ethan M. Rudd

The correct use and interpretation of models depends on several steps, two of which being the calibration by parameter estimation and the analysis of uncertainty. In the biological literature, these steps are seldom discussed together, but…

Quantitative Methods · Quantitative Biology 2015-08-17 André Chalom , Paulo Inácio de Knegt López de Prado

Estimating prevalence, the fraction of a population with a certain medical condition, is fundamental to epidemiology. Traditional methods rely on classification of test samples taken at random from a population. Such approaches to…

Methodology · Statistics 2022-03-25 Paul Patrone , Anthony Kearsley

Probabilistic classifiers output a probability distribution on target classes rather than just a class prediction. Besides providing a clear separation of prediction and decision making, the main advantage of probabilistic models is their…

Machine Learning · Computer Science 2019-02-20 Juozas Vaicenavicius , David Widmann , Carl Andersson , Fredrik Lindsten , Jacob Roll , Thomas B. Schön

While the accuracy of modern deep learning models has significantly improved in recent years, the ability of these models to generate uncertainty estimates has not progressed to the same degree. Uncertainty methods are designed to provide…

Machine Learning · Statistics 2020-06-17 Adam M. Oberman , Chris Finlay , Alexander Iannantuono , Tiago Salvador

We ask: Can focusing on likely classes of a single, in-domain sample improve model predictions? Prior work argued ``no''. We put forward a novel rationale in favor of ``yes'': Sharedness of features among classes indicates their reliability…

Machine Learning · Computer Science 2025-12-23 Johannes Schneider

We exploit a suitable moment-based characterization of the mixture of Poisson distribution for developing Bayesian inference for the unknown size of a finite population whose units are subject to multiple occurrences during an enumeration…

Methodology · Statistics 2018-06-19 Danilo Alunni Fegatelli , Luca Tardella

The missing mass refers to the proportion of data points in an unknown population of classifier inputs that belong to classes not present in the classifier's training data, which is assumed to be a random sample from that unknown…

Machine Learning · Computer Science 2025-03-11 Seongmin Lee , Marcel Böhme

The number of species can be estimated by sampling individuals from a species assemblage. The problem of estimating generalized species accumulation curve is addressed in a nonparametric Poisson mixture model. A likelihood-based estimator…

Statistics Theory · Mathematics 2007-06-13 Chang Xuan Mao

Class imbalance poses a significant challenge in classification tasks, where traditional approaches often lead to biased models and unreliable predictions. Undersampling and oversampling techniques have been commonly employed to address…

Machine Learning · Computer Science 2025-10-22 Matt Clifford , Jonathan Erskine , Alexander Hepburn , Raúl Santos-Rodríguez , Dario Garcia-Garcia

The availability of high-throughput parallel methods for sequencing microbial communities is increasing our knowledge of the microbial world at an unprecedented rate. Though most attention has focused on determining lower-bounds on the…

Methodology · Statistics 2011-09-15 Manuel Lladser , Raúl Gouet , Jens Reeder

We develop a theory of estimation when in addition to a sample of $n$ observed outcomes the underlying probabilities of the observed outcomes are known, as is typically the case in the context of numerical simulation modeling, e.g. in…

Methodology · Statistics 2023-04-14 Jobst Heitzig

In this paper we develop a very general class of bivariate discrete distributions. The basic idea is very simple. The marginals are obtained by taking the random geometric sum of a baseline distribution function. The proposed class of…

Methodology · Statistics 2018-05-22 Debasis Kundu
‹ Prev 1 2 3 10 Next ›