Related papers: Robust Inference of Trees

Robust Feature Selection by Mutual Information Distributions

Mutual information is widely used in artificial intelligence, in a descriptive way, to measure the stochastic dependence of discrete random variables. In order to address questions such as the reliability of the empirical value, one must…

Artificial Intelligence · Computer Science 2008-06-26 Marco Zaffalon , Marcus Hutter

Robust Feature Selection by Mutual Information Distributions

Mutual information is widely used in artificial intelligence, in a descriptive way, to measure the stochastic dependence of discrete random variables. In order to address questions such as the reliability of the empirical value, one must…

Artificial Intelligence · Computer Science 2014-08-08 Marco Zaffalon , Marcus Hutter

Robust Estimators under the Imprecise Dirichlet Model

Walley's Imprecise Dirichlet Model (IDM) for categorical data overcomes several fundamental problems which other approaches to uncertainty suffer from. Yet, to be useful in practice, one needs efficient ways for computing the…

Probability · Mathematics 2007-07-16 Marcus Hutter

Practical Robust Estimators for the Imprecise Dirichlet Model

Walley's Imprecise Dirichlet Model (IDM) for categorical i.i.d. data extends the classical Dirichlet model to a set of priors. It overcomes several fundamental problems which other approaches to uncertainty suffer from. Yet, to be useful in…

Statistics Theory · Mathematics 2009-12-30 Marcus Hutter

Uncertainty Quantification in Ensembles of Honest Regression Trees using Generalized Fiducial Inference

Due to their accuracies, methods based on ensembles of regression trees are a popular approach for making predictions. Some common examples include Bayesian additive regression trees, boosting and random forests. This paper focuses on…

Methodology · Statistics 2019-11-15 Suofei Wu , Jan Hannig , Thomas C. M. Lee

Exact Non-Parametric Bayesian Inference on Infinite Trees

Given i.i.d. data from an unknown distribution, we consider the problem of predicting future items. An adaptive way to estimate the probability density is to recursively subdivide the domain to an appropriate data-dependent granularity. A…

Probability · Mathematics 2009-12-30 Marcus Hutter

Fast Non-Parametric Bayesian Inference on Infinite Trees

Given i.i.d. data from an unknown distribution, we consider the problem of predicting future items. An adaptive way to estimate the probability density is to recursively subdivide the domain to an appropriate data-dependent granularity. A…

Statistics Theory · Mathematics 2007-06-13 Marcus Hutter

Learning Optimal Decision Trees from Large Datasets

Inferring a decision tree from a given dataset is one of the classic problems in machine learning. This problem consists of buildings, from a labelled dataset, a tree such that each node corresponds to a class and a path between the tree…

Machine Learning · Computer Science 2019-04-15 Florent Avellaneda

Information Aware Max-Norm Dirichlet Networks for Predictive Uncertainty Estimation

Precise estimation of uncertainty in predictions for AI systems is a critical factor in ensuring trust and safety. Deep neural networks trained with a conventional method are prone to over-confident predictions. In contrast to Bayesian…

Machine Learning · Computer Science 2021-01-05 Theodoros Tsiligkaridis

Learning Interpretable Models Using Uncertainty Oracles

A desirable property of interpretable models is small size, so that they are easily understandable by humans. This leads to the following challenges: (a) small sizes typically imply diminished accuracy, and (b) bespoke levers provided by…

Machine Learning · Computer Science 2024-08-26 Abhishek Ghose , Balaraman Ravindran

Inferring the Underlying Structure of Information Cascades

In social networks, information and influence diffuse among users as cascades. While the importance of studying cascades has been recognized in various applications, it is difficult to observe the complete structure of cascades in practice.…

Social and Information Networks · Computer Science 2012-10-15 Bo Zong , Yinghui Wu , Ambuj K. Singh , Xifeng Yan

Quantifying Intrinsic Uncertainty in Classification via Deep Dirichlet Mixture Networks

With the widespread success of deep neural networks in science and technology, it is becoming increasingly important to quantify the uncertainty of the predictions produced by deep learning. In this paper, we introduce a new method that…

Machine Learning · Computer Science 2019-08-15 Qingyang Wu , He Li , Lexin Li , Zhou Yu

Distribution of Mutual Information from Complete and Incomplete Data

Mutual information is widely used, in a descriptive way, to measure the stochastic dependence of categorical random variables. In order to address questions such as the reliability of the descriptive value, one must consider…

Machine Learning · Computer Science 2007-07-13 Marcus Hutter , Marco Zaffalon

Inference on the History of a Randomly Growing Tree

The spread of infectious disease in a human community or the proliferation of fake news on social media can be modeled as a randomly growing tree-shaped graph. The history of the random growth process is often unobserved but contains…

Probability · Mathematics 2021-01-15 Harry Crane , Min Xu

Efficient Sample-optimal Learning of Gaussian Tree Models via Sample-optimal Testing of Gaussian Mutual Information

Learning high-dimensional distributions is a significant challenge in machine learning and statistics. Classical research has mostly concentrated on asymptotic analysis of such data under suitable assumptions. While existing works…

Machine Learning · Computer Science 2024-11-19 Sutanu Gayen , Sanket Kale , Sayantan Sen

Efficient distributional regression trees learning algorithms for calibrated non-parametric probabilistic forecasts

The perspective of developing trustworthy AI for critical applications in science and engineering requires machine learning techniques that are capable of estimating their own uncertainty. In the context of regression, instead of estimating…

Machine Learning · Computer Science 2026-05-14 Quentin Duchemin , Guillaume Obozinski

Dynamic Trees: A Structured Variational Method Giving Efficient Propagation Rules

Dynamic trees are mixtures of tree structured belief networks. They solve some of the problems of fixed tree networks at the cost of making exact inference intractable. For this reason approximate methods such as sampling or mean field…

Machine Learning · Computer Science 2013-01-18 Amos J. Storkey

Inference with Mondrian Random Forests

Random forests are popular methods for regression and classification analysis, and many different variants have been proposed in recent years. One interesting example is the Mondrian random forest, in which the underlying constituent trees…

Statistics Theory · Mathematics 2025-11-10 Matias D. Cattaneo , Jason M. Klusowski , William G. Underwood

Quantifying probabilistic robustness of tree-based classifiers against natural distortions

The concept of trustworthy AI has gained widespread attention lately. One of the aspects relevant to trustworthy AI is robustness of ML models. In this study, we show how to probabilistically quantify robustness against naturally occurring…

Machine Learning · Computer Science 2022-11-30 Christoph Schweimer , Sebastian Scher

Quantifying Uncertainty in Random Forests via Confidence Intervals and Hypothesis Tests

This work develops formal statistical inference procedures for machine learning ensemble methods. Ensemble methods based on bootstrapping, such as bagging and random forests, have improved the predictive accuracy of individual trees, but…

Machine Learning · Statistics 2015-09-11 Lucas Mentch , Giles Hooker