Related papers: Prediction-Oriented Bayesian Active Learning

Test Distribution-Aware Active Learning: A Principled Approach Against Distribution Shift and Outliers

Expanding on MacKay (1992), we argue that conventional model-based methods for active learning - like BALD - have a fundamental shortfall: they fail to directly account for the test-time distribution of the input variables. This can lead to…

Machine Learning · Computer Science 2021-11-23 Andreas Kirsch , Tom Rainforth , Yarin Gal

Batch Bayesian Active Learning with Partial Batch Label Sampling

Over the past couple of decades, many active learning acquisition functions have been proposed, leaving practitioners with an unclear choice of which to use. Bayesian-based active learning offers principled objectives with explainable…

Machine Learning · Computer Science 2026-05-12 Kangping Hu , Stephen Mussmann

Bayesian Active Learning for Censored Regression

Bayesian active learning is based on information theoretical approaches that focus on maximising the information that new observations provide to the model parameters. This is commonly done by maximising the Bayesian Active Learning by…

Machine Learning · Computer Science 2024-02-20 Frederik Boe Hüttel , Christoffer Riis , Filipe Rodrigues , Francisco Câmara Pereira

Causal-EPIG: A Prediction-Oriented Active Learning Framework for CATE Estimation

Estimating the Conditional Average Treatment Effect (CATE) is often constrained by the high cost of obtaining outcome measurements, making active learning essential. However, conventional active learning strategies suffer from a fundamental…

Machine Learning · Statistics 2025-09-29 Erdun Gao , Jake Fawkes , Dino Sejdinovic

Unifying Approaches in Active Learning and Active Sampling via Fisher Information and Information-Theoretic Quantities

Recently proposed methods in data subset selection, that is active learning and active sampling, use Fisher information, Hessians, similarity matrices based on gradients, and gradient lengths to estimate how informative data is for a…

Machine Learning · Computer Science 2022-11-08 Andreas Kirsch , Yarin Gal

Big Batch Bayesian Active Learning by Considering Predictive Probabilities

We observe that BatchBALD, a popular acquisition function for batch Bayesian active learning for classification, can conflate epistemic and aleatoric uncertainty, leading to suboptimal performance. Motivated by this observation, we propose…

Machine Learning · Computer Science 2025-01-15 Sebastian W. Ober , Samuel Power , Tom Diethe , Henry B. Moss

Bayesian Active Learning for Classification and Preference Learning

Information theoretic active learning has been widely studied for probabilistic models. For simple regression an optimal myopic policy is easily tractable. However, for other tasks and with more complex models, such as classification with…

Machine Learning · Statistics 2011-12-30 Neil Houlsby , Ferenc Huszár , Zoubin Ghahramani , Máté Lengyel

Robust Expected Information Gain for Optimal Bayesian Experimental Design Using Ambiguity Sets

The ranking of experiments by expected information gain (EIG) in Bayesian experimental design is sensitive to changes in the model's prior distribution, and the approximation of EIG yielded by sampling will have errors similar to the use of…

Machine Learning · Statistics 2022-05-23 Jinwoo Go , Tobin Isaac

Targeted Active Learning for Bayesian Decision-Making

Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the accuracy is not the end goal when the results are used for…

Machine Learning · Statistics 2021-10-22 Louis Filstroff , Iiris Sundin , Petrus Mikkola , Aleksei Tiulpin , Juuso Kylmäoja , Samuel Kaski

Loss-Driven Bayesian Active Learning

The central goal of active learning is to gather data that maximises downstream predictive performance, but popular approaches have limited flexibility in customising this data acquisition to different downstream problems and losses. We…

Machine Learning · Computer Science 2026-05-11 Zhuoyue Huang , Freddie Bickford Smith , Tom Rainforth

On Estimating the Gradient of the Expected Information Gain in Bayesian Experimental Design

Bayesian Experimental Design (BED), which aims to find the optimal experimental conditions for Bayesian inference, is usually posed as to optimize the expected information gain (EIG). The gradient information is often needed for efficient…

Machine Learning · Statistics 2023-12-14 Ziqiao Ao , Jinglai Li

PowerEvaluationBALD: Efficient Evaluation-Oriented Deep (Bayesian) Active Learning with Stochastic Acquisition Functions

We develop BatchEvaluationBALD, a new acquisition function for deep Bayesian active learning, as an expansion of BatchBALD that takes into account an evaluation set of unlabeled data, for example, the pool set. We also develop a variant for…

Machine Learning · Computer Science 2021-05-12 Andreas Kirsch , Yarin Gal

Active Learning with Expected Error Reduction

Active learning has been studied extensively as a method for efficient data collection. Among the many approaches in literature, Expected Error Reduction (EER) (Roy and McCallum) has been shown to be an effective method for active learning:…

Machine Learning · Computer Science 2022-11-18 Stephen Mussmann , Julia Reisler , Daniel Tsai , Ehsan Mousavi , Shayne O'Brien , Moises Goldszmidt

Active inference and artificial reasoning

This technical note considers the sampling of outcomes that provide the greatest amount of information about the structure of underlying world models. This generalisation furnishes a principled approach to structure learning under a…

Neurons and Cognition · Quantitative Biology 2025-12-25 Karl Friston , Lancelot Da Costa , Alexander Tschantz , Conor Heins , Christopher Buckley , Tim Verbelen , Thomas Parr

Active Model Selection

Classical learning assumes the learner is given a labeled data sample, from which it learns a model. The field of Active Learning deals with the situation where the learner begins not with a training sample, but instead with resources that…

Machine Learning · Computer Science 2012-07-19 Omid Madani , Daniel J. Lizotte , Russell Greiner

Exponentiated Gradient Exploration for Active Learning

Active learning strategies respond to the costly labelling task in a supervised classification by selecting the most useful unlabelled examples in training a predictive model. Many conventional active learning algorithms focus on refining…

Machine Learning · Computer Science 2014-08-12 Djallel Bouneffouf

Expected information gain estimation via density approximations: Sample allocation and dimension reduction

Computing expected information gain (EIG) from prior to posterior (equivalently, mutual information between candidate observations and model parameters or other quantities of interest) is a fundamental challenge in Bayesian optimal…

Methodology · Statistics 2026-01-30 Fengyi Li , Ricardo Baptista , Youssef Marzouk

Wild Guesses and Mild Guesses in Active Concept Learning

Human concept learning is typically active: learners choose which instances to query or test in order to reduce uncertainty about an underlying rule or category. Active concept learning must balance informativeness of queries against the…

Artificial Intelligence · Computer Science 2026-02-09 Anirudh Chari , Neil Pattanaik

Linear combination of one-step predictive information with an external reward in an episodic policy gradient setting: a critical analysis

One of the main challenges in the field of embodied artificial intelligence is the open-ended autonomous learning of complex behaviours. Our approach is to use task-independent, information-driven intrinsic motivation(s) to support…

Artificial Intelligence · Computer Science 2013-09-27 Keyan Zahedi , Georg Martius , Nihat Ay

Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer Treatment-Effects from Observational Data

Estimating personalized treatment effects from high-dimensional observational data is essential in situations where experimental designs are infeasible, unethical, or expensive. Existing approaches rely on fitting deep models on outcomes…

Machine Learning · Computer Science 2022-02-02 Andrew Jesson , Panagiotis Tigas , Joost van Amersfoort , Andreas Kirsch , Uri Shalit , Yarin Gal