Related papers: Batch Bayesian Active Learning with Partial Batch …

Prediction-Oriented Bayesian Active Learning

Information-theoretic approaches to active learning have traditionally focused on maximising the information gathered about the model parameters, most commonly by optimising the BALD score. We highlight that this can be suboptimal from the…

Machine Learning · Computer Science 2023-04-18 Freddie Bickford Smith , Andreas Kirsch , Sebastian Farquhar , Yarin Gal , Adam Foster , Tom Rainforth

Bayesian Batch Active Learning as Sparse Subset Approximation

Leveraging the wealth of unlabeled data produced in recent years provides great potential for improving supervised models. When the cost of acquiring labels is high, probabilistic active learning methods can be used to greedily select the…

Machine Learning · Statistics 2021-02-09 Robert Pinsler , Jonathan Gordon , Eric Nalisnick , José Miguel Hernández-Lobato

Test Distribution-Aware Active Learning: A Principled Approach Against Distribution Shift and Outliers

Expanding on MacKay (1992), we argue that conventional model-based methods for active learning - like BALD - have a fundamental shortfall: they fail to directly account for the test-time distribution of the input variables. This can lead to…

Machine Learning · Computer Science 2021-11-23 Andreas Kirsch , Tom Rainforth , Yarin Gal

Bayesian active learning for production, a systematic study and a reusable library

Active learning is able to reduce the amount of labelling effort by using a machine learning model to query the user for specific inputs. While there are many papers on new active learning techniques, these techniques rarely satisfy the…

Machine Learning · Computer Science 2020-06-18 Parmida Atighehchian , Frédéric Branchaud-Charron , Alexandre Lacoste

Big Batch Bayesian Active Learning by Considering Predictive Probabilities

We observe that BatchBALD, a popular acquisition function for batch Bayesian active learning for classification, can conflate epistemic and aleatoric uncertainty, leading to suboptimal performance. Motivated by this observation, we propose…

Machine Learning · Computer Science 2025-01-15 Sebastian W. Ober , Samuel Power , Tom Diethe , Henry B. Moss

BatchBALD: Efficient and Diverse Batch Acquisition for Deep Bayesian Active Learning

We develop BatchBALD, a tractable approximation to the mutual information between a batch of points and model parameters, which we use as an acquisition function to select multiple informative points jointly for the task of deep Bayesian…

Machine Learning · Computer Science 2019-10-29 Andreas Kirsch , Joost van Amersfoort , Yarin Gal

Scalable Batch Acquisition for Deep Bayesian Active Learning

In deep active learning, it is especially important to choose multiple examples to markup at each step to work efficiently, especially on large datasets. At the same time, existing solutions to this problem in the Bayesian setup, such as…

Machine Learning · Computer Science 2023-02-17 Aleksandr Rubashevskii , Daria Kotova , Maxim Panov

Active Learning with Expected Error Reduction

Active learning has been studied extensively as a method for efficient data collection. Among the many approaches in literature, Expected Error Reduction (EER) (Roy and McCallum) has been shown to be an effective method for active learning:…

Machine Learning · Computer Science 2022-11-18 Stephen Mussmann , Julia Reisler , Daniel Tsai , Ehsan Mousavi , Shayne O'Brien , Moises Goldszmidt

PowerEvaluationBALD: Efficient Evaluation-Oriented Deep (Bayesian) Active Learning with Stochastic Acquisition Functions

We develop BatchEvaluationBALD, a new acquisition function for deep Bayesian active learning, as an expansion of BatchBALD that takes into account an evaluation set of unlabeled data, for example, the pool set. We also develop a variant for…

Machine Learning · Computer Science 2021-05-12 Andreas Kirsch , Yarin Gal

Weighted Ensembles for Active Learning with Adaptivity

Labeled data can be expensive to acquire in several application domains, including medical imaging, robotics, and computer vision. To efficiently train machine learning models under such high labeling costs, active learning (AL) judiciously…

Machine Learning · Computer Science 2022-06-13 Konstantinos D. Polyzos , Qin Lu , Georgios B. Giannakis

Efficient Sampling-Based Bayesian Active Learning for synaptic characterization

Bayesian Active Learning (BAL) is an efficient framework for learning the parameters of a model, in which input stimuli are selected to maximize the mutual information between the observations and the unknown parameters. However, the…

Quantitative Methods · Quantitative Biology 2022-12-01 Camille Gontier , Simone Carlo Surace , Igor Delvendahl , Martin Müller , Jean-Pascal Pfister

Bayesian Active Learning for Censored Regression

Bayesian active learning is based on information theoretical approaches that focus on maximising the information that new observations provide to the model parameters. This is commonly done by maximising the Bayesian Active Learning by…

Machine Learning · Computer Science 2024-02-20 Frederik Boe Hüttel , Christoffer Riis , Filipe Rodrigues , Francisco Câmara Pereira

Targeted Active Learning for Bayesian Decision-Making

Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the accuracy is not the end goal when the results are used for…

Machine Learning · Statistics 2021-10-22 Louis Filstroff , Iiris Sundin , Petrus Mikkola , Aleksei Tiulpin , Juuso Kylmäoja , Samuel Kaski

Bayesian Estimate of Mean Proper Scores for Diversity-Enhanced Active Learning

The effectiveness of active learning largely depends on the sampling efficiency of the acquisition function. Expected Loss Reduction (ELR) focuses on a Bayesian estimate of the reduction in classification error, and more general costs fit…

Machine Learning · Computer Science 2023-12-19 Wei Tan , Lan Du , Wray Buntine

Toward Optimal Probabilistic Active Learning Using a Bayesian Approach

Gathering labeled data to train well-performing machine learning models is one of the critical challenges in many applications. Active learning aims at reducing the labeling costs by an efficient and effective allocation of costly labeling…

Machine Learning · Computer Science 2020-06-03 Daniel Kottke , Marek Herde , Christoph Sandrock , Denis Huseljic , Georg Krempl , Bernhard Sick

Batch Active Learning at Scale

The ability to train complex and highly effective models often requires an abundance of training data, which can easily become a bottleneck in cost, time, and computational resources. Batch active learning, which adaptively issues batched…

Machine Learning · Computer Science 2021-08-02 Gui Citovsky , Giulia DeSalvo , Claudio Gentile , Lazaros Karydas , Anand Rajagopalan , Afshin Rostamizadeh , Sanjiv Kumar

Efficacy of Bayesian Neural Networks in Active Learning

Obtaining labeled data for machine learning tasks can be prohibitively expensive. Active learning mitigates this issue by exploring the unlabeled data space and prioritizing the selection of data that can best improve the model performance.…

Machine Learning · Computer Science 2021-04-21 Vineeth Rakesh , Swayambhoo Jain

Deep Bayesian Active Learning for Preference Modeling in Large Language Models

Leveraging human preferences for steering the behavior of Large Language Models (LLMs) has demonstrated notable success in recent years. Nonetheless, data selection and labeling are still a bottleneck for these systems, particularly at…

Machine Learning · Computer Science 2024-10-30 Luckeciano C. Melo , Panagiotis Tigas , Alessandro Abate , Yarin Gal

Loss-Driven Bayesian Active Learning

The central goal of active learning is to gather data that maximises downstream predictive performance, but popular approaches have limited flexibility in customising this data acquisition to different downstream problems and losses. We…

Machine Learning · Computer Science 2026-05-11 Zhuoyue Huang , Freddie Bickford Smith , Tom Rainforth

Speeding Up BatchBALD: A k-BALD Family of Approximations for Active Learning

Active learning is a powerful method for training machine learning models with limited labeled data. One commonly used technique for active learning is BatchBALD, which uses Bayesian neural networks to find the most informative points to…

Machine Learning · Computer Science 2023-01-24 Andreas Kirsch