Related papers: Learning U-Statistics with Active Inference

200 papers

Inspired by the concept of active learning, we propose active inference, a methodology for statistical inference with machine-learning-assisted data collection. Assuming a budget on the number of labels that can be collected,…

Machine Learning · Statistics 2026-04-09 Tijana Zrnic, Emmanuel J. Candès

Active statistical inference is a new method for inference with AI-assisted data collection. Given a budget on the number of labeled data points that can be collected and assuming access to an AI predictive model, the basic idea is to…

Machine Learning · Statistics 2025-11-13 Puheng Li, Tijana Zrnic, Emmanuel Candès

We introduce a new framework for sample-efficient model evaluation that we call active testing. While approaches like active learning reduce the number of labels needed for model training, existing literature largely ignores the cost of…

Machine Learning · Statistics 2021-06-15 Jannik Kossen, Sebastian Farquhar, Yarin Gal, Tom Rainforth

U-statistics play central roles in many statistical learning tools but face the haunting issue of scalability. Significant efforts have been devoted to accelerating computation by U-statistic reduction. However, existing results almost…

Methodology · Statistics 2023-06-07 Meijia Shao, Dong Xia, Yuan Zhang

We propose the use of U-statistics to reduce variance for gradient estimation in importance-weighted variational inference. The key observation is that, given a base gradient estimator that requires $m > 1$ samples and a total of $n > m$…

Machine Learning · Computer Science 2023-02-28 Javier Burroni, Kenta Takatsu, Justin Domke, Daniel Sheldon

Supervised learning deals with the inference of a distribution over an output or label space $\mathcal{Y}$ conditioned on points in an observation space $\mathcal{X}$, given a training dataset $D$ of pairs in $\mathcal{X} \times \mathcal{Y}$. However, in a lot of…

Machine Learning · Computer Science 2007-11-15 Christos Dimitrakakis, Christian Savu-Krohn

Semi-supervised datasets are ubiquitous across diverse domains where obtaining fully labeled data is costly or time-consuming. The prevalence of such datasets has consistently driven the demand for new tools and methods that exploit the…

Statistics Theory · Mathematics 2024-03-12 Ilmun Kim, Larry Wasserman, Sivaraman Balakrishnan, Matey Neykov

Most existing learning models, particularly deep neural networks, rely on large datasets whose hand-labeling is expensive and time-consuming. A current trend is to make the learning of these models frugal and less dependent on…

Computer Vision and Pattern Recognition · Computer Science 2022-12-12 Sebastien Deschamps, Hichem Sahbi

Optimal design for model training is a critical topic in machine learning. Active Learning aims at obtaining improved models by querying samples with maximum uncertainty according to the estimation model for artificially labeling; this has…

Data subsampling has become widely recognized as a tool to overcome computational and economic bottlenecks in analyzing massive datasets. We contribute to the development of adaptive design for estimation of finite population…

Methodology · Statistics 2024-07-08 Henrik Imberg, Xiaomi Yang, Carol Flannagan, Jonas Bärgman

Many active learning methods belong to the retraining-based approaches, which select one unlabeled instance, add it to the training set with its possible labels, retrain the classification model, and evaluate the criteria that we base our…

Machine Learning · Statistics 2017-03-01 Yazhou Yang, Marco Loog

At its core, this thesis aims to enhance the practicality of deep learning by improving the label and training efficiency of deep learning models. To this end, we investigate data subset selection techniques, specifically active learning…

Machine Learning · Computer Science 2024-03-11 Andreas Kirsch

In supervised learning, acquiring labeled training data for a predictive model can be very costly, but acquiring a large amount of unlabeled data is often quite easy. Active learning is a method of obtaining predictive models with high…

Machine Learning · Computer Science 2020-12-17 Hideitsu Hino

In many real-world applications, researchers aim to deploy models trained in a source domain to a target domain, where obtaining labeled data is often expensive, time-consuming, or even infeasible. While most existing literature assumes…

Methodology · Statistics 2025-08-26 Seong-ho Lee, Yanyuan Ma, Jiwei Zhao

Training high-quality instance segmentation models requires an abundance of labeled images with instance masks and classifications, which is often expensive to procure. Active learning addresses this challenge by striving for optimum…

Computer Vision and Pattern Recognition · Computer Science 2023-09-29 Ke Yu, Stephen Albro, Giulia DeSalvo, Suraj Kothawade, Abdullah Rashwan, Sasan Tavakkol, Kayhan Batmanghelich, Xiaoqi Yin

Selective prediction aims to learn a reliable model that abstains from making predictions when uncertain. These predictions can then be deferred to humans for further evaluation. As an everlasting challenge for machine learning, in many…

Machine Learning · Computer Science 2024-03-04 Jiefeng Chen, Jinsung Yoon, Sayna Ebrahimi, Sercan Arik, Somesh Jha, Tomas Pfister

Active learning aims to reduce labeling efforts by selectively asking humans to annotate the most important data points from an unlabeled pool and is an example of human-machine interaction. Though active learning has been extensively…

Machine Learning · Computer Science 2020-01-31 Hongjing Zhang, S. S. Ravi, Ian Davidson

In reinforcement learning (RL), agents often operate in partially observed and uncertain environments. Model-based RL suggests that this is best achieved by learning and exploiting a probabilistic model of the world. 'Active inference' is…

Machine Learning · Computer Science 2019-11-26 Alexander Tschantz, Manuel Baltieri, Anil K. Seth, Christopher L. Buckley

Uncertainty Sampling is an Active Learning strategy that aims to improve the data efficiency of machine learning models by iteratively acquiring labels of data points with the highest uncertainty. While it has proven effective for…

Machine Learning · Computer Science 2025-02-28 Dominik Fuchsgruber, Tom Wollschläger, Bertrand Charpentier, Antonio Oroz, Stephan Günnemann

Active learning is usually applied to acquire labels of informative data points in supervised learning, to maximize accuracy in a sample-efficient way. However, maximizing the accuracy is not the end goal when the results are used for…

Machine Learning · Statistics 2021-10-22 Louis Filstroff, Iiris Sundin, Petrus Mikkola, Aleksei Tiulpin, Juuso Kylmäoja, Samuel Kaski