English
Related papers

Related papers: Contextual Dropout: An Efficient Sample-Dependent …

200 papers

Dropout is a widely used regularization technique which improves the generalization ability of a model by randomly dropping neurons. In light of this, we propose Dropout Prompt Learning, which aims for applying dropout to improve the…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Biao Chen , Lin Zuo , Mengmeng Jing , Kunbin He , Yuchen Wang

Dropout is a very effective way of regularizing neural networks. Stochastically "dropping out" units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network…

Neural and Evolutionary Computing · Computer Science 2017-08-04 Pietro Morerio , Jacopo Cavazza , Riccardo Volpi , Rene Vidal , Vittorio Murino

Active learning is relevant and challenging for high-dimensional regression models when the annotation of the samples is expensive. Yet most of the existing sampling methods cannot be applied to large-scale problems, consuming too much time…

Machine Learning · Computer Science 2020-01-24 Evgenii Tsymbalov , Maxim Panov , Alexander Shapeev

Dropout is used as a practical tool to obtain uncertainty estimates in large vision models and reinforcement learning (RL) tasks. But to obtain well-calibrated uncertainty estimates, a grid-search over the dropout probabilities is necessary…

Machine Learning · Statistics 2017-05-23 Yarin Gal , Jiri Hron , Alex Kendall

Uncertainty estimation for machine learning models is of high importance in many scenarios such as constructing the confidence intervals for model predictions and detection of out-of-distribution or adversarially generated points. In this…

Machine Learning · Computer Science 2022-05-06 Kirill Fedyanin , Evgenii Tsymbalov , Maxim Panov

Deep neural networks have become the default choice for many of the machine learning tasks such as classification and regression. Dropout, a method commonly used to improve the convergence of deep neural networks, generates an ensemble of…

Machine Learning · Statistics 2019-04-11 Tal Kachman , Michal Moshkovitz , Michal Rosen-Zvi

In classification applications, we often want probabilistic predictions to reflect confidence or uncertainty. Dropout, a commonly used training technique, has recently been linked to Bayesian inference, yielding an efficient way to quantify…

Machine Learning · Computer Science 2019-06-25 Zhilu Zhang , Adrian V. Dalca , Mert R. Sabuncu

Dropout is a simple but efficient regularization technique for achieving better generalization of deep neural networks (DNNs); hence it is widely used in tasks based on DNNs. During training, dropout randomly discards a portion of the…

Neural and Evolutionary Computing · Computer Science 2020-10-22 Hiroshi Inoue

Dropout has been witnessed with great success in training deep neural networks by independently zeroing out the outputs of neurons at random. It has also received a surge of interest for shallow learning, e.g., logistic regression. However,…

Machine Learning · Computer Science 2016-12-06 Zhe Li , Boqing Gong , Tianbao Yang

Dropout is a widely used regularization trick to resolve the overfitting issue in large feedforward neural networks trained on a small dataset, which performs poorly on the held-out test subset. Although the effectiveness of this…

Computation and Language · Computer Science 2023-03-29 Suman Adhya , Avishek Lahiri , Debarshi Kumar Sanyal

Deep learning tools have gained tremendous attention in applied machine learning. However such tools for regression and classification do not capture model uncertainty. In comparison, Bayesian models offer a mathematically grounded…

Machine Learning · Statistics 2016-10-05 Yarin Gal , Zoubin Ghahramani

In this work we explore the relevance of dropout for modern language models, particularly in the context of models on the scale of <100M parameters. We explore it's relevance firstly in the regime of improving the sample efficiency of…

Computation and Language · Computer Science 2024-09-10 Dylan Hillier , Leon Guertler , Bobby Cheng , Cheston Tan

The ability to adapt to changing environments and settings is essential for robots acting in dynamic and unstructured environments or working alongside humans with varied abilities or preferences. This work introduces an extremely simple…

Robotics · Computer Science 2022-10-31 Pamela Carreno-Medrano , Dana Kulić , Michael Burke

Dropout, a simple and effective way to train deep neural networks, has led to a number of impressive empirical successes and spawned many recent theoretical investigations. However, the gap between dropout's training and inference phases,…

Machine Learning · Computer Science 2017-02-17 Xuezhe Ma , Yingkai Gao , Zhiting Hu , Yaoliang Yu , Yuntian Deng , Eduard Hovy

Deep time series models are vulnerable to noisy data ubiquitous in real-world applications. Existing robustness strategies either prune data or rely on costly prior quantification, failing to balance effectiveness and efficiency. In this…

Artificial Intelligence · Computer Science 2026-05-26 Siru Zhong , Yiqiu Liu , Zhiqing Cui , Zezhi Shao , Fei Wang , Qingsong Wen , Yuxuan Liang

Deep neural networks are typically trained by uniformly sampling large datasets across epochs, despite evidence that not all samples contribute equally throughout learning. Recent work shows that progressively reducing the amount of…

Machine Learning · Computer Science 2026-04-15 Amar Gahir , Varshil Patel , Shreyank N Gowda

Transformer-based language models are widely deployed for reasoning, yet their behavior under inference-time stochasticity remains underexplored. While dropout is common during training, its inference-time effects via Monte Carlo sampling…

Machine Learning · Computer Science 2026-03-19 Antônio Junior Alves Caiado , Michael Hahsler

Learning to infer the conditional posterior model is a key step for robust meta-learning. This paper presents a new Bayesian meta-learning approach called Neural Variational Dropout Processes (NVDPs). NVDPs model the conditional posterior…

Machine Learning · Computer Science 2025-10-23 Insu Jeon , Youngjin Park , Gunhee Kim

Deep learning models frequently exploit spurious features in training data to achieve low training error, often resulting in poor generalization when faced with shifted testing distributions. To address this issue, various methods from…

Machine Learning · Computer Science 2025-02-11 Geraldin Nanfack , Eugene Belilovsky

Dropout has been proven to be an effective algorithm for training robust deep networks because of its ability to prevent overfitting by avoiding the co-adaptation of feature detectors. Current explanations of dropout include bagging, naive…

Computer Vision and Pattern Recognition · Computer Science 2019-12-02 Xu Shen , Xinmei Tian , Tongliang Liu , Fang Xu , Dacheng Tao
‹ Prev 1 2 3 10 Next ›