English
Related papers

Related papers: Explicitly Bayesian Regularizations in Deep Learni…

200 papers

Modern deep learning models generalize remarkably well in-distribution, despite being overparametrized and trained with little to no explicit regularization. Instead, current theory credits implicit regularization imposed by the choice of…

Machine Learning · Computer Science 2026-03-17 Jonathan Wenger , Beau Coker , Juraj Marusic , John P. Cunningham

In this work, we introduce a novel probabilistic representation of deep learning, which provides an explicit explanation for the Deep Neural Networks (DNNs) in three aspects: (i) neurons define the energy of a Gibbs distribution; (ii) the…

Machine Learning · Computer Science 2019-08-27 Xinjie Lan , Kenneth E. Barner

Neural networks are powerful function approximators with tremendous potential in learning complex distributions. However, they are prone to overfitting on spurious patterns. Bayesian inference provides a principled way to regularize neural…

Machine Learning · Computer Science 2024-12-02 Yanzhe Bekkemoen , Helge Langseth

Deep Neural Networks (DNN) and especially Convolutional Neural Networks (CNN) are a de-facto standard for the analysis of large volumes of signals and images. Yet, their development and underlying principles have been largely performed in…

Information Theory · Computer Science 2022-03-24 Ljubisa Stankovic , Danilo Mandic

Deep neural networks (DNNs) often require good regularizers to generalize well. Currently, state-of-the-art DNN regularization techniques consist in randomly dropping units and/or connections on each iteration of the training algorithm.…

Machine Learning · Computer Science 2018-03-06 Harris Partaourides , Sotirios P. Chatzis

Convolutional neural networks (CNNs) provide flexible function approximations for a wide variety of applications when the input variables are in the form of images or spatial data. Although CNNs often outperform traditional statistical…

Methodology · Statistics 2024-05-24 Yeseul Jeon , Won Chang , Seonghyun Jeong , Sanghoon Han , Jaewoo Park

We introduce implicit Bayesian neural networks, a simple and scalable approach for uncertainty representation in deep learning. Standard Bayesian approach to deep learning requires the impractical inference of the posterior distribution…

Machine Learning · Statistics 2020-10-27 Trung Trinh , Samuel Kaski , Markus Heinonen

Several works have shown that the regularization mechanisms underlying deep neural networks' generalization performances are still poorly understood. In this paper, we hypothesize that deep neural networks are regularized through their…

Machine Learning · Computer Science 2021-03-12 Carbonnelle Simon , Christophe De Vleeschouwer

Neural Networks (NNs) have provided state-of-the-art results for many challenging machine learning tasks such as detection, regression and classification across the domains of computer vision, speech recognition and natural language…

Machine Learning · Statistics 2026-04-21 Ethan Goan , Clinton Fookes

The key distinguishing property of a Bayesian approach is marginalization instead of optimization, not the prior, or Bayes rule. Bayesian inference is especially compelling for deep neural networks. (1) Neural networks are typically…

Machine Learning · Computer Science 2020-01-30 Andrew Gordon Wilson

Deep learning is a form of machine learning for nonlinear high dimensional pattern matching and prediction. By taking a Bayesian probabilistic perspective, we provide a number of insights into more efficient algorithms for optimisation and…

Machine Learning · Statistics 2018-01-23 Nicholas Polson , Vadim Sokolov

Despite impressive performance on numerous visual tasks, Convolutional Neural Networks (CNNs) --- unlike brains --- are often highly sensitive to small perturbations of their input, e.g. adversarial noise leading to erroneous decisions. We…

Deep Neural Networks (DNNs) have aroused great attention in Compressed Sensing (CS) restoration. However, the working mechanism of DNNs is not explainable, thereby it is unclear that how to design an optimal DNNs for CS restoration. In this…

Signal Processing · Electrical Eng. & Systems 2019-02-26 Xinjie Lan , Xin Guo , Kenneth E. Barner

Bayesian neural networks utilize probabilistic layers that capture uncertainty over weights and activations, and are trained using Bayesian inference. Since these probabilistic layers are designed to be drop-in replacement of their…

Machine Learning · Computer Science 2021-06-28 Daniel T. Chang

Traditional deep neural nets (NNs) have shown the state-of-the-art performance in the task of classification in various applications. However, NNs have not considered any types of uncertainty associated with the class probabilities to…

Machine Learning · Computer Science 2019-10-16 Xujiang Zhao , Yuzhe Ou , Lance Kaplan , Feng Chen , Jin-Hee Cho

Recent years have produced great advances in training large, deep neural networks (DNNs), including notable successes in training convolutional neural networks (convnets) to recognize natural images. However, our understanding of how these…

Computer Vision and Pattern Recognition · Computer Science 2015-06-23 Jason Yosinski , Jeff Clune , Anh Nguyen , Thomas Fuchs , Hod Lipson

Deep learning has been the engine powering many successes of data science. However, the deep neural network (DNN), as the basic model of deep learning, is often excessively over-parameterized, causing many difficulties in training,…

Machine Learning · Statistics 2021-03-09 Yan Sun , Qifan Song , Faming Liang

In this work we investigate the reasons why Batch Normalization (BN) improves the generalization performance of deep networks. We argue that one major reason, distinguishing it from data-independent normalization methods, is randomness of…

Machine Learning · Computer Science 2018-11-05 Alexander Shekhovtsov , Boris Flach

Bayesian neural networks (BNNs) augment deep networks with uncertainty quantification by Bayesian treatment of the network weights. However, such models face the challenge of Bayesian inference in a high-dimensional and usually…

Machine Learning · Computer Science 2021-03-30 Zhijie Deng , Yucen Luo , Jun Zhu , Bo Zhang

Deep learning models, including modern systems like large language models, are well known to offer unreliable estimates of the uncertainty of their decisions. In order to improve the quality of the confidence levels, also known as…

Machine Learning · Computer Science 2024-04-15 Jiayi Huang , Sangwoo Park , Osvaldo Simeone
‹ Prev 1 2 3 10 Next ›