Related papers: Noise Regularization for Conditional Density Estim…

Conditional Density Estimation with Neural Networks: Best Practices and Benchmarks

Given a set of empirical observations, conditional density estimation aims to capture the statistical relationship between a conditional variable $\mathbf{x}$ and a dependent variable $\mathbf{y}$ by modeling their conditional probability…

Machine Learning · Statistics 2019-04-16 Jonas Rothfuss, Fabio Ferreira, Simon Walther, Maxim Ulrich

Conditional Noise-Contrastive Estimation of Unnormalised Models

Many parametric statistical models are not properly normalised and only specified up to an intractable partition function, which renders parameter estimation difficult. Examples of unnormalised models are Gibbs distributions, Markov random…

Machine Learning · Statistics 2018-06-12 Ciwan Ceylan, Michael U. Gutmann

Regularizing Deep Neural Networks by Noise: Its Interpretation and Optimization

Overfitting is one of the most critical challenges in deep neural networks, and there are various types of regularization methods to improve generalization performance. Injecting noises to hidden units during training, e.g., dropout, is…

Machine Learning · Computer Science 2017-11-10 Hyeonwoo Noh, Tackgeun You, Jonghwan Mun, Bohyung Han

Noise Contrastive Estimation and Negative Sampling for Conditional Models: Consistency and Statistical Efficiency

Noise Contrastive Estimation (NCE) is a powerful parameter estimation method for log-linear models, which avoids calculation of the partition function or its derivatives at each training step, a computationally demanding step in many cases.…

Computation and Language · Computer Science 2018-09-07 Zhuang Ma, Michael Collins

OCDE: Odds Conditional Density Estimator

Conditional density estimation (CDE) models can be useful for many statistical applications, especially because the full conditional density is estimated instead of traditional regression point estimates, revealing more information about…

Methodology · Statistics 2021-07-12 Alex Akira Okuno, Felipe Maia Polo

Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization

In open-domain Question Answering (QA), dense retrieval is crucial for finding relevant passages for answer generation. Typically, contrastive learning is used to train a retrieval model that maps passages and queries to the same semantic…

Computation and Language · Computer Science 2024-01-17 Shiqi Wang, Yeqin Zhang, Cam-Tu Nguyen

Consistency Training with Virtual Adversarial Discrete Perturbation

Consistency training regularizes a model by enforcing predictions of original and perturbed inputs to be similar. Previous studies have proposed various augmentation methods for the perturbation but are limited in that they are agnostic to…

Computation and Language · Computer Science 2022-04-29 Jungsoo Park, Gyuwan Kim, Jaewoo Kang

Neural SDE: Stabilizing Neural ODE Networks with Stochastic Noise

Neural Ordinary Differential Equation (Neural ODE) has been proposed as a continuous approximation to the ResNet architecture. Some commonly used regularization mechanisms in discrete neural networks (e.g. dropout, Gaussian noise) are…

Machine Learning · Computer Science 2019-06-07 Xuanqing Liu, Tesi Xiao, Si Si, Qin Cao, Sanjiv Kumar, Cho-Jui Hsieh

Selective Output Smoothing Regularization: Regularize Neural Networks by Softening Output Distributions

In this paper, we propose Selective Output Smoothing Regularization, a novel regularization method for training the Convolutional Neural Networks (CNNs). Inspired by the diverse effects on training from different samples, Selective Output…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Xuan Cheng, Tianshu Xie, Xiaomin Wang, Qifeng Weng, Minghui Liu, Jiali Deng, Ming Liu

Cross-regularization: Adaptive Model Complexity through Validation Gradients

Model regularization requires extensive manual tuning to balance complexity against overfitting. Cross-regularization resolves this tradeoff by directly adapting regularization parameters through validation gradients during training. The…

Machine Learning · Computer Science 2025-06-25 Carlos Stein Brito

Consistency Regularization Can Improve Robustness to Label Noise

Consistency regularization is a commonly-used technique for semi-supervised and self-supervised learning. It is an auxiliary objective function that encourages the prediction of the network to be similar in the vicinity of the observed…

Machine Learning · Computer Science 2021-10-05 Erik Englesson, Hossein Azizpour

On the Effect of Regularization on Nonparametric Mean-Variance Regression

Uncertainty quantification is vital for decision-making and risk assessment in machine learning. Mean-variance regression models, which predict both a mean and residual noise for each data point, provide a simple approach to uncertainty…

Machine Learning · Statistics 2025-12-01 Eliot Wong-Toi, Alex Boyd, Vincent Fortuin, Stephan Mandt

CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback

We propose a novel regularizer for supervised learning called Conditioning on Noisy Targets (CNT). This approach consists in conditioning the model on a noisy version of the target(s) (e.g., actions in imitation learning or labels in…

Machine Learning · Computer Science 2022-10-28 Alexia Jolicoeur-Martineau, Alex Lamb, Vikas Verma, Aniket Didolkar

Robust Convolution Neural ODEs via Contractivity-promoting regularization

Neural networks can be fragile to input noise and adversarial attacks. In this work, we consider Convolutional Neural Ordinary Differential Equations (NODEs), a family of continuous-depth neural networks represented by dynamical systems,…

Machine Learning · Computer Science 2025-08-18 Muhammad Zakwan, Liang Xu, Giancarlo Ferrari-Trecate

Stochasticity in Neural ODEs: An Empirical Study

Stochastic regularization of neural networks (e.g. dropout) is a wide-spread technique in deep learning that allows for better generalization. Despite its success, continuous-time models, such as neural ordinary differential equation (ODE),…

Machine Learning · Computer Science 2020-06-29 Viktor Oganesyan, Alexandra Volokhova, Dmitry Vetrov

Better Conditional Density Estimation for Neural Networks

The vast majority of the neural network literature focuses on predicting point values for a given set of response variables, conditioned on a feature vector. In many cases we need to model the full joint conditional distribution over the…

Machine Learning · Statistics 2016-06-09 Wesley Tansey, Karl Pichotta, James G. Scott

Deconvolutional Density Network: Modeling Free-Form Conditional Distributions

Conditional density estimation (CDE) is the task of estimating the probability of an event conditioned on some inputs. A neural network (NN) can also be used to compute the output distribution for continuous-domain, which can be viewed as…

Machine Learning · Computer Science 2021-12-30 Bing Chen, Mazharul Islam, Jisuo Gao, Lin Wang

Improving the Noise Estimation of Latent Neural Stochastic Differential Equations

Latent neural stochastic differential equations (SDEs) have recently emerged as a promising approach for learning generative models from stochastic time series data. However, they systematically underestimate the noise level inherent in…

Machine Learning · Computer Science 2025-06-11 Linus Heck, Maximilian Gelbrecht, Michael T. Schaub, Niklas Boers

Learning From Brains How to Regularize Machines

Despite impressive performance on numerous visual tasks, Convolutional Neural Networks (CNNs) --- unlike brains --- are often highly sensitive to small perturbations of their input, e.g. adversarial noise leading to erroneous decisions. We…

Machine Learning · Computer Science 2019-11-14 Zhe Li, Wieland Brendel, Edgar Y. Walker, Erick Cobos, Taliah Muhammad, Jacob Reimer, Matthias Bethge, Fabian H. Sinz, Xaq Pitkow, Andreas S. Tolias

Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

Over-parameterized deep neural networks trained by simple first-order methods are known to be able to fit any labeling of data. Such over-fitting ability hinders generalization when mislabeled training examples are present. On the other…

Machine Learning · Computer Science 2020-10-06 Wei Hu, Zhiyuan Li, Dingli Yu