Related papers: The C-SHIFT algorithm for normalizing covariances

Evaluating Prediction-Time Batch Normalization for Robustness under Covariate Shift

Covariate shift has been shown to sharply degrade both predictive accuracy and the calibration of uncertainty estimates for deep learning models. This is worrying, because covariate shift is prevalent in a wide range of real world…

Machine Learning · Computer Science 2021-01-18 Zachary Nado , Shreyas Padhy , D. Sculley , Alexander D'Amour , Balaji Lakshminarayanan , Jasper Snoek

Controlling Covariate Shift using Balanced Normalization of Weights

We introduce a new normalization technique that exhibits the fast convergence properties of batch normalization using a transformation of layer weights instead of layer outputs. The proposed technique keeps the contribution of positive and…

Machine Learning · Computer Science 2019-05-13 Aaron Defazio , Léon Bottou

Transformation, normalization and batch effect in the analysis of mass spectrometry data for omics studies

Data transformation, normalization and handling of batch effect are a key part of data analysis for almost all spectrometry-based omics data. This paper reviews and contrasts these three distinct aspects. We present a systematic overview of…

Methodology · Statistics 2016-06-20 Bart J. A. Mertens

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Training Deep Neural Networks is complicated by the fact that the distribution of each layer's inputs changes during training, as the parameters of the previous layers change. This slows down the training by requiring lower learning rates…

Machine Learning · Computer Science 2015-03-03 Sergey Ioffe , Christian Szegedy

A Theoretical Analysis on Independence-driven Importance Weighting for Covariate-shift Generalization

Covariate-shift generalization, a typical case in out-of-distribution (OOD) generalization, requires a good performance on the unknown test distribution, which varies from the accessible training distribution in the form of covariate shift.…

Machine Learning · Computer Science 2023-10-18 Renzhe Xu , Xingxuan Zhang , Zheyan Shen , Tong Zhang , Peng Cui

Optimal covariance matrix estimation for high-dimensional noise in high-frequency data

We consider high-dimensional measurement errors with high-frequency data. Our objective is on recovering the high-dimensional cross-sectional covariance matrix of the random errors with optimality. In this problem, not all components of the…

Statistics Theory · Mathematics 2024-04-03 Jinyuan Chang , Qiao Hu , Cheng Liu , Cheng Yong Tang

Information Geometrically Generalized Covariate Shift Adaptation

Many machine learning methods assume that the training and test data follow the same distribution. However, in the real world, this assumption is very often violated. In particular, the phenomenon that the marginal distribution of the data…

Machine Learning · Computer Science 2023-04-20 Masanari Kimura , Hideitsu Hino

An Internal Covariate Shift Bounding Algorithm for Deep Neural Networks by Unitizing Layers' Outputs

Batch Normalization (BN) techniques have been proposed to reduce the so-called Internal Covariate Shift (ICS) by attempting to keep the distributions of layer outputs unchanged. Experiments have shown their effectiveness on training deep…

Machine Learning · Computer Science 2020-01-10 You Huang , Yuanlong Yu

Customizable Adaptive Regularization Techniques for B-Spline Modeling

B-spline models are a powerful way to represent scientific data sets with a functional approximation. However, these models can suffer from spurious oscillations when the data to be approximated are not uniformly distributed. Model…

Numerical Analysis · Mathematics 2023-01-04 David Lenz , Raine Yeh , Vijay Mahadevan , Iulian Grindeanu , Tom Peterka

Statistical Inference Using Mean Shift Denoising

In this paper, we study how the mean shift algorithm can be used to denoise a dataset. We introduce a new framework to analyze the mean shift algorithm as a denoising approach by viewing the algorithm as an operator on a distribution…

Methodology · Statistics 2016-10-14 Yunhua Xiang , Yen-Chi Chen

Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization

Deep convolutional neural networks are known to be unstable during training at high learning rate unless normalization techniques are employed. Normalizing weights or activations allows the use of higher learning rates, resulting in faster…

Machine Learning · Computer Science 2019-12-02 Brendan Ruff , Taylor Beck , Joscha Bach

Stochastic Majorization-Minimization Algorithms for Large-Scale Optimization

Majorization-minimization algorithms consist of iteratively minimizing a majorizing surrogate of an objective function. Because of its simplicity and its wide applicability, this principle has been very popular in statistics and in signal…

Machine Learning · Statistics 2013-09-11 Julien Mairal

Weight Conditioning for Smooth Optimization of Neural Networks

In this article, we introduce a novel normalization technique for neural network weight matrices, which we term weight conditioning. This approach aims to narrow the gap between the smallest and largest singular values of the weight…

Computer Vision and Pattern Recognition · Computer Science 2026-03-16 Hemanth Saratchandran , Thomas X. Wang , Simon Lucey

Normalization Propagation: A Parametric Technique for Removing Internal Covariate Shift in Deep Networks

While the authors of Batch Normalization (BN) identify and address an important problem involved in training deep networks-- Internal Covariate Shift-- the current solution has certain drawbacks. Specifically, BN depends on batch statistics…

Machine Learning · Statistics 2016-07-13 Devansh Arpit , Yingbo Zhou , Bhargava U. Kota , Venu Govindaraju

Regularized Classification-Aware Quantization

Traditionally, quantization is designed to minimize the reconstruction error of a data source. When considering downstream classification tasks, other measures of distortion can be of interest; such as the 0-1 classification loss.…

Machine Learning · Computer Science 2021-07-22 Daniel Severo , Elad Domanovitz , Ashish Khisti

Adaptive Regularization of B-Spline Models for Scientific Data

B-spline models are a powerful way to represent scientific data sets with a functional approximation. However, these models can suffer from spurious oscillations when the data to be approximated are not uniformly distributed. Model…

Numerical Analysis · Mathematics 2022-03-29 David Lenz , Raine Yeh , Vijay Mahadevan , Iulian Grindeanu , Tom Peterka

On regularization methods of EM-Kaczmarz type

We consider regularization methods of Kaczmarz type in connection with the expectation-maximization (EM) algorithm for solving ill-posed equations. For noisy data, our methods are stabilized extensions of the well established…

Numerical Analysis · Mathematics 2015-05-13 Markus Haltmeier , Antonio Leitao , Elena Resmerita

Calibrated Prediction with Covariate Shift via Unsupervised Domain Adaptation

Reliable uncertainty estimates are an important tool for helping autonomous agents or human decision makers understand and leverage predictive models. However, existing approaches to estimating uncertainty largely ignore the possibility of…

Machine Learning · Computer Science 2020-05-22 Sangdon Park , Osbert Bastani , James Weimer , Insup Lee

An Adaptive Re-evaluation Method for Evolution Strategy under Additive Noise

The Covariance Matrix Adaptation Evolutionary Strategy (CMA-ES) is one of the most advanced algorithms in numerical black-box optimization. For noisy objective functions, several approaches were proposed to mitigate the noise, e.g.,…

Neural and Evolutionary Computing · Computer Science 2025-06-04 Catalin-Viorel Dinu , Yash J. Patel , Xavier Bonet-Monroig , Hao Wang

CrossNorm and SelfNorm for Generalization under Distribution Shifts

Traditional normalization techniques (e.g., Batch Normalization and Instance Normalization) generally and simplistically assume that training and test data follow the same distribution. As distribution shifts are inevitable in real-world…

Computer Vision and Pattern Recognition · Computer Science 2021-08-17 Zhiqiang Tang , Yunhe Gao , Yi Zhu , Zhi Zhang , Mu Li , Dimitris Metaxas