Related papers: Learning Quantized Neural Nets by Coarse Gradient …

Understanding Straight-Through Estimator in Training Activation Quantized Neural Nets

Training activation quantized neural networks involves minimizing a piecewise constant function whose gradient vanishes almost everywhere, which is undesirable for the standard back-propagation or chain rule. An empirical way around this…

Machine Learning · Computer Science 2019-09-26 Penghang Yin, Jiancheng Lyu, Shuai Zhang, Stanley Osher, Yingyong Qi, Jack Xin

QuantNet: Learning to Quantize by Learning within Fully Differentiable Framework

Despite the achievements of recent binarization methods on reducing the performance degradation of Binary Neural Networks (BNNs), gradient mismatching caused by the Straight-Through-Estimator (STE) still dominates quantized networks. This…

Computer Vision and Pattern Recognition · Computer Science 2020-09-11 Junjie Liu, Dongchao Wen, Deyu Wang, Wei Tao, Tse-Wei Chen, Kinya Osa, Masami Kato

Network Quantization with Element-wise Gradient Scaling

Network quantization aims at reducing bit-widths of weights and/or activations, particularly important for implementing deep neural networks with limited hardware resources. Most methods use the straight-through estimator (STE) to train…

Computer Vision and Pattern Recognition · Computer Science 2021-04-05 Junghyup Lee, Dohyung Kim, Bumsub Ham

Error-aware Quantization through Noise Tempering

Quantization has become a predominant approach for model compression, enabling deployment of large models trained on GPUs onto smaller form-factor devices for inference. Quantization-aware training (QAT) optimizes model parameters with…

Machine Learning · Computer Science 2022-12-13 Zheng Wang, Juncheng B Li, Shuhui Qu, Florian Metze, Emma Strubell

High-Dimensional Learning Dynamics of Quantized Models with Straight-Through Estimator

Quantized neural network training optimizes a discrete, non-differentiable objective. The straight-through estimator (STE) enables backpropagation through surrogate gradients and is widely used. While previous studies have primarily focused…

Machine Learning · Statistics 2025-10-14 Yuma Ichikawa, Shuhei Kashiwamura, Ayaka Sakata

Beyond Discreteness: Finite-Sample Analysis of Straight-Through Estimator for Quantization

Training quantized neural networks requires addressing the non-differentiable and discrete nature of the underlying optimization problem. To tackle this challenge, the straight-through estimator (STE) has become the most widely adopted…

Machine Learning · Computer Science 2025-05-26 Halyun Jeong, Jack Xin, Penghang Yin

Training Quantised Neural Networks with STE Variants: the Additive Noise Annealing Algorithm

Training quantised neural networks (QNNs) is a non-differentiable optimisation problem since weights and features are output by piecewise constant functions. The standard solution is to apply the straight-through estimator (STE), using…

Machine Learning · Computer Science 2022-03-23 Matteo Spallanzani, Gian Paolo Leonardi, Luca Benini

Robust Training of Neural Networks at Arbitrary Precision and Sparsity

The discontinuous operations inherent in quantization and sparsification introduce a long-standing obstacle to backpropagation, particularly in ultra-low precision and sparse regimes. While the community has long viewed quantization as…

Machine Learning · Computer Science 2026-03-11 Chengxi Ye, Grace Chu, Yanfeng Liu, Yichi Zhang, Lukasz Lew, Li Zhang, Mark Sandler, Andrew Howard

Recurrence of Optimum for Training Weight and Activation Quantized Networks

Deep neural networks (DNNs) are quantized for efficient inference on resource-constrained platforms. However, training deep learning models with low-precision weights and activations involves a demanding optimization task, which calls for…

Machine Learning · Computer Science 2021-05-25 Ziang Long, Penghang Yin, Jack Xin

Progressive Element-wise Gradient Estimation for Neural Network Quantization

Neural network quantization aims to reduce the bit-widths of weights and activations, making it a critical technique for deploying deep neural networks on resource-constrained hardware. Most Quantization-Aware Training (QAT) methods rely on…

Machine Learning · Computer Science 2025-09-03 Kaiqi Zhao

Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks

The quantized neural networks (QNNs) can be useful for neural network acceleration and compression, but during the training process they pose a challenge: how to propagate the gradient of loss function through the graph flow with a…

Machine Learning · Computer Science 2020-03-26 Jun Chen, Yong Liu, Hao Zhang, Shengnan Hou, Jian Yang

Blended Coarse Gradient Descent for Full Quantization of Deep Neural Networks

Quantized deep neural networks (QDNNs) are attractive due to their much lower memory storage and faster inference speed than their regular full precision counterparts. To maintain the same performance level especially at low bit-widths,…

Machine Learning · Computer Science 2019-01-08 Penghang Yin, Shuai Zhang, Jiancheng Lyu, Stanley Osher, Yingyong Qi, Jack Xin

GDNSQ: Gradual Differentiable Noise Scale Quantization for Low-bit Neural Networks

Quantized neural networks can be viewed as a chain of noisy channels, where rounding in each layer reduces capacity as bit-width shrinks; the floating-point (FP) checkpoint sets the maximum input rate. We track capacity dynamics as the…

Machine Learning · Computer Science 2025-11-12 Sergey Salishev, Ian Akhremchik

Navigating Local Minima in Quantized Spiking Neural Networks

Spiking and Quantized Neural Networks (NNs) are becoming exceedingly important for hyper-efficient implementations of Deep Learning (DL) algorithms. However, these networks face challenges when trained using error backpropagation, due to…

Machine Learning · Computer Science 2022-02-16 Jason K. Eshraghian, Corey Lammie, Mostafa Rahimi Azghadi, Wei D. Lu

Custom Gradient Estimators are Straight-Through Estimators in Disguise

Quantization-aware training comes with a fundamental challenge: the derivative of quantization functions such as rounding are zero almost everywhere and nonexistent elsewhere. Various differentiable approximations of quantization functions…

Machine Learning · Computer Science 2024-05-24 Matt Schoenbauer, Daniele Moro, Lukasz Lew, Andrew Howard

Learning low-precision neural networks without Straight-Through Estimator(STE)

The Straight-Through Estimator (STE) is widely used for back-propagating gradients through the quantization function, but the STE technique lacks a complete theoretical understanding. We propose an alternative methodology called…

Machine Learning · Computer Science 2019-05-22 Zhi-Gang Liu, Matthew Mattina

Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations

We introduce a method to train Quantized Neural Networks (QNNs) --- neural networks with extremely low precision (e.g., 1-bit) weights and activations, at run-time. At train-time the quantized weights and activations are used for computing…

Neural and Evolutionary Computing · Computer Science 2016-09-23 Itay Hubara, Matthieu Courbariaux, Daniel Soudry, Ran El-Yaniv, Yoshua Bengio

Estimating or Propagating Gradients Through Stochastic Neurons for Conditional Computation

Stochastic neurons and hard non-linearities can be useful for a number of reasons in deep learning models, but in many cases they pose a challenging problem: how to estimate the gradient of a loss function with respect to the input of such…

Machine Learning · Computer Science 2013-08-16 Yoshua Bengio, Nicholas Léonard, Aaron Courville

Improving the Straight-Through Estimator with Zeroth-Order Information

We study the problem of training neural networks with quantized parameters. Learning low-precision quantized parameters by enabling computation of gradients via the Straight-Through Estimator (STE) can be challenging. While the STE enables…

Machine Learning · Computer Science 2025-10-29 Ningfeng Yang, Tor M. Aamodt

Gradient-Free Training of Quantized Neural Networks

Training neural networks requires significant computational resources and energy. Methods like mixed-precision and quantization-aware training reduce bit usage, yet they still depend heavily on computationally expensive gradient-based…

Machine Learning · Computer Science 2025-09-30 Noa Cohen, Omkar Joglekar, Dotan Di Castro, Vladimir Tchuiev, Shir Kozlovsky, Michal Moshkovitz