Related papers: Automatic Quantization for Physics-Based Simulatio…

StatQAT: Statistical Quantizer Optimization for Deep Networks

Quantization is essential for reducing the computational cost and memory usage of deep neural networks, enabling efficient inference on low-precision hardware. Despite the growing adoption of uniform and floating-point quantization schemes,…

Machine Learning · Statistics 2026-05-19 Mehmet Aktukmak , Daniel Huang , Ke Ding

Randomizing multi-product formulas for Hamiltonian simulation

Quantum simulation, the simulation of quantum processes on quantum computers, suggests a path forward for the efficient simulation of problems in condensed-matter physics, quantum chemistry, and materials science. While the majority of…

Quantum Physics · Physics 2022-10-03 Paul K. Faehrmann , Mark Steudtner , Richard Kueng , Maria Kieferova , Jens Eisert

On the efficient representation and execution of deep acoustic models

In this paper we present a simple and computationally efficient quantization scheme that enables us to reduce the resolution of the parameters of a neural network from 32-bit floating point values to 8-bit integer values. The proposed…

Machine Learning · Computer Science 2016-12-20 Raziel Alvarez , Rohit Prabhavalkar , Anton Bakhtin

Compute-Optimal Quantization-Aware Training

Quantization-aware training (QAT) is a leading technique for improving the accuracy of quantized neural networks. Previous work has shown that decomposing training into a full-precision (FP) phase followed by a QAT phase yields superior…

Machine Learning · Computer Science 2026-02-27 Aleksandr Dremov , David Grangier , Angelos Katharopoulos , Awni Hannun

Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes

Quantization is a popular technique that $transforms$ the parameter representation of a neural network from floating-point numbers into lower-precision ones ($e.g.$, 8-bit integers). It reduces the memory footprint and the computational…

Machine Learning · Computer Science 2021-11-12 Sanghyun Hong , Michael-Andrei Panaitescu-Liess , Yiğitcan Kaya , Tudor Dumitraş

Optimal Quantization of Signals for System Identification

In this paper, we examine the optimal quantization of signals for system identification. We deal with memoryless quantization for the output signals and derive the optimal quantization schemes. The objective functions are the errors of…

Optimization and Control · Mathematics 2009-05-13 Koji Tsumura

A Precise Error Bound for Quantum Phase Estimation

Quantum phase estimation is one of the key algorithms in the field of quantum computing, but up until now, only approximate expressions have been derived for the probability of error. We revisit these derivations, and find that by ensuring…

Quantum Physics · Physics 2012-02-13 James M. Chappell , Max A. Lohe , Lorenz von Smekal , Azhar Iqbal , Derek Abbott

STQuant: Spatio-Temporal Adaptive Framework for Optimizer Quantization in Large Multimodal Model Training

Quantization is an effective way to reduce the memory cost of large-scale model training. However, most existing methods adopt fixed-precision policies, which ignore the fact that optimizer-state distributions vary significantly across…

Machine Learning · Computer Science 2026-04-10 Minglu Liu , Cunchen Hu , Liangliang Xu , Fengming Tang , Ruijia Wang , Fu Yu

Reliable Characterization for Improving and Validating Accurate Quantum Operations

A reliable method for characterizing quantum operations that is suitable for improving and validating their accuracies is indispensable for realizing a practical quantum computer. Known methods are still not sufficient because they lack…

Quantum Physics · Physics 2021-06-25 Takanori Sugiyama , Shinpei Imori , Fuyuhiko Tanaka

Quantization Design for Distributed Optimization

We consider the problem of solving a distributed optimization problem using a distributed computing platform, where the communication in the network is limited: each node can only communicate with its neighbours and the channel has a…

Systems and Control · Computer Science 2015-04-10 Ye Pu , Melanie N. Zeilinger , Colin N. Jones

Adaptive Quantization for Deep Neural Network

In recent years Deep Neural Networks (DNNs) have been rapidly developed in various applications, together with increasingly complex architectures. The performance gain of these DNNs generally comes with high computational costs and large…

Machine Learning · Computer Science 2017-12-05 Yiren Zhou , Seyed-Mohsen Moosavi-Dezfooli , Ngai-Man Cheung , Pascal Frossard

Quantum-Enhanced Simulation-Based Optimization

In this paper, we introduce a quantum-enhanced algorithm for simulation-based optimization. Simulation-based optimization seeks to optimize an objective function that is computationally expensive to evaluate exactly, and thus, is…

Quantum Physics · Physics 2021-03-08 Julien Gacon , Christa Zoufal , Stefan Woerner

Unbounded memory advantage in stochastic simulation using quantum mechanics

Simulating the stochastic evolution of real quantities on a digital computer requires a trade-off between the precision to which these quantities are approximated, and the memory required to store them. The statistical accuracy of the…

Quantum Physics · Physics 2017-10-16 Andrew J. P. Garner , Qing Liu , Jayne Thompson , Vlatko Vedral , Mile Gu

Effective Quantization for Diffusion Models on CPUs

Diffusion models have gained popularity for generating images from textual descriptions. Nonetheless, the substantial need for computational resources continues to present a noteworthy challenge, contributing to time-consuming processes.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Hanwen Chang , Haihao Shen , Yiyang Cai , Xinyu Ye , Zhenzhong Xu , Wenhua Cheng , Kaokao Lv , Weiwei Zhang , Yintong Lu , Heng Guo

Optimal and Near-Optimal Adaptive Vector Quantization

Quantization is a fundamental optimization for many machine-learning use cases, including compressing gradients, model weights and activations, and datasets. The most accurate form of quantization is \emph{adaptive}, where the error is…

Machine Learning · Computer Science 2025-08-01 Ran Ben-Basat , Yaniv Ben-Itzhak , Michael Mitzenmacher , Shay Vargaftik

SPFQ: A Stochastic Algorithm and Its Error Analysis for Neural Network Quantization

Quantization is a widely used compression method that effectively reduces redundancies in over-parameterized neural networks. However, existing quantization techniques for deep neural networks often lack a comprehensive error analysis due…

Machine Learning · Computer Science 2023-09-21 Jinjie Zhang , Rayan Saab

Error-mitigated digital quantum simulation

Variational algorithms may enable classically intractable simulations on near-future quantum computers. However, their potential is limited by hardware errors. It is therefore crucial to develop efficient ways to mitigate these errors.…

Quantum Physics · Physics 2019-05-29 Sam McArdle , Xiao Yuan , Simon Benjamin

Enabling Accuracy-Aware Quantum Compilers using Symbolic Resource Estimation

Approximation errors must be taken into account when compiling quantum programs into a low-level gate set. We present a methodology that tracks such errors automatically and then optimizes accuracy parameters to guarantee a specified…

Quantum Physics · Physics 2021-01-06 Giulia Meuli , Mathias Soeken , Martin Roetteler , Thomas Häner

Quantized Neural Networks for Low-Precision Accumulation with Guaranteed Overflow Avoidance

We introduce a quantization-aware training algorithm that guarantees avoiding numerical overflow when reducing the precision of accumulators during inference. We leverage weight normalization as a means of constraining parameters during…

Machine Learning · Computer Science 2023-02-01 Ian Colbert , Alessandro Pappalardo , Jakoba Petri-Koenig

Superior memory efficiency of quantum devices for the simulation of continuous-time stochastic processes

Continuous-time stochastic processes pervade everyday experience, and the simulation of models of these processes is of great utility. Classical models of systems operating in continuous-time must typically track an unbounded amount of…

Quantum Physics · Physics 2018-03-05 Thomas J. Elliott , Mile Gu