Related papers: Robust Quantization: One Model to Rule Them All

Robustness of Neural Networks to Parameter Quantization

Quantization, a commonly used technique to reduce the memory footprint of a neural network for edge computing, entails reducing the precision of the floating-point representation used for the parameters of the network. The impact of such…

Machine Learning · Computer Science 2019-03-27 Abhishek Murthy , Himel Das , Md Ariful Islam

Symmetry Regularization and Saturating Nonlinearity for Robust Quantization

Robust quantization improves the tolerance of networks for various implementations, allowing reliable output in different bit-widths or fragmented low-precision arithmetic. In this work, we perform extensive analyses to identify the sources…

Machine Learning · Computer Science 2022-08-02 Sein Park , Yeongsang Jang , Eunhyeok Park

RobustMQ: Benchmarking Robustness of Quantized Models

Quantization has emerged as an essential technique for deploying deep neural networks (DNNs) on devices with limited resources. However, quantized models exhibit vulnerabilities when exposed to various noises in real-world applications.…

Machine Learning · Computer Science 2023-08-07 Yisong Xiao , Aishan Liu , Tianyuan Zhang , Haotong Qin , Jinyang Guo , Xianglong Liu

Investigating the Impact of Quantization on Adversarial Robustness

Quantization is a promising technique for reducing the bit-width of deep models to improve their runtime performance and storage efficiency, and thus becomes a fundamental step for deployment. In real-world scenarios, quantized models are…

Machine Learning · Computer Science 2024-04-09 Qun Li , Yuan Meng , Chen Tang , Jiacheng Jiang , Zhi Wang

One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment

As an effective technique to achieve the implementation of deep neural networks in edge devices, model quantization has been successfully applied in many practical applications. No matter the methods of quantization aware training (QAT) or…

Computer Vision and Pattern Recognition · Computer Science 2021-05-05 Qigong Sun , Xiufang Li , Yan Ren , Zhongjian Huang , Xu Liu , Licheng Jiao , Fang Liu

Mixed-Precision Inference Quantization: Radically Towards Faster inference speed, Lower Storage requirement, and Lower Loss

Based on the model's resilience to computational noise, model quantization is important for compressing models and improving computing speed. Existing quantization techniques rely heavily on experience and "fine-tuning" skills. In the…

Machine Learning · Computer Science 2022-07-22 Daning Cheng , Wenguang Chen

RepQ: Generalizing Quantization-Aware Training for Re-Parametrized Architectures

Existing neural networks are memory-consuming and computationally intensive, making deploying them challenging in resource-constrained environments. However, there are various methods to improve their efficiency. Two such methods are…

Machine Learning · Computer Science 2023-11-10 Anastasiia Prutianova , Alexey Zaytsev , Chung-Kuei Lee , Fengyu Sun , Ivan Koryakovskiy

Robust error bounds for quantised and pruned neural networks

With the rise of smartphones and the internet-of-things, data is increasingly getting generated at the edge on local, personal devices. For privacy, latency and energy saving reasons, this shift is causing machine learning algorithms to…

Machine Learning · Computer Science 2021-04-29 Jiaqi Li , Ross Drummond , Stephen R. Duncan

DNQ: Dynamic Network Quantization

Network quantization is an effective method for the deployment of neural networks on memory and energy constrained mobile devices. In this paper, we propose a Dynamic Network Quantization (DNQ) framework which is composed of two modules: a…

Machine Learning · Computer Science 2018-12-07 Yuhui Xu , Shuai Zhang , Yingyong Qi , Jiaxian Guo , Weiyao Lin , Hongkai Xiong

A general framework for defining and optimizing robustness

Robustness of neural networks has recently attracted a great amount of interest. The many investigations in this area lack a precise common foundation of robustness concepts. Therefore, in this paper, we propose a rigorous and flexible…

Machine Learning · Computer Science 2021-06-01 Alessandro Tibo , Manfred Jaeger , Kim G. Larsen

Verification of Neural Networks: Specifying Global Robustness using Generative Models

The success of neural networks across most machine learning tasks and the persistence of adversarial examples have made the verification of such models an important quest. Several techniques have been successfully developed to verify…

Machine Learning · Computer Science 2019-10-14 Nathanaël Fijalkow , Mohit Kumar Gupta

A Survey on Methods and Theories of Quantized Neural Networks

Deep neural networks are the state-of-the-art methods for many real-world tasks, such as computer vision, natural language processing and speech recognition. For all its popularity, deep neural networks are also criticized for consuming a…

Machine Learning · Computer Science 2018-12-18 Yunhui Guo

Defensive Quantization: When Efficiency Meets Robustness

Neural network quantization is becoming an industry standard to efficiently deploy deep learning models on hardware platforms, such as CPU, GPU, TPU, and FPGAs. However, we observe that the conventional quantization approaches are…

Machine Learning · Computer Science 2019-04-19 Ji Lin , Chuang Gan , Song Han

QGen: On the Ability to Generalize in Quantization Aware Training

Quantization lowers memory usage, computational requirements, and latency by utilizing fewer bits to represent model weights and activations. In this work, we investigate the generalization properties of quantized neural networks, a…

Machine Learning · Computer Science 2024-04-22 MohammadHossein AskariHemmat , Ahmadreza Jeddi , Reyhane Askari Hemmat , Ivan Lazarevich , Alexander Hoffman , Sudhakar Sah , Ehsan Saboori , Yvon Savaria , Jean-Pierre David

Benchmarking the Robustness of Quantized Models

Quantization has emerged as an essential technique for deploying deep neural networks (DNNs) on devices with limited resources. However, quantized models exhibit vulnerabilities when exposed to various noises in real-world applications.…

Machine Learning · Computer Science 2023-04-11 Yisong Xiao , Tianyuan Zhang , Shunchang Liu , Haotong Qin

Quantization-aware Interval Bound Propagation for Training Certifiably Robust Quantized Neural Networks

We study the problem of training and certifying adversarially robust quantized neural networks (QNNs). Quantization is a technique for making neural networks more efficient by running them using low-bit integer arithmetic and is therefore…

Machine Learning · Computer Science 2022-11-30 Mathias Lechner , Đorđe Žikelić , Krishnendu Chatterjee , Thomas A. Henzinger , Daniela Rus

Gradient $\ell_1$ Regularization for Quantization Robustness

We analyze the effect of quantizing weights and activations of neural networks on their loss and derive a simple regularization scheme that improves robustness against post-training quantization. By training quantization-ready networks, our…

Machine Learning · Computer Science 2020-02-19 Milad Alizadeh , Arash Behboodi , Mart van Baalen , Christos Louizos , Tijmen Blankevoort , Max Welling

Neural Network Robustness as a Verification Property: A Principled Case Study

Neural networks are very successful at detecting patterns in noisy data, and have become the technology of choice in many fields. However, their usefulness is hampered by their susceptibility to adversarial attacks. Recently, many methods…

Machine Learning · Computer Science 2022-07-14 Marco Casadio , Ekaterina Komendantskaya , Matthew L. Daggitt , Wen Kokke , Guy Katz , Guy Amir , Idan Refaeli

On the Adversarial Robustness of Quantized Neural Networks

Reducing the size of neural network models is a critical step in moving AI from a cloud-centric to an edge-centric (i.e. on-device) compute paradigm. This shift from cloud to edge is motivated by a number of factors including reduced…

Machine Learning · Computer Science 2022-01-24 Micah Gorsline , James Smith , Cory Merkel

Vector Quantisation for Robust Segmentation

The reliability of segmentation models in the medical domain depends on the model's robustness to perturbations in the input space. Robustness is a particular challenge in medical imaging exhibiting various sources of image noise,…

Image and Video Processing · Electrical Eng. & Systems 2022-07-06 Ainkaran Santhirasekaram , Avinash Kori , Mathias Winkler , Andrea Rockall , Ben Glocker