Related papers: Sharpness-Aware Data Generation for Zero-shot Quan…

Genie: Show Me the Data for Quantization

Zero-shot quantization is a promising approach for developing lightweight deep neural networks when data is inaccessible owing to various reasons, including cost and issues related to privacy. By exploiting the learned parameters ($\mu$ and…

Machine Learning · Computer Science 2023-08-09 Yongkweon Jeon , Chungman Lee , Ho-young Kim

Zero-Shot Sharpness-Aware Quantization for Pre-trained Language Models

Quantization is a promising approach for reducing memory overhead and accelerating inference, especially in large pre-trained language model (PLM) scenarios. While having no access to original training data due to security and privacy…

Computation and Language · Computer Science 2023-10-23 Miaoxi Zhu , Qihuang Zhong , Li Shen , Liang Ding , Juhua Liu , Bo Du , Dacheng Tao

Long-Range Zero-Shot Generative Deep Network Quantization

Quantization approximates a deep network model with floating-point numbers by the one with low bit width numbers, in order to accelerate inference and reduce computation. Quantizing a model without access to the original data, zero-shot…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Yan Luo , Yangcheng Gao , Zhao Zhang , Haijun Zhang , Mingliang Xu , Meng Wang

Zero-shot Adversarial Quantization

Model quantization is a promising approach to compress deep neural networks and accelerate inference, making it possible to be deployed on mobile and edge devices. To retain the high performance of full-precision models, most existing…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Yuang Liu , Wei Zhang , Jun Wang

Generative Zero-shot Network Quantization

Convolutional neural networks are able to learn realistic image priors from numerous training samples in low-level image generation and restoration. We show that, for high-level image recognition tasks, we can further reconstruct…

Computer Vision and Pattern Recognition · Computer Science 2021-01-22 Xiangyu He , Qinghao Hu , Peisong Wang , Jian Cheng

ZeroQ: A Novel Zero Shot Quantization Framework

Quantization is a promising approach for reducing the inference time and memory footprint of neural networks. However, most existing quantization methods require access to the original training dataset for retraining during quantization.…

Computer Vision and Pattern Recognition · Computer Science 2020-03-29 Yaohui Cai , Zhewei Yao , Zhen Dong , Amir Gholami , Michael W. Mahoney , Kurt Keutzer

SQuAT: Sharpness- and Quantization-Aware Training for BERT

Quantization is an effective technique to reduce memory footprint, inference latency, and power consumption of deep learning models. However, existing quantization methods suffer from accuracy degradation compared to full-precision (FP)…

Machine Learning · Computer Science 2022-10-14 Zheng Wang , Juncheng B Li , Shuhui Qu , Florian Metze , Emma Strubell

Infrared Domain Adaptation with Zero-Shot Quantization

Quantization is one of the most popular techniques for reducing computation time and shrinking model size. However, ensuring the accuracy of quantized models typically involves calibration using training data, which may be inaccessible due…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Burak Sevsay , Erdem Akagündüz

Zero-shot Quantization: A Comprehensive Survey

Network quantization has proven to be a powerful approach to reduce the memory and computational demands of deep learning models for deployment on resource-constrained devices. However, traditional quantization methods often rely on access…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Minjun Kim , Jaehyeon Choi , Jongkeun Lee , Wonjin Cho , U Kang

Hard Sample Matters a Lot in Zero-Shot Quantization

Zero-shot quantization (ZSQ) is promising for compressing and accelerating deep neural networks when the data for training full-precision models are inaccessible. In ZSQ, network quantization is performed using synthetic samples, thus, the…

Computer Vision and Pattern Recognition · Computer Science 2023-03-27 Huantong Li , Xiangmiao Wu , Fanbing Lv , Daihai Liao , Thomas H. Li , Yonggang Zhang , Bo Han , Mingkui Tan

It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher

Model quantization is considered as a promising method to greatly reduce the resource requirements of deep neural networks. To deal with the performance drop induced by quantization errors, a popular method is to use training data to…

Computer Vision and Pattern Recognition · Computer Science 2022-04-04 Kanghyun Choi , Hye Yoon Lee , Deokki Hong , Joonsang Yu , Noseong Park , Youngsok Kim , Jinho Lee

SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning

How can we accurately quantize a pre-trained model without any data? Quantization algorithms are widely used for deploying neural networks on resource-constrained edge devices. Zero-shot Quantization (ZSQ) addresses the crucial and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Minjun Kim , Jongjin Kim , U Kang

Data Generation for Hardware-Friendly Post-Training Quantization

Zero-shot quantization (ZSQ) using synthetic data is a key approach for post-training quantization (PTQ) under privacy and security constraints. However, existing data generation methods often struggle to effectively generate data suitable…

Machine Learning · Computer Science 2025-02-06 Lior Dikstein , Ariel Lapid , Arnon Netzer , Hai Victor Habi

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Quantization is a key technique to reduce network size and computational complexity by representing the network parameters with a lower precision. Traditional quantization methods rely on access to original training data, which is often…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Changhao Li , Xinrui Chen , Ji Wang , Kang Zhao , Jianfei Chen

Learning from Loss Landscape: Generalizable Mixed-Precision Quantization via Adaptive Sharpness-Aware Gradient Aligning

Mixed Precision Quantization (MPQ) has become an essential technique for optimizing neural network by determining the optimal bitwidth per layer. Existing MPQ methods, however, face a major hurdle: they require a computationally expensive…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Lianbo Ma , Jianlun Ma , Yuee Zhou , Guoyang Xie , Qiang He , Zhichao Lu

Sharpness-Aware Minimization for Efficiently Improving Generalization

In today's heavily overparameterized models, the value of the training loss provides few guarantees on model generalization ability. Indeed, optimizing only the training loss value, as is commonly done, can easily lead to suboptimal model…

Machine Learning · Computer Science 2021-04-30 Pierre Foret , Ariel Kleiner , Hossein Mobahi , Behnam Neyshabur

Post-training Model Quantization Using GANs for Synthetic Data Generation

Quantization is a widely adopted technique for deep neural networks to reduce the memory and computational resources required. However, when quantized, most models would need a suitable calibration process to keep their performance intact,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-11 Athanasios Masouris , Mansi Sharma , Adrian Boguszewski , Alexander Kozlov , Zhuo Wu , Raymond Lo

Integer-only Zero-shot Quantization for Efficient Speech Recognition

End-to-end neural network models achieve improved performance on various automatic speech recognition (ASR) tasks. However, these models perform poorly on edge hardware due to large memory and computation requirements. While quantizing…

Audio and Speech Processing · Electrical Eng. & Systems 2022-02-01 Sehoon Kim , Amir Gholami , Zhewei Yao , Nicholas Lee , Patrick Wang , Aniruddha Nrusimha , Bohan Zhai , Tianren Gao , Michael W. Mahoney , Kurt Keutzer

Zero-shot Learning with Class Description Regularization

The purpose of generative Zero-shot learning (ZSL) is to learning from seen classes, transfer the learned knowledge, and create samples of unseen classes from the description of these unseen categories. To achieve better ZSL accuracies,…

Computer Vision and Pattern Recognition · Computer Science 2021-07-01 Shayan Kousha , Marcus A. Brubaker

A Generalization Theory for Zero-Shot Prediction

A modern paradigm for generalization in machine learning and AI consists of pre-training a task-agnostic foundation model, generally obtained using self-supervised and multimodal contrastive learning. The resulting representations can be…

Machine Learning · Statistics 2025-09-03 Ronak Mehta , Zaid Harchaoui