English
Related papers

Related papers: Data Generation for Hardware-Friendly Post-Trainin…

200 papers

Zero-shot quantization (ZSQ) is promising for compressing and accelerating deep neural networks when the data for training full-precision models are inaccessible. In ZSQ, network quantization is performed using synthetic samples, thus, the…

Computer Vision and Pattern Recognition · Computer Science 2023-03-27 Huantong Li , Xiangmiao Wu , Fanbing Lv , Daihai Liao , Thomas H. Li , Yonggang Zhang , Bo Han , Mingkui Tan

Quantization approximates a deep network model with floating-point numbers by the one with low bit width numbers, in order to accelerate inference and reduce computation. Quantizing a model without access to the original data, zero-shot…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Yan Luo , Yangcheng Gao , Zhao Zhang , Haijun Zhang , Mingliang Xu , Meng Wang

Quantization is a key technique to reduce network size and computational complexity by representing the network parameters with a lower precision. Traditional quantization methods rely on access to original training data, which is often…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Changhao Li , Xinrui Chen , Ji Wang , Kang Zhao , Jianfei Chen

Quantization is a widely adopted technique for deep neural networks to reduce the memory and computational resources required. However, when quantized, most models would need a suitable calibration process to keep their performance intact,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-11 Athanasios Masouris , Mansi Sharma , Adrian Boguszewski , Alexander Kozlov , Zhuo Wu , Raymond Lo

How can we accurately quantize a pre-trained model without any data? Quantization algorithms are widely used for deploying neural networks on resource-constrained edge devices. Zero-shot Quantization (ZSQ) addresses the crucial and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Minjun Kim , Jongjin Kim , U Kang

Quantization has emerged as one of the most prevalent approaches to compress and accelerate neural networks. Recently, data-free quantization has been widely studied as a practical and promising solution. It synthesizes data for calibrating…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Xiangguo Zhang , Haotong Qin , Yifu Ding , Ruihao Gong , Qinghua Yan , Renshuai Tao , Yuhang Li , Fengwei Yu , Xianglong Liu

Learning to synthesize data has emerged as a promising direction in zero-shot quantization (ZSQ), which represents neural networks by low-bit integer without accessing any of the real data. In this paper, we observe an interesting…

Computer Vision and Pattern Recognition · Computer Science 2022-03-11 Yunshan Zhong , Mingbao Lin , Gongrui Nan , Jianzhuang Liu , Baochang Zhang , Yonghong Tian , Rongrong Ji

Network quantization has proven to be a powerful approach to reduce the memory and computational demands of deep learning models for deployment on resource-constrained devices. However, traditional quantization methods often rely on access…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Minjun Kim , Jaehyeon Choi , Jongkeun Lee , Wonjin Cho , U Kang

Zero-shot quantization aims to learn a quantized model from a pre-trained full-precision model with no access to original real training data. The common idea in zero-shot quantization approaches is to generate synthetic data for quantizing…

Machine Learning · Computer Science 2025-10-09 Dung Hoang-Anh , Cuong Pham Trung Le , Jianfei Cai , Thanh-Toan Do

Zero-shot quantization (ZSQ) enables neural network compression without original training data, making it a promising solution for restricted data access scenarios. To compensate for the lack of data, recent ZSQ methods typically rely on…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Inpyo Hong , Youngwan Jo , Hyojeong Lee , Sunghyun Ahn , Kijung Lee , Sanghyun Park

Zero-shot quantization is a promising approach for developing lightweight deep neural networks when data is inaccessible owing to various reasons, including cost and issues related to privacy. By exploiting the learned parameters ($\mu$ and…

Machine Learning · Computer Science 2023-08-09 Yongkweon Jeon , Chungman Lee , Ho-young Kim

Generative data-free quantization emerges as a practical compression approach that quantizes deep neural networks to low bit-width without accessing the real data. This approach generates data utilizing batch normalization (BN) statistics…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Haotong Qin , Yifu Ding , Xiangguo Zhang , Jiakai Wang , Xianglong Liu , Jiwen Lu

Model quantization is a promising approach to compress deep neural networks and accelerate inference, making it possible to be deployed on mobile and edge devices. To retain the high performance of full-precision models, most existing…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Yuang Liu , Wei Zhang , Jun Wang

Existing quantization aware training methods attempt to compensate for the quantization loss by leveraging on training data, like most of the post-training quantization methods, and are also time consuming. Both these methods are not…

Computer Vision and Pattern Recognition · Computer Science 2020-12-29 Tej pratap GVSL , Raja Kumar

Machine learning, particularly deep learning, is transforming industrial quality inspection. Yet, training robust machine learning models typically requires large volumes of high-quality labeled data, which are expensive, time-consuming,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Ruo-Syuan Mei , Sixian Jia , Guangze Li , Soo Yeon Lee , Brian Musser , William Keller , Sreten Zakula , Jorge Arinez , Chenhui Shao

Convolutional neural networks are able to learn realistic image priors from numerous training samples in low-level image generation and restoration. We show that, for high-level image recognition tasks, we can further reconstruct…

Computer Vision and Pattern Recognition · Computer Science 2021-01-22 Xiangyu He , Qinghao Hu , Peisong Wang , Jian Cheng

Post-training quantization (PTQ) improves efficiency but struggles with limited calibration data, especially under privacy constraints. Data-free quantization (DFQ) mitigates this by generating synthetic images using generative models such…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Jiwoong Park , Chaeun Lee , Yongseok Choi , Sein Park , Deokki Hong , Jungwook Choi

Quantizing the floating-point weights and activations of deep convolutional neural networks to fixed-point representation yields reduced memory footprints and inference time. Recently, efforts have been afoot towards zero-shot quantization…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Prasen Kumar Sharma , Arun Abraham , Vikram Nelvoy Rajendiran

Denoising diffusion (score-based) generative models have recently achieved significant accomplishments in generating realistic and diverse data. These approaches define a forward diffusion process for transforming data into noise and a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Yuzhang Shang , Zhihang Yuan , Bin Xie , Bingzhe Wu , Yan Yan

In the realm of deep neural network deployment, low-bit quantization presents a promising avenue for enhancing computational efficiency. However, it often hinges on the availability of training data to mitigate quantization errors, a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Yuhang Li , Youngeun Kim , Donghyun Lee , Souvik Kundu , Priyadarshini Panda
‹ Prev 1 2 3 10 Next ›