Related papers: Adaptive Data-Free Quantization

Rethinking Data-Free Quantization as a Zero-Sum Game

Data-free quantization (DFQ) recovers the performance of quantized network (Q) without accessing the real data, but generates the fake sample via a generator (G) by learning from full-precision network (P) instead. However, such sample…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Biao Qian , Yang Wang , Richang Hong , Meng Wang

Generative Low-bitwidth Data Free Quantization

Neural network quantization is an effective way to compress deep models and improve their execution latency and energy efficiency, so that they can be deployed on mobile or embedded devices. Existing quantization methods require original…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Shoukai Xu , Haokun Li , Bohan Zhuang , Jing Liu , Jiezhang Cao , Chuangrun Liang , Mingkui Tan

ACQ: Improving Generative Data-free Quantization Via Attention Correction

Data-free quantization aims to achieve model quantization without accessing any authentic sample. It is significant in an application-oriented context involving data privacy. Converting noise vectors into synthetic samples through a…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Jixing Li , Xiaozhou Guo , Benzhe Dai , Guoliang Gong , Min Jin , Gang Chen , Wenyu Mao , Huaxiang Lu

Adaptive Dataset Quantization

Contemporary deep learning, characterized by the training of cumbersome neural networks on massive datasets, confronts substantial computational hurdles. To alleviate heavy data storage burdens on limited hardware resources, numerous…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Muquan Li , Dongyang Zhang , Qiang Dong , Xiurui Xie , Ke Qin

DFQ-ViT: Data-Free Quantization for Vision Transformers without Fine-tuning

Data-Free Quantization (DFQ) enables the quantization of Vision Transformers (ViTs) without requiring access to data, allowing for the deployment of ViTs on devices with limited resources. In DFQ, the quantization model must be calibrated…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Yujia Tong , Jingling Yuan , Tian Zhang , Jianquan Liu , Chuang Hu

RTF-Q: Efficient Unsupervised Domain Adaptation with Retraining-free Quantization

Performing unsupervised domain adaptation on resource-constrained edge devices is challenging. Existing research typically adopts architecture optimization (e.g., designing slimmable networks) but requires expensive training costs.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-16 Nanyang Du , Chen Tang , Yuxiao Jiang , Yuan Meng , Zhi Wang

MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity

Data-free quantization (DFQ) is a technique that creates a lightweight network from its full-precision counterpart without the original training data, often through a synthetic dataset. Although several DFQ methods have been proposed for…

Machine Learning · Computer Science 2025-04-15 Kanghyun Choi , Hye Yoon Lee , Dain Kwon , SunJong Park , Kyuyeun Kim , Noseong Park , Jonghyun Choi , Jinho Lee

Dataset Quantization with Active Learning based Adaptive Sampling

Deep learning has made remarkable progress recently, largely due to the availability of large, well-labeled datasets. However, the training on such datasets elevates costs and computational demands. To address this, various techniques like…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Zhenghao Zhao , Yuzhang Shang , Junyi Wu , Yan Yan

Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning

Communication of model updates between client nodes and the central aggregating server is a major bottleneck in federated learning, especially in bandwidth-limited settings and high-dimensional models. Gradient quantization is an effective…

Machine Learning · Computer Science 2021-02-10 Divyansh Jhunjhunwala , Advait Gadhikar , Gauri Joshi , Yonina C. Eldar

Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization

Data-free quantization is a task that compresses the neural network to low bit-width without access to original training data. Most existing data-free quantization methods cause severe performance degradation due to inaccurate activation…

Machine Learning · Computer Science 2022-06-23 Yefei He , Luoming Zhang , Weijia Wu , Hong Zhou

Adaptive Distribution-aware Quantization for Mixed-Precision Neural Networks

Quantization-Aware Training (QAT) is a critical technique for deploying deep neural networks on resource-constrained devices. However, existing methods often face two major challenges: the highly non-uniform distribution of activations and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-23 Shaohang Jia , Zhiyong Huang , Zhi Yu , Mingyang Hou , Shuai Miao , Han Yang

Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning

Mixed-precision quantization mostly predetermines the model bit-width settings before actual training due to the non-differential bit-width sampling process, obtaining sub-optimal performance. Worse still, the conventional static…

Artificial Intelligence · Computer Science 2023-02-10 Yingchun Wang , Jingcai Guo , Song Guo , Weizhan Zhang

AdaQAT: Adaptive Bit-Width Quantization-Aware Training

Large-scale deep neural networks (DNNs) have achieved remarkable success in many application scenarios. However, high computational complexity and energy costs of modern DNNs make their deployment on edge devices challenging. Model…

Machine Learning · Computer Science 2024-04-29 Cédric Gernigon , Silviu-Ioan Filip , Olivier Sentieys , Clément Coggiola , Mickael Bruno

Diversifying Sample Generation for Accurate Data-Free Quantization

Quantization has emerged as one of the most prevalent approaches to compress and accelerate neural networks. Recently, data-free quantization has been widely studied as a practical and promising solution. It synthesizes data for calibrating…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Xiangguo Zhang , Haotong Qin , Yifu Ding , Ruihao Gong , Qinghua Yan , Renshuai Tao , Yuhang Li , Fengwei Yu , Xianglong Liu

Data-Free Network Quantization With Adversarial Knowledge Distillation

Network quantization is an essential procedure in deep learning for development of efficient fixed-point inference models on mobile or edge platforms. However, as datasets grow larger and privacy regulations become stricter, data sharing…

Computer Vision and Pattern Recognition · Computer Science 2020-05-11 Yoojin Choi , Jihwan Choi , Mostafa El-Khamy , Jungwon Lee

SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation

Quantization of deep neural networks (DNN) has been proven effective for compressing and accelerating DNN models. Data-free quantization (DFQ) is a promising approach without the original datasets under privacy-sensitive and confidential…

Machine Learning · Computer Science 2022-02-16 Cong Guo , Yuxian Qiu , Jingwen Leng , Xiaotian Gao , Chen Zhang , Yunxin Liu , Fan Yang , Yuhao Zhu , Minyi Guo

Long-Range Zero-Shot Generative Deep Network Quantization

Quantization approximates a deep network model with floating-point numbers by the one with low bit width numbers, in order to accelerate inference and reduce computation. Quantizing a model without access to the original data, zero-shot…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Yan Luo , Yangcheng Gao , Zhao Zhang , Haijun Zhang , Mingliang Xu , Meng Wang

Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization

To obtain lower inference latency and less memory footprint of deep neural networks, model quantization has been widely employed in deep model deployment, by converting the floating points to low-precision integers. However, previous…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Yangcheng Gao , Zhao Zhang , Richang Hong , Haijun Zhang , Jicong Fan , Shuicheng Yan

Diverse Sample Generation: Pushing the Limit of Generative Data-free Quantization

Generative data-free quantization emerges as a practical compression approach that quantizes deep neural networks to low bit-width without accessing the real data. This approach generates data utilizing batch normalization (BN) statistics…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Haotong Qin , Yifu Ding , Xiangguo Zhang , Jiakai Wang , Xianglong Liu , Jiwen Lu

Enhancing Generalization in Data-free Quantization via Mixup-class Prompting

Post-training quantization (PTQ) improves efficiency but struggles with limited calibration data, especially under privacy constraints. Data-free quantization (DFQ) mitigates this by generating synthetic images using generative models such…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Jiwoong Park , Chaeun Lee , Yongseok Choi , Sein Park , Deokki Hong , Jungwook Choi