Related papers: Adaptive Dataset Quantization

200 papers

State-of-the-art deep neural networks are trained with large amounts (millions or even billions) of data. The expensive computation and memory costs make it difficult to train them on limited hardware resources, especially for recent…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Daquan Zhou, Kai Wang, Jianyang Gu, Xiangyu Peng, Dongze Lian, Yifan Zhang, Yang You, Jiashi Feng

Deep learning has made remarkable progress recently, largely due to the availability of large, well-labeled datasets. However, the training on such datasets elevates costs and computational demands. To address this, various techniques like…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Zhenghao Zhao, Yuzhang Shang, Junyi Wu, Yan Yan

This paper addresses the challenges of storage and communication costs for large-scale datasets in resource-constrained edge devices by proposing a novel dataset quantization approach to reduce intra-sample redundancy. Unlike traditional…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Chenyue Yu, Jianyu Yu

Dataset Distillation (DD) compresses large datasets into compact synthetic ones that maintain training performance. However, current methods mainly target sample reduction, with limited consideration of data precision and its impact on…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 My H. Dinh, Aditya Sant, Akshay Malhotra, Keya Patani, Shahab Hamidi-Rad

Quantization-Aware Training (QAT) is a critical technique for deploying deep neural networks on resource-constrained devices. However, existing methods often face two major challenges: the highly non-uniform distribution of activations and…

Computer Vision and Pattern Recognition · Computer Science 2025-10-23 Shaohang Jia, Zhiyong Huang, Zhi Yu, Mingyang Hou, Shuai Miao, Han Yang

Large-scale image datasets are fundamental to deep learning, but their high storage demands pose challenges for deployment in resource-constrained environments. While existing approaches reduce dataset size by discarding samples, they often…

Computer Vision and Pattern Recognition · Computer Science 2026-03-03 Chenyue Yu, Lingao Xiao, Jinhong Deng, Ivor W. Tsang, Yang He

Model quantization is challenging due to many tedious hyper-parameters such as precision (bitwidth), dynamic range (minimum and maximum discrete values) and stepsize (interval between discrete values). Unlike prior arts that carefully tune…

Machine Learning · Computer Science 2021-07-08 Zhang Zhaoyang, Shao Wenqi, Gu Jinwei, Wang Xiaogang, Luo Ping

Large-scale deep neural networks (DNNs) have achieved remarkable success in many application scenarios. However, high computational complexity and energy costs of modern DNNs make their deployment on edge devices challenging. Model…

Machine Learning · Computer Science 2024-04-29 Cédric Gernigon, Silviu-Ioan Filip, Olivier Sentieys, Clément Coggiola, Mickael Bruno

Data-free quantization (DFQ) recovers the performance of quantized network (Q) without the original data, but generates the fake sample via a generator (G) by learning from full-precision network (P), which, however, is totally independent…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Biao Qian, Yang Wang, Richang Hong, Meng Wang

In recent years Deep Neural Networks (DNNs) have been rapidly developed in various applications, together with increasingly complex architectures. The performance gain of these DNNs generally comes with high computational costs and large…

Machine Learning · Computer Science 2017-12-05 Yiren Zhou, Seyed-Mohsen Moosavi-Dezfooli, Ngai-Man Cheung, Pascal Frossard

Quantization-aware training (QAT) is a representative model compression method to reduce redundancy in weights and activations. However, most existing QAT methods require end-to-end training on the entire dataset, which suffers from long…

Machine Learning · Computer Science 2024-08-21 Xijie Huang, Zechun Liu, Shih-Yang Liu, Kwang-Ting Cheng

We investigate the compression of deep neural networks by quantizing their weights and activations into multiple binary bases, known as multi-bit networks (MBNs), which accelerate the inference and reduce the storage for the deployment on…

Computer Vision and Pattern Recognition · Computer Science 2020-07-07 Zhongnan Qu, Zimu Zhou, Yun Cheng, Lothar Thiele

Quantization is a fundamental optimization for many machine-learning use cases, including compressing gradients, model weights and activations, and datasets. The most accurate form of quantization is adaptive, where the error is…

Machine Learning · Computer Science 2025-08-01 Ran Ben-Basat, Yaniv Ben-Itzhak, Michael Mitzenmacher, Shay Vargaftik

Quantization is a technique for reducing deep neural networks (DNNs) training and inference times, which is crucial for training in resource constrained environments or applications where inference is time critical. State-of-the-art (SOTA)…

Machine Learning · Computer Science 2023-05-24 Lorenz Kummer, Kevin Sidak, Tabea Reichmann, Wilfried Gansterer

Quantization is one of the most applied Deep Neural Network (DNN) compression strategies, when deploying a trained DNN model on an embedded system or a cell phone. This is owing to its simplicity and adaptability to a wide range of…

Machine Learning · Computer Science 2022-10-10 Ahmed Haj Yahmed, Houssem Ben Braiek, Foutse Khomh, Sonia Bouzidi, Rania Zaatour

Hardware-friendly network quantization (e.g., binary/uniform quantization) can efficiently accelerate the inference and meanwhile reduce memory consumption of the deep neural networks, which is crucial for model deployment on…

Computer Vision and Pattern Recognition · Computer Science 2019-08-15 Ruihao Gong, Xianglong Liu, Shenghu Jiang, Tianxiang Li, Peng Hu, Jiazhen Lin, Fengwei Yu, Junjie Yan

Quantization has been an effective technology in ANN (approximate nearest neighbour) search due to its high accuracy and fast search speed. To meet the requirement of different applications, there is always a trade-off between retrieval…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Jingkuan Song, Xiaosu Zhu, Lianli Gao, Xin-Shun Xu, Wu Liu, Heng Tao Shen

Mixed-precision quantization mostly predetermines the model bit-width settings before actual training due to the non-differential bit-width sampling process, obtaining sub-optimal performance. Worse still, the conventional static…

Artificial Intelligence · Computer Science 2023-02-10 Yingchun Wang, Jingcai Guo, Song Guo, Weizhan Zhang

We present APQ for efficient deep learning inference on resource-constrained hardware. Unlike previous methods that separately search the neural architecture, pruning policy, and quantization policy, we optimize them in a joint manner. To…

Machine Learning · Computer Science 2020-06-16 Tianzhe Wang, Kuan Wang, Han Cai, Ji Lin, Zhijian Liu, Song Han

As Deep Neural Networks (DNNs) usually are overparameterized and have millions of weight parameters, it is challenging to deploy these large DNN models on resource-constrained hardware platforms, e.g., smartphones. Numerous network…

Computer Vision and Pattern Recognition · Computer Science 2022-05-24 Peng Hu, Xi Peng, Hongyuan Zhu, Mohamed M. Sabry Aly, Jie Lin