English
Related papers

Related papers: Ternary and Binary Quantization for Improved Class…

200 papers

Quantization is widely applied in machine learning to reduce computational and storage costs for both data and models. Considering that classification tasks are fundamental to the field, it is crucial to investigate how quantization impacts…

Machine Learning · Computer Science 2025-07-14 Weizhi Lu , Mingrui Chen , Weiyu Li

Inference time, model size, and accuracy are critical for deploying deep neural network models. Numerous research efforts have been made to compress neural network models with faster inference and higher accuracy. Pruning and quantization…

Machine Learning · Computer Science 2023-03-06 Dan Liu , Xue Liu

Random projection is often used to project higher-dimensional vectors onto a lower-dimensional space, while approximately preserving their pairwise distances. It has emerged as a powerful tool in various data processing tasks and has…

Machine Learning · Computer Science 2020-06-30 Wenye Li , Shuzhong Zhang

Neural network models are resource hungry. It is difficult to deploy such deep networks on devices with limited resources, like smart wearables, cellphones, drones, and autonomous vehicles. Low bit quantization such as binary and ternary…

Machine Learning · Computer Science 2021-09-15 Ryan Razani , Grégoire Morin , Vahid Partovi Nia , Eyyüb Sari

Inference time, model size, and accuracy are three key factors in deep model compression. Most of the existing work addresses these three key factors separately as it is difficult to optimize them all at the same time. For example, low-bit…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Dan Liu , Xi Chen , Jie Fu , Chen Ma , Xue Liu

Model quantization enables the deployment of deep neural networks under resource-constrained devices. Vector quantization aims at reducing the model size by indexing model weights with full-precision embeddings, i.e., codewords, while the…

Computer Vision and Pattern Recognition · Computer Science 2022-12-27 Dan Liu , Xi Chen , Chen Ma , Xue Liu

In this paper, we study randomized reduction methods, which reduce high-dimensional features into low-dimensional space by randomized methods (e.g., random projection, random hashing), for large-scale high-dimensional classification.…

Machine Learning · Computer Science 2015-07-21 Tianbao Yang , Lijun Zhang , Rong Jin , Shenghuo Zhu

Most of the existing works use projection functions for ternary quantization in discrete space. Scaling factors and thresholds are used in some cases to improve the model accuracy. However, the gradients used for optimization are inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2022-12-27 Dan Liu , Xue Liu

A low precision deep neural network training technique for producing sparse, ternary neural networks is presented. The technique incorporates hard- ware implementation costs during training to achieve significant model compression for…

Computer Vision and Pattern Recognition · Computer Science 2017-10-11 Julian Faraone , Nicholas Fraser , Giulio Gambardella , Michaela Blott , Philip H. W. Leong

Traditionally, quantization is designed to minimize the reconstruction error of a data source. When considering downstream classification tasks, other measures of distortion can be of interest; such as the 0-1 classification loss.…

Machine Learning · Computer Science 2021-07-22 Daniel Severo , Elad Domanovitz , Ashish Khisti

Random projections offer an appealing and flexible approach to a wide range of large-scale statistical problems. They are particularly useful in high-dimensional settings, where we have many covariates recorded for each observation. In…

Methodology · Statistics 2019-11-26 Timothy I. Cannings

With the explosive growth of image databases, deep hashing, which learns compact binary descriptors for images, has become critical for fast image retrieval. Many existing deep hashing methods leverage quantization loss, defined as distance…

Computer Vision and Pattern Recognition · Computer Science 2017-11-01 Yuefu Zhou , Shanshan Huang , Ya Zhang , Yanfeng Wang

An effective unsupervised hashing algorithm leads to compact binary codes preserving the neighborhood structure of data as much as possible. One of the most established schemes for unsupervised hashing is to reduce the dimensionality of…

Computer Vision and Pattern Recognition · Computer Science 2021-10-04 Sobhan Hemati , H. R. Tizhoosh

Dimensionality reduction is an essential technique for multi-way large-scale data, i.e., tensor. Tensor ring (TR) decomposition has become popular due to its high representation ability and flexibility. However, the traditional TR…

Numerical Analysis · Mathematics 2024-12-20 Longhao Yuan , Chao Li , Jianting Cao , Qibin Zhao

The rapid development of large pre-trained language models has greatly increased the demand for model compression techniques, among which quantization is a popular solution. In this paper, we propose BinaryBERT, which pushes BERT…

Computation and Language · Computer Science 2021-07-23 Haoli Bai , Wei Zhang , Lu Hou , Lifeng Shang , Jing Jin , Xin Jiang , Qun Liu , Michael Lyu , Irwin King

Dimensionality reduction is an effective method for learning high-dimensional data, which can provide better understanding of decision boundaries in human-readable low-dimensional subspace. Linear methods, such as principal component…

Machine Learning · Computer Science 2020-07-09 Koji Maruhashi , Heewon Park , Rui Yamaguchi , Satoru Miyano

Binary concepts are empirically used by humans to generalize efficiently. And they are based on Bernoulli distribution which is the building block of information. These concepts span both low-level and high-level features such as "large vs…

Machine Learning · Computer Science 2023-03-23 Zizhao Hu , Mohammad Rostami

As a typical dimensionality reduction technique, random projection can be simply implemented with linear projection, while maintaining the pairwise distances of high-dimensional data with high probability. Considering this technique is…

Machine Learning · Computer Science 2014-10-14 Weizhi Lu , Weiyu Li , Kidiyo Kpalma , Joseph Ronsin

This paper, broadly speaking, covers the use of randomness in two main areas: low-rank approximation and kernel methods. Low-rank approximation is very important in numerical linear algebra. Many applications depend on matrix decomposition…

Numerical Analysis · Mathematics 2020-08-12 Rishi Advani , Madison Crim , Sean O'Hagan

Binary, or one-bit, representations of data arise naturally in many applications, and are appealing in both hardware implementations and algorithm design. In this work, we study the problem of data classification from binary data and…

Machine Learning · Computer Science 2017-07-10 Deanna Needell , Rayan Saab , Tina Woolf
‹ Prev 1 2 3 10 Next ›