Related papers: SFVInt: Simple, Fast and Generic Variable-Length I…

Vectorized VByte Decoding

We consider the ubiquitous technique of VByte compression, which represents each integer as a variable length sequence of bytes. The low 7 bits of each byte encode a portion of the integer, and the high bit of each byte is reserved as a…

Information Retrieval · Computer Science 2017-01-17 Jeff Plaisance , Nathan Kurz , Daniel Lemire

Decoding billions of integers per second through vectorization

In many important applications -- such as search engines and relational database systems -- data is stored in the form of arrays of integers. Encoding and, most importantly, decoding of these arrays consumes considerable CPU time.…

Information Retrieval · Computer Science 2021-02-02 Daniel Lemire , Leonid Boytsov

Refining Datapath for Microscaling ViTs

Vision Transformers (ViTs) leverage the transformer architecture to effectively capture global context, demonstrating strong performance in computer vision tasks. A major challenge in ViT hardware acceleration is that the model family…

Hardware Architecture · Computer Science 2025-06-17 Can Xiao , Jianyi Cheng , Aaron Zhao

Stream VByte: Faster Byte-Oriented Integer Compression

Arrays of integers are often compressed in search engines. Though there are many ways to compress integers, we are interested in the popular byte-oriented integer compression techniques (e.g., VByte or Google's Varint-GB). They are…

Information Retrieval · Computer Science 2017-10-11 Daniel Lemire , Nathan Kurz , Christoph Rupp

Faster Base64 Encoding and Decoding Using AVX2 Instructions

Web developers use base64 formats to include images, fonts, sounds and other resources directly inside HTML, JavaScript, JSON and XML files. We estimate that billions of base64 messages are decoded every day. We are motivated to improve the…

Mathematical Software · Computer Science 2026-04-07 Wojciech Muła , Daniel Lemire

FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models

Singular Value Decomposition (SVD) has recently seen a surge of interest as a simple yet powerful tool for large language models (LLMs) compression, with a growing number of works demonstrating 20-80% parameter reductions at minimal…

Machine Learning · Computer Science 2025-08-05 Zishan Shao , Yixiao Wang , Qinsi Wang , Ting Jiang , Zhixu Du , Hancheng Ye , Danyang Zhuo , Yiran Chen , Hai Li

EFTViT: Efficient Federated Training of Vision Transformers with Masked Images on Resource-Constrained Clients

Federated learning research has recently shifted from Convolutional Neural Networks (CNNs) to Vision Transformers (ViTs) due to their superior capacity. ViTs training demands higher computational resources due to the lack of 2D inductive…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Meihan Wu , Tao Chang , Cui Miao , Jie Zhou , Chun Li , Xiangyu Xu , Ming Li , Xiaodong Wang

BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding

The latest video coding standard, Versatile Video Coding (VVC), achieves almost twice coding efficiency compared to its predecessor, the High Efficiency Video Coding (HEVC). However, achieving this efficiency (for intra coding) requires 31x…

Multimedia · Computer Science 2022-12-13 Farhad Pakdaman , Mohammad Ali Adelimanesh , Mahmoud Reza Hashemi

Upscaledb: Efficient Integer-Key Compression in a Key-Value Store using SIMD Instructions

Compression can sometimes improve performance by making more of the data available to the processors faster. We consider the compression of integer keys in a B+-tree index. For this purpose, systems such as IBM DB2 use variable-byte…

Databases · Computer Science 2017-01-18 Daniel Lemire , Christoph Rupp

SPINT: Spatial Permutation-Invariant Neural Transformer for Consistent Intracortical Motor Decoding

Intracortical Brain-Computer Interfaces (iBCI) aim to decode behavior from neural population activity, enabling individuals with motor impairments to regain motor functions and communication abilities. A key challenge in long-term iBCI is…

Neurons and Cognition · Quantitative Biology 2025-07-14 Trung Le , Hao Fang , Jingyuan Li , Tung Nguyen , Lu Mi , Amy Orsborn , Uygar Sümbül , Eli Shlizerman

FineViT: Progressively Unlocking Fine-Grained Perception with Dense Recaptions

While Multimodal Large Language Models (MLLMs) have experienced rapid advancements, their visual encoders frequently remain a performance bottleneck. Conventional CLIP-based encoders struggle with dense spatial tasks due to the loss of…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Peisen Zhao , Xiaopeng Zhang , Mingxing Xu , Ruoyu Sun , Zewei Du , Dunzheng Wang , Guanghao Zheng , Haohang Xu , Zhibo Zhang , Yuhang Zhang , Yi Ai , Lin Liu , Qi Tian

Towards Accurate and Efficient Sub-8-Bit Integer Training

Neural network training is a memory- and compute-intensive task. Quantization, which enables low-bitwidth formats in training, can significantly mitigate the workload. To reduce quantization error, recent methods have developed new data…

Machine Learning · Computer Science 2024-11-19 Wenjin Guo , Donglai Liu , Weiying Xie , Yunsong Li , Xuefei Ning , Zihan Meng , Shulin Zeng , Jie Lei , Zhenman Fang , Yu Wang

SPViT: Enabling Faster Vision Transformers via Soft Token Pruning

Recently, Vision Transformer (ViT) has continuously established new milestones in the computer vision field, while the high computation and memory cost makes its propagation in industrial production difficult. Pruning, a traditional model…

Computer Vision and Pattern Recognition · Computer Science 2022-09-22 Zhenglun Kong , Peiyan Dong , Xiaolong Ma , Xin Meng , Mengshu Sun , Wei Niu , Xuan Shen , Geng Yuan , Bin Ren , Minghai Qin , Hao Tang , Yanzhi Wang

SigVIC: Spatial Importance Guided Variable-Rate Image Compression

Variable-rate mechanism has improved the flexibility and efficiency of learning-based image compression that trains multiple models for different rate-distortion tradeoffs. One of the most common approaches for variable-rate is to…

Image and Video Processing · Electrical Eng. & Systems 2023-03-17 Jiaming Liang , Meiqin Liu , Chao Yao , Chunyu Lin , Yao Zhao

LeViT-UNet: Make Faster Encoders with Transformer for Medical Image Segmentation

Medical image segmentation plays an essential role in developing computer-assisted diagnosis and therapy systems, yet still faces many challenges. In the past few years, the popular encoder-decoder architectures based on CNNs (e.g., U-Net)…

Computer Vision and Pattern Recognition · Computer Science 2021-07-20 Guoping Xu , Xingrong Wu , Xuan Zhang , Xinwei He

Rice-Marlin Codes: Tiny and Efficient Variable-to-Fixed Codes

Marlin is a Variable-to-Fixed (VF) codec optimized for high decoding speed through the use of small sized dictionaries that fit in the L1 cache of most CPUs. While the size of Marlin dictionaries is adequate for decoding, they are still too…

Information Theory · Computer Science 2018-11-15 Manuel Martinez , Joan Serra-Sagristà

LLaVA-UHD v4: What Makes Efficient Visual Encoding in MLLMs?

Visual encoding constitutes a major computational bottleneck in Multimodal Large Language Models (MLLMs), especially for high-resolution image inputs. The prevailing practice typically adopts global encoding followed by post-ViT…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Kechen Fang , Yihua Qin , Chongyi Wang , Wenshuo Ma , Tianyu Yu , Yuan Yao

SegViTv2: Exploring Efficient and Continual Semantic Segmentation with Plain Vision Transformers

This paper investigates the capability of plain Vision Transformers (ViTs) for semantic segmentation using the encoder-decoder framework and introduces \textbf{SegViTv2}. In this study, we introduce a novel Attention-to-Mask (\atm) module…

Computer Vision and Pattern Recognition · Computer Science 2023-08-31 Bowen Zhang , Liyang Liu , Minh Hieu Phan , Zhi Tian , Chunhua Shen , Yifan Liu

Base64 encoding and decoding at almost the speed of a memory copy

Many common document formats on the Internet are text-only such as email (MIME) and the Web (HTML, JavaScript, JSON and XML). To include images or executable code in these documents, we first encode them as text using base64. Standard…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-01-08 Wojciech Muła , Daniel Lemire

SSIM-Variation-Based Complexity Optimization for Versatile Video Coding

To date, Versatile Video Coding (VVC) has a more magnificent overall performance than High Efficiency Video Coding (HEVC). The Quadtree with Nested Multi-Type Tree (QTMT) coding block structure can substantially enhance video coding quality…

Multimedia · Computer Science 2023-01-18 Jielian Lin , Hongbin Lin , Zhichen Zhang , Yiwen Xu , Tiesong Zhao