Related papers: Huffman coding as an algorithm to construct chains…

Source Coding for Quasiarithmetic Penalties

Huffman coding finds a prefix code that minimizes mean codeword length for a given probability distribution over a finite number of items. Campbell generalized the Huffman problem to a family of problems in which the goal is to minimize not…

Information Theory · Computer Science 2007-07-16 Michael B. Baer

Tales of Huffman

We study the new problem of Huffman-like codes subject to individual restrictions on the code-word lengths of a subset of the source words. These are prefix codes with minimal expected code-word length for a random source where additionally…

Information Theory · Computer Science 2007-07-13 Paul M. B. Vitanyi , Zvi Lotker

Abstract Huffman Coding and PIFO Tree Embeddings

Algorithms for deriving Huffman codes and the recently developed algorithm for compiling PIFO trees to trees of fixed shape (Mohan et al. 2022) are similar, but work with different underlying algebraic operations. In this paper, we exploit…

Information Theory · Computer Science 2023-01-10 Keri D'Angelo , Dexter Kozen

Prefix Codes: Equiprobable Words, Unequal Letter Costs

Describes a near-linear-time algorithm for a variant of Huffman coding, in which the letters may have non-uniform lengths (as in Morse code), but with the restriction that each word to be encoded has equal probability. [See also ``Huffman…

Data Structures and Algorithms · Computer Science 2015-06-02 Mordecai Golin , Neal E. Young

Quantum-inspired Huffman Coding

Huffman Compression, also known as Huffman Coding, is one of many compression techniques in use today. The two important features of Huffman coding are instantaneousness that is the codes can be interpreted as soon as they are received and…

Information Theory · Computer Science 2013-02-22 A. S. Tolba , M. Z. Rashad , M. A. El-Dosuky

Optimal Skeleton Huffman Trees Revisited

A skeleton Huffman tree is a Huffman tree in which all disjoint maximal perfect subtrees are shrunk into leaves. Skeleton Huffman trees, besides saving storage space, are also used for faster decoding and for speeding up Huffman-shaped…

Data Structures and Algorithms · Computer Science 2020-03-25 Dmitry Kosolobov , Oleg Merkurev

Reserved-Length Prefix Coding

Huffman coding finds an optimal prefix code for a given probability mass function. Consider situations in which one wishes to find an optimal code with the restriction that all codewords have lengths that lie in a user-specified set of…

Information Theory · Computer Science 2008-01-03 Michael B. Baer

Efficient algorithms for modifying and sampling from a categorical distribution

Probabilistic programming languages and other machine learning applications often require samples to be generated from a categorical distribution where the probability of each one of $n$ categories is specified as a parameter. If the…

Data Structures and Algorithms · Computer Science 2019-06-28 Daniel Tang

Accelerating Lossless Data Compression with GPUs

Huffman compression is a statistical, lossless, data compression algorithm that compresses data by assigning variable length codes to symbols, with the more frequently appearing symbols given shorter codes than the less. This work is a…

Information Theory · Computer Science 2011-07-11 R. L. Cloud , M. L. Curry , H. L. Ward , A. Skjellum , P. Bangalore

$D$-ary Bounded-Length Huffman Coding

Efficient optimal prefix coding has long been accomplished via the Huffman algorithm. However, there is still room for improvement and exploration regarding variants of the Huffman problem. Length-limited Huffman coding, useful for many…

Information Theory · Computer Science 2007-07-13 Michael B. Baer

Source Coding with Free Bits and the Multi-Way Number Partitioning Problem

We introduce a new variant of variable-length source coding for sending a source over two parallel channels, one of which is costly and the other free. We give a complete solution to this problem. Next, we relate the problem to the number…

Data Structures and Algorithms · Computer Science 2026-01-30 Niloufar Ahmadypour , Amin Gohari

Space-Efficient Huffman Codes Revisited

Canonical Huffman code is an optimal prefix-free compression code whose codewords enumerated in the lexicographical order form a list of binary words in non-decreasing lengths. Gagie et al. (2015) gave a representation of this coding…

Data Structures and Algorithms · Computer Science 2021-08-19 Szymon Grabowski , Dominik Köppl

A Generic Top-Down Dynamic-Programming Approach to Prefix-Free Coding

Given a probability distribution over a set of n words to be transmitted, the Huffman Coding problem is to find a minimal-cost prefix free code for transmitting those words. The basic Huffman coding problem can be solved in O(n log n) time…

Data Structures and Algorithms · Computer Science 2008-09-29 Mordecai Golin , Xiaoming Xu , Jiajin Yu

Short Huffman Codes Producing 1s Half of the Time

The design of the channel part of a digital communication system (e.g., error correction, modulation) is heavily based on the assumption that the data to be transmitted forms a fair bit stream. However, simple source encoders such as short…

Information Theory · Computer Science 2011-07-25 Fabian Altenbach , Georg Böcherer , Rudolf Mathar

Revisiting Huffman Coding: Toward Extreme Performance on Modern GPU Architectures

Today's high-performance computing (HPC) applications are producing vast volumes of data, which are challenging to store and transfer efficiently during the execution, such that data compression is becoming a critical technique to mitigate…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-03-02 Jiannan Tian , Cody Rivera , Sheng Di , Jieyang Chen , Xin Liang , Dingwen Tao , Franck Cappello

Polynomial Time Algorithms for Constructing Optimal Binary AIFV-$2$ Codes

Huffman Codes are optimal Instantaneous Fixed-to-Variable (FV) codes in which every source symbol can only be encoded by one codeword. Relaxing these constraints permits constructing better FV codes. More specifically, recent work has shown…

Information Theory · Computer Science 2020-01-31 Mordecai Golin , Elfarouk Harb

Weighted Adaptive Coding

Huffman coding is known to be optimal, yet its dynamic version may be even more efficient in practice. A new variant of Huffman encoding has been proposed recently, that provably always performs better than static Huffman coding by at least…

Data Structures and Algorithms · Computer Science 2020-05-19 Aharon Fruchtman , Yoav Gross , Shmuel T. Klein , Dana Shapira

Huffman Coding with Letter Costs: A Linear-Time Approximation Scheme

We give a polynomial-time approximation scheme for the generalization of Huffman Coding in which codeword letters have non-uniform costs (as in Morse code, where the dash is twice as long as the dot). The algorithm computes a…

Data Structures and Algorithms · Computer Science 2015-06-02 Mordecai Golin , Claire Mathieu , Neal E. Young

Twenty (or so) Questions: $D$-ary Length-Bounded Prefix Coding

Efficient optimal prefix coding has long been accomplished via the Huffman algorithm. However, there is still room for improvement and exploration regarding variants of the Huffman problem. Length-limited Huffman coding, useful for many…

Information Theory · Computer Science 2007-07-13 Michael B. Baer

Optimal Prefix Free Code in Linear Time

We describe an algorithm computing an optimal prefix free code from $N$ unsorted positive integer weights in time linear in the number of machine words holding those weights. This algorithm takes advantage of common non-algebraic…

Data Structures and Algorithms · Computer Science 2017-03-02 Jérémy Barbay