Related papers: About adaptive coding on countable alphabets

Pattern Coding Meets Censoring: (almost) Adaptive Coding on Countable Alphabets

Adaptive coding faces the following problem: given a collection of source classes such that each class in the collection has non-trivial minimax redundancy rate, can we design a single code which is asymptotically minimax over each class in…

Information Theory · Computer Science 2016-09-02 Anna Ben-Hamou, Stephane Boucheron, Elisabeth Gassiat

Coding on countably infinite alphabets

This paper describes universal lossless coding strategies for compressing sources on countably infinite alphabets. Classes of memoryless sources defined by an envelope condition on the marginal distribution provide benchmarks for coding…

Statistics Theory · Mathematics 2015-01-05 Stéphane Boucheron, Aurélien Garivier, Elisabeth Gassiat

About Adaptive Coding on Countable Alphabets: Max-Stable Envelope Classes

In this paper, we study the problem of lossless universal source coding for stationary memoryless sources on countably infinite alphabets. This task is generally not achievable without restricting the class of sources over which…

Information Theory · Computer Science 2015-04-20 Stephane Boucheron, Elisabeth Gassiat, Mesrob I. Ohannessian

Universal Coding on Infinite Alphabets: Exponentially Decreasing Envelopes

This paper deals with the problem of universal lossless coding on a countable infinite alphabet. It focuses on some classes of sources defined by an envelope condition on the marginal distribution, namely exponentially decreasing envelope…

Information Theory · Computer Science 2011-07-07 Dominique Bontemps

Universal Weak Variable-Length Source Coding on Countable Infinite Alphabets

Motivated by the fact that universal source coding on countably infinite alphabets is not feasible, this work introduces the notion of almost lossless source coding. Analogous to the weak variable-length source coding problem studied by Han…

Information Theory · Computer Science 2021-11-30 Jorge F. Silva, Pablo Piantanida

Universal Lossless Compression with Unknown Alphabets - The Average Case

Universal compression of patterns of sequences generated by independently identically distributed (i.i.d.) sources with unknown, possibly large, alphabets is investigated. A pattern is a sequence of indices that contains all consecutive…

Information Theory · Computer Science 2016-11-17 Gil I. Shamir

Applications of Universal Source Coding to Statistical Analysis of Time Series

We show how universal codes can be used for solving some of the most important statistical problems for time series. By definition, a universal code (or a universal lossless data compressor) can compress any sequence generated by a…

Information Theory · Computer Science 2008-09-09 Boris Ryabko

Universal Compression of Envelope Classes: Tight Characterization via Poisson Sampling

The Poisson-sampling technique eliminates dependencies among symbol appearances in a random sequence. It has been used to simplify the analysis and strengthen the performance guarantees of randomized algorithms. Applying this method to…

Information Theory · Computer Science 2014-05-30 Jayadev Acharya, Ashkan Jafarpour, Alon Orlitsky, Ananda Theertha Suresh

On redundancy of memoryless sources over countable alphabets

The minimum average number of bits needed to describe a random variable is its entropy, assuming knowledge of the underlying statistics. On the other hand, universal compression supposes that the distribution of the random variable, while…

Information Theory · Computer Science 2014-04-02 Maryam Hosseini, Narayana Santhanam

Practical Binary Adaptive Block Coder

This paper describes the design of a low-complexity algorithm for adaptive encoding/decoding of binary sequences produced by memoryless sources. The algorithm implements universal block codes constructed for a set of contexts identified by the…

Information Theory · Computer Science 2007-07-13 Yuriy A. Reznik

Universal Source Coding for Monotonic and Fast Decaying Monotonic Distributions

We study universal compression of sequences generated by monotonic distributions. We show that for a monotonic distribution over an alphabet of size $k$, each probability parameter costs essentially $0.5 \log (n/k^3)$ bits, where $n$ is the…

Information Theory · Computer Science 2007-07-13 Gil I. Shamir

Universal Variable-to-Fixed Length Lossy Compression at Finite Blocklengths

We consider universal variable-to-fixed length compression of memoryless sources with a fidelity criterion. We design a dictionary codebook over the reproduction alphabet which is used to parse the source stream. Once a source subsequence…

Information Theory · Computer Science 2022-11-24 Nematollah Iri

$D$-semifaithful codes that are universal over both memoryless sources and distortion measures

We prove the existence of codebooks for d-semifaithful lossy compression that are simultaneously universal with respect to both the class of finite-alphabet memoryless sources and the class of all bounded additive distortion measures. By…

Information Theory · Computer Science 2022-03-08 Neri Merhav

Universal Densities Exist for Every Finite Reference Measure

As it is known, universal codes, which estimate the entropy rate consistently, exist for stationary ergodic sources over finite alphabets but not over countably infinite ones. We generalize universal coding as the problem of universal…

Information Theory · Computer Science 2023-08-01 Łukasz Dębowski

Tight Bounds on the Average Length, Entropy, and Redundancy of Anti-Uniform Huffman Codes

In this paper we consider the class of anti-uniform Huffman codes and derive tight lower and upper bounds on the average length, entropy, and redundancy of such codes in terms of the alphabet size of the source. The Fibonacci distributions…

Information Theory · Computer Science 2012-07-27 Soheil Mohajer, Ali Kakhbod

Asymptotics and Non-asymptotics for Universal Fixed-to-Variable Source Coding

Universal fixed-to-variable lossless source coding for memoryless sources is studied in the finite blocklength and higher-order asymptotics regimes. Optimal third-order coding rates are derived for general fixed-to-variable codes and for…

Information Theory · Computer Science 2014-12-16 Oliver Kosut, Lalitha Sankar

New Algorithms and Lower Bounds for Sequential-Access Data Compression

This thesis concerns sequential-access data compression, i.e., by algorithms that read the input one or more times from beginning to end. In one chapter we consider adaptive prefix coding, for which we must read the input character by…

Information Theory · Computer Science 2009-02-03 Travis Gagie

ABC: Attention with Bounded-memory Control

Transformer architectures have achieved state-of-the-art results on a variety of sequence modeling tasks. However, their attention mechanism comes with a quadratic complexity in sequence lengths, making the computational overhead…

Computation and Language · Computer Science 2022-06-03 Hao Peng, Jungo Kasai, Nikolaos Pappas, Dani Yogatama, Zhaofeng Wu, Lingpeng Kong, Roy Schwartz, Noah A. Smith

Improved Redundancy Bounds for Exponential Objectives

We present new lower and upper bounds for the compression rate of binary prefix codes optimized over memoryless sources according to two related exponential codeword length objectives. The objectives explored here are exponential-average…

Information Theory · Computer Science 2011-05-03 Michael B. Baer

Study On Universal Lossless Data Compression by using Context Dependence Multilevel Pattern Matching Grammar Transform

In this paper, the context dependence multilevel pattern matching (CDMPM for short) grammar transform is proposed; based on this grammar transform, a universal lossless data compression algorithm, the CDMPM code, is then developed. Moreover, we…

Discrete Mathematics · Computer Science 2013-03-21 Chung-Song Kim, Chol-Hun Kim