Related papers: Sequential Universal Modeling for Non-Binary Seque…

A Family of LZ78-based Universal Sequential Probability Assignments

We propose and study a family of universal sequential probability assignments on individual sequences, based on the incremental parsing procedure of the Lempel-Ziv (LZ78) compression algorithm. We show that the normalized log loss under any…

Information Theory · Computer Science 2025-12-15 Naomi Sagan, Tsachy Weissman

Compressing combinatorial objects

Most of the world's digital data is currently encoded in a sequential form, and compression methods for sequences have been studied extensively. However, there are many types of non-sequential data for which good compression techniques are…

Information Theory · Computer Science 2016-01-15 Christian Steinruecken

Nonparametric Decentralized Sequential Detection via Universal Source Coding

We consider the nonparametric or universal sequential hypothesis testing problem in which the distribution under the null hypothesis is fully known but the alternate hypothesis corresponds to some other unknown distribution. These algorithms are…

Information Theory · Computer Science 2013-08-30 Jithin K. Sreedharan, Vinod Sharma

On Finite Memory Universal Data Compression and Classification of Individual Sequences

Consider the case where consecutive blocks of N letters of a semi-infinite individual sequence X over a finite alphabet are being compressed into binary sequences by some one-to-one mapping. No a-priori information about X is available at…

Information Theory · Computer Science 2013-01-25 Jacob Ziv

Universal Lossless Compression with Unknown Alphabets - The Average Case

Universal compression of patterns of sequences generated by independently identically distributed (i.i.d.) sources with unknown, possibly large, alphabets is investigated. A pattern is a sequence of indices that contains all consecutive…

Information Theory · Computer Science 2016-11-17 Gil I. Shamir

Compression-based methods for nonparametric density estimation, on-line prediction, regression and classification for time series

We address the problem of nonparametric estimation of characteristics for stationary and ergodic time series. We consider finite-alphabet time series and real-valued ones and the following four problems: i) estimation of the (limiting)…

Information Theory · Computer Science 2007-11-01 Boris Ryabko

Combining non-stationary prediction, optimization and mixing for data compression

In this paper an approach to modelling nonstationary binary sequences, i.e., predicting the probability of upcoming symbols, is presented. After studying the prediction model we evaluate its performance in two non-artificial test cases.…

Information Theory · Computer Science 2013-02-13 Christopher Mattern

Sequential Change Detection through Empirical Distribution and Universal Codes

Universal compression algorithms have been studied in the past for sequential change detection, where they have been used to estimate the post-change distribution in the modified version of the Cumulative Sum (CUSUM) Test. In this paper, we…

Information Theory · Computer Science 2021-12-15 Vikrant Malik, R. K. Bansal

Universal compression of Gaussian sources with unknown parameters

For a collection of distributions over a countable support set, the worst case universal compression formulation by Shtarkov attempts to assign a universal distribution over the support set. The formulation aims to ensure that the universal…

Information Theory · Computer Science 2014-10-17 A. Orlitsky, N. Santhanam

Universal Graph Compression: Stochastic Block Models

Motivated by the prevalent data science applications of processing large-scale graph data such as social networks and biological networks, this paper investigates lossless compression of data in the form of a labeled graph. Particularly, we…

Information Theory · Computer Science 2024-05-24 Alankrita Bhatt, Ziao Wang, Chi Wang, Lele Wang

Convergence and Error Bounds for Universal Prediction of Nonbinary Sequences

Solomonoff's uncomputable universal prediction scheme $\xi$ allows one to predict the next symbol $x_k$ of a sequence $x_1...x_{k-1}$ for any Turing computable, but otherwise unknown, probabilistic environment $\mu$. This scheme will be…

Machine Learning · Computer Science 2007-05-23 Marcus Hutter

New Algorithms and Lower Bounds for Sequential-Access Data Compression

This thesis concerns sequential-access data compression, i.e., by algorithms that read the input one or more times from beginning to end. In one chapter we consider adaptive prefix coding, for which we must read the input character by…

Information Theory · Computer Science 2009-02-03 Travis Gagie

Unified Compression Algorithm for Distributed Nonconvex Optimization: Generalized to 1-Bit, Saturation, and Bounded Noise

In this paper, we propose a unified compression algorithm for distributed nonconvex optimization with both the locally- and globally-bounded communication compressors, including 1-bit compressors, saturating quantizers, and the…

Optimization and Control · Mathematics 2026-04-14 Haonan Wang, Minghui Liwang, Yiguang Hong, Karl H. Johansson, Xinlei Yi

A Unified Approach to Universal Prediction: Generalized Upper and Lower Bounds

We study sequential prediction of real-valued, arbitrary and unknown sequences under the squared error loss as well as the best parametric predictor out of a large, continuous class of predictors. Inspired by recent results from…

Machine Learning · Computer Science 2014-01-24 N. Denizcan Vanli, Suleyman S. Kozat

Integer Set Compression and Statistical Modeling

Compression of integer sets and sequences has been extensively studied for settings where elements follow a uniform probability distribution. In addition, methods exist that exploit clustering of elements in order to achieve higher…

Information Theory · Computer Science 2014-02-11 N. Jesper Larsson

Universal coding, intrinsic volumes, and metric complexity

We study sequential probability assignment in the Gaussian setting, where the goal is to predict, or equivalently compress, a sequence of real-valued observations almost as well as the best Gaussian distribution with mean constrained to a…

Information Theory · Computer Science 2025-05-27 Jaouad Mourtada

Universal Source Coding for Monotonic and Fast Decaying Monotonic Distributions

We study universal compression of sequences generated by monotonic distributions. We show that for a monotonic distribution over an alphabet of size $k$, each probability parameter costs essentially $0.5 \log (n/k^3)$ bits, where $n$ is the…

Information Theory · Computer Science 2007-07-13 Gil I. Shamir

A Universal Non-Parametric Approach For Improved Molecular Sequence Analysis

In the field of biological research, it is essential to comprehend the characteristics and functions of molecular sequences. The classification of molecular sequences has seen widespread use of neural network-based techniques. Despite their…

Machine Learning · Computer Science 2024-02-14 Sarwan Ali, Tamkanat E Ali, Prakash Chourasia, Murray Patterson

Communication Compression for Distributed Nonconvex Optimization

This paper considers distributed nonconvex optimization with the cost functions being distributed over agents. Noting that information compression is a key tool to reduce the heavy communication load for distributed algorithms as agents…

Optimization and Control · Mathematics 2022-10-10 Xinlei Yi, Shengjun Zhang, Tao Yang, Tianyou Chai, Karl H. Johansson

Compression Algorithm Based on Irregular Sequence

The paper introduces a new lossless, highly robust compression algorithm that is similar to the LZW algorithm, yet it discards dictionary processing and uses irregular sequences with massive, random information instead. Then the paper…

Signal Processing · Electrical Eng. & Systems 2020-06-24 Rui Zhu