English
Related papers

Related papers: Finding Synchronization Codes to Boost Compression…

200 papers

Embedding tables are used by machine learning systems to work with categorical features. In modern Recommendation Systems, these tables can be very large, necessitating the development of new methods for fitting them in memory, even during…

Machine Learning · Computer Science 2023-10-24 Henry Ling-Hei Tsang , Thomas Dybdahl Ahle

A technique of lossless compression via substring enumeration (CSE) attains compression ratios as well as popular lossless compressors for one-dimensional (1D) sources. The CSE utilizes a probabilistic model built from the circular string…

Information Theory · Computer Science 2017-01-25 Takahiro Ota , Hiroyoshi Morita

We introduce synchronization strings as a novel way of efficiently dealing with synchronization errors, i.e., insertions and deletions. Synchronization errors are strictly more general and much harder to deal with than commonly considered…

Information Theory · Computer Science 2017-04-05 Bernhard Haeupler , Amirbehshad Shahrasbi

People tend to store a lot of files inside theirs storage. When the storage nears it limit, they then try to reduce those files size to minimum by using data compression software. In this paper we propose a new algorithm for data…

Data Structures and Algorithms · Computer Science 2012-09-06 I. Made Agus Dwi Suarjaya

Context-aware compression techniques have gained increasing attention as model sizes continue to grow, introducing computational bottlenecks that hinder efficient deployment. A structured encoding approach was proposed to selectively…

Computation and Language · Computer Science 2025-02-13 Barnaby Schmitt , Alistair Grosvenor , Matthias Cunningham , Clementine Walsh , Julius Pembrokeshire , Jonathan Teel

This paper investigates compression of data encrypted with block ciphers, such as the Advanced Encryption Standard (AES). It is shown that such data can be feasibly compressed without knowledge of the secret key. Block ciphers operating in…

Information Theory · Computer Science 2010-09-10 Demijan Klinc , Carmit Hazay , Ashish Jagmohan , Hugo Krawczyk , Tal Rabin

We study the problem of compressing a source sequence in the presence of side-information that is related to the source via insertions, deletions and substitutions. We propose a simple algorithm to compress the source sequence when the…

Information Theory · Computer Science 2016-11-15 Nan Ma , Kannan Ramchandran , David Tse

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

Self-synchronization under the presence of additive noise can be achieved by allocating a certain number of bits of each codeword as markers for synchronization. Difference systems of sets are combinatorial designs which specify the…

Information Theory · Computer Science 2013-03-19 Yuichiro Fujiwara , Vladimir D. Tonchev

Already in the 1960s, Levenshtein and others studied error-correcting codes that protect against synchronization errors, such as symbol insertions and deletions. However, despite significant efforts, progress on designing such codes has…

Information Theory · Computer Science 2021-01-05 Bernhard Haeupler , Amirbehshad Shahrasbi

Stochastic computing (SC) is a high density, low-power computation technique which encodes values as unary bitstreams instead of binary-encoded (BE) values. Practical SC implementations require deterministic or pseudo-random number…

Emerging Technologies · Computer Science 2019-02-28 Vincent T. Lee , Samuel Archibald Elliot , Armin Alaghi , Luis Ceze

Dataset Condensation (DC) aims to obtain a condensed dataset that allows models trained on the condensed dataset to achieve performance comparable to those trained on the full dataset. Recent DC approaches increasingly focus on encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Bowen Yuan , Yuxia Fu , Zijian Wang , Yadan Luo , Zi Huang

Concurrent coding is an unconventional encoding technique that simultaneously provides protection against noise, burst errors and interference. This simple-to-understand concept is investigated by distinguishing 2 types of code, open and…

Information Theory · Computer Science 2019-01-29 David M Benton

We consider a resource-limited scenario where a sensor that uses compressed sensing (CS) collects a low number of measurements in order to observe a sparse signal, and the measurements are subsequently quantized at a low bit-rate followed…

Information Theory · Computer Science 2015-06-19 Amirpasha Shirazinia , Saikat Chatterjee , Mikael Skoglund

Data compression has been widely applied in many data processing areas. Compression methods use variable-size codes with the shorter codes assigned to symbols or groups of symbols that appear in the data frequently. Fibonacci coding, as a…

Performance · Computer Science 2007-12-19 R. Baca , V. Snasel , J. Platos , M. Kratky , E. El-Qawasmeh

We discuss a method to construct quantum codes correcting amplitude damping errors via code concatenation. The inner codes are chosen as asymmetric Calderbank-Shor-Steane (CSS) codes. By concatenating with outer codes correcting symmetric…

Quantum Physics · Physics 2016-10-31 Tyler Jackson , Markus Grassl , Bei Zeng

Bit arrays, or bitmaps, are used to significantly speed up set operations in several areas, such as data warehousing, information retrieval, and data mining, to cite a few. However, bitmaps usually use a large storage space, thus requiring…

Data Structures and Algorithms · Computer Science 2015-03-14 Alessandro Colantonio , Roberto Di Pietro

Reliability is an important requirement for both communication and storage systems. Due to continuous scale down of technology multiple adjacent bits error probability increases. The data may be corrupted due soft errors. Error correction…

Hardware Architecture · Computer Science 2020-02-19 Sayan Tripathi , Jhilam Jana , Jaydeb Bhaumik

We present many new results related to reliable (interactive) communication over insertion-deletion channels. Synchronization errors, such as insertions and deletions, strictly generalize the usual symbol corruption errors and are much…

Information Theory · Computer Science 2018-03-22 Bernhard Haeupler , Amirbehshad Shahrasbi , Ellen Vitercik

Run Length Encoding(RLE) is one of the oldest algorithms for data-compression available, a method used for compression of large data into smaller and therefore more compact data. It compresses by looking at the data for repetitions of the…

Data Structures and Algorithms · Computer Science 2015-01-23 Meo Mespotine
‹ Prev 1 2 3 10 Next ›