English
Related papers

Related papers: Malleable Coding: Compressed Palimpsests

200 papers

In cloud computing, storage area networks, remote backup storage, and similar settings, stored data is modified with updates from new versions. Representing information and modifying the representation are both expensive. Therefore it is…

Information Theory · Computer Science 2011-05-11 Lav R. Varshney , Julius Kusuma , Vivek K Goyal

Describes a near-linear-time algorithm for a variant of Huffman coding, in which the letters may have non-uniform lengths (as in Morse code), but with the restriction that each word to be encoded has equal probability. [See also ``Huffman…

Data Structures and Algorithms · Computer Science 2015-06-02 Mordecai Golin , Neal E. Young

The explosion of the amount of data stored in cloud systems calls for more efficient paradigms for redundancy. While replication is widely used to ensure data availability, erasure correcting codes provide a much better trade-off between…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-03-16 Steve Jiekak , Anne-Marie Kermarrec , Nicolas Le Scouarnec , Gilles Straub , Alexandre Van Kempen

The eXtensible Markup Language (XML) provides a powerful and flexible means of encoding and exchanging data. As it turns out, its main advantage as an encoding format (namely, its requirement that all open and close markup tags are present…

Databases · Computer Science 2015-05-13 Gregory Leighton , Denilson Barbosa

We study the problem of designing systems in order to minimize cost while meeting a given flexibility target. Flexibility is attained by enforcing a joint chance constraint, which ensures that the system will exhibit feasible operation with…

Optimization and Control · Mathematics 2021-06-25 Joshua L. Pulsipher , Victor M. Zavala

This work considers the problem of transmitting multiple compressible sources over a network at minimum cost. The aim is to find the optimal rates at which the sources should be compressed and the network flows using which they should be…

Information Theory · Computer Science 2009-08-13 Aditya Ramamoorthy

Distributed storage systems must handle both data heterogeneity, arising from non-uniform access demands, and device heterogeneity, caused by time-varying node reliability. In this paper, we study convertible codes, which enable the…

Information Theory · Computer Science 2026-01-16 Anina Gruica , Benjamin Jany , Stanislav Kruglik

This paper investigates, from information theoretic grounds, a learning problem based on the principle that any regularity in a given dataset can be exploited to extract compact features from data, i.e., using fewer bits than needed to…

Machine Learning · Statistics 2018-11-14 Matías Vera , Leonardo Rey Vega , Pablo Piantanida

In large-scale distributed storage systems, erasure codes are used to achieve fault tolerance in the face of node failures. Tuning code parameters to observed failure rates has been shown to significantly reduce storage cost. Such tuning of…

Information Theory · Computer Science 2020-06-05 Francisco Maturana , V. S. Chaitanya Mukka , K. V. Rashmi

This paper presents prefix codes which minimize various criteria constructed as a convex combination of maximum codeword length and average codeword length or maximum redundancy and average redundancy, including a convex combination of the…

Information Theory · Computer Science 2011-02-11 Charalambos D. Charalambous , Themistoklis Charalambous , Farzad Rezaei

We consider large-scale linear inverse problems in Bayesian settings. We follow a recent line of work that applies the approximate message passing (AMP) framework to multi-processor (MP) computational systems, where each processor node…

Information Theory · Computer Science 2016-11-09 Junan Zhu , Dror Baron , Ahmad Beirami

Adaptive coding faces the following problem: given a collection of source classes such that each class in the collection has non-trivial minimax redundancy rate, can we design a single code which is asymptotically minimax over each class in…

Information Theory · Computer Science 2016-09-02 Anna Ben-Hamou , Stephane Boucheron , Elisabeth Gassiat

Distributed storage systems for large-scale applications typically use replication for reliability. Recently, erasure codes were used to reduce the large storage overhead, while increasing data reliability. A main limitation of…

Information Theory · Computer Science 2014-05-06 Dimitris S. Papailiopoulos , Alexandros G. Dimakis

The explosion in the volumes of data being stored online has resulted in distributed storage systems transitioning to erasure coding based schemes. Yet, the codes being deployed in practice are fairly short. In this work, we address what we…

Information Theory · Computer Science 2016-09-22 Parikshit Gopalan , Guangda Hu , Swastik Kopparty , Shubhangi Saraf , Carol Wang , Sergey Yekhanin

Cloud providers have recently introduced new offerings whereby spare computing resources are accessible at discounts compared to on-demand computing. Exploiting such opportunity is challenging inasmuch as such resources are accessed with…

Information Theory · Computer Science 2019-05-28 Yaoqing Yang , Matteo Interlandi , Pulkit Grover , Soummya Kar , Saeed Amizadeh , Markus Weimer

In this paper we study the tradeoff between parallelism and communication cost in a map-reduce computation. For any problem that is not "embarrassingly parallel," the finer we partition the work of the reducers so that more parallelism can…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-06-21 Foto N. Afrati , Anish Das Sarma , Semih Salihoglu , Jeffrey D. Ullman

This paper investigates the problem of variable-length lossy source coding allowing a positive excess distortion probability and an overflow probability of codeword lengths. Novel one-shot achievability and converse bounds of the optimal…

Information Theory · Computer Science 2018-12-17 Shota Saito , Hideki Yagi , Toshiyasu Matsushima

The problem of variable-rate lossless data compression is considered, for codes with and without prefix constraints. Sharp bounds are derived for the best achievable compression rate of memoryless sources, when the excess-rate probability…

Information Theory · Computer Science 2025-11-13 Andreas Theocharous , Lampros Gavalakis , Ioannis Kontoyiannis

The weighted-Hamming metric generalizes the Hamming metric by assigning different weights to blocks of coordinates. It is well-suited for applications such as coding over independent parallel channels, each of which has a different level of…

Information Theory · Computer Science 2026-01-21 Sebastian Bitzer , Alberto Ravagnani , Violetta Weger

We investigate the fundamental task of addition under uncertainty, namely, addends that are represented as intervals of numbers rather than single values. One potential source of such uncertainty can occur when obtaining discrete-valued…

Information Theory · Computer Science 2026-02-09 Johannes Bund , Christoph Lenzen , Moti Medina
‹ Prev 1 2 3 10 Next ›