Related papers: Partition Reduction for Lossy Data Compression Pro…

200 papers

The amount of data generated and gathered in scientific simulations and data collection applications is continuously growing, putting mounting pressure on storage and bandwidth concerns. A means of reducing such issues is data compression;…

Numerical Analysis · Mathematics 2025-05-15 Alyson Fox, Peter Lindstrom

This paper presents error-bounded lossy compression tailored for particle datasets from diverse scientific applications in cosmology, fluid dynamics, and fusion energy sciences. As today's high-performance computing capabilities advance,…

Information Theory · Computer Science 2024-04-05 Congrong Ren, Sheng Di, Longtao Zhang, Kai Zhao, Hanqi Guo

Rapidly increasing data sizes in scientific computing are the driving force behind the need for lossy compression. The main drawback of lossy data compression is the introduction of error. This paper explains why many error-bounded…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-23 Alex Fallin, Martin Burtscher

Entropy and free-energy estimation are key in thermodynamic characterization of simulated systems ranging from spin models through polymers, colloids, protein structure, and drug design. Current techniques suffer from being model specific,…

Statistical Mechanics · Physics 2019-10-30 Ram Avinery, Micha Kornreich, Roy Beck

Today's HPC applications are producing extremely large amounts of data, such that data storage and analysis are becoming more challenging for scientific research. In this work, we design a new error-controlled lossy compression algorithm…

Information Theory · Computer Science 2017-06-14 Dingwen Tao, Sheng Di, Zizhong Chen, Franck Cappello

An alternative approach to two-part 'critical compression' is presented. Whereas previous results were based on summing a lossless code at reduced precision with a lossy-compressed error or noise term, the present approach uses a similar…

Multimedia · Computer Science 2013-01-03 John Scoville

Lossy compressors are increasingly adopted in scientific research, tackling volumes of data from experiments or parallel numerical simulations and facilitating data storage and movement. In contrast with the notion of entropy in lossless…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-16 Robert Underwood, Julie Bessac, David Krasowska, Jon C. Calhoun, Sheng Di, Franck Cappello

Can we use machine learning to compress graph data? The absence of ordering in graphs poses a significant challenge to conventional compression algorithms, limiting their attainable gains as well as their ability to discover relevant…

Machine Learning · Computer Science 2023-09-26 Giorgos Bouritsas, Andreas Loukas, Nikolaos Karalias, Michael M. Bronstein

Machine learning has had a major impact on data compression over the last decade and inspired many new, exciting theoretical and applied questions. This paper describes one such direction -- relative entropy coding -- which focuses on…

Information Theory · Computer Science 2026-02-10 Gergely Flamich, Deniz Gündüz

Our increasingly digital and connected world has led to the generation of unprecedented amounts of data. This data must be efficiently managed, transmitted, and stored to preserve resources and allow scalability. Data compression has…

Information Theory · Computer Science 2025-10-09 Jonas G. Matt, Pengcheng Huang, Balz Maag

Today's scientific simulations, for example in the high-performance exascale sector, produce huge amounts of data. Due to limited I/O bandwidth and available storage space, there is the necessity to reduce scientific data of high…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-25 N. Böing, J. Holke, C. Hergl, L. Spataro, G. Gassner, A. Basermann

Comparison-based algorithms are algorithms for which the execution of each operation is solely based on the outcome of a series of comparisons between elements. Comparison-based computations can be naturally represented via the following…

Data Structures and Algorithms · Computer Science 2020-11-17 Michel Schellekens

In this paper we investigate the problem of partitioning an input string T in such a way that compressing individually its parts via a base-compressor C gets a compressed output that is shorter than applying C over the entire T at once.…

Data Structures and Algorithms · Computer Science 2009-06-26 Paolo Ferragina, Igor Nitto, Rossano Venturini

Many scientific applications opt for particles instead of meshes as their basic primitives to model complex systems composed of billions of discrete entities. Such applications span a diverse array of scientific domains, including molecular…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-04 Longtao Zhang, Ruoyu Li, Congrong Ren, Sheng Di, Jinyang Liu, Jiajun Huang, Robert Underwood, Pascal Grosset, Dingwen Tao, Xin Liang, Hanqi Guo, Franck Cappello, Kai Zhao

We formulate the problem of performing optimal data compression under the constraints that compressed data can be used for accurate classification in machine learning. We show that this translates to a problem of minimizing the mutual…

Signal Processing · Electrical Eng. & Systems 2022-11-04 Jingchao Gao, Ao Tang, Weiyu Xu

We consider a natural generalization of the Partial Vertex Cover problem. Here an instance consists of a graph $G = (V, E)$, a positive cost function $c: V \to \mathbb{Z}^{+}$, a partition $P_1, \ldots, P_r$ of the edge set $E$, and a parameter $k_i$ for each…

Data Structures and Algorithms · Computer Science 2015-03-19 Suman Kalyan Bera, Shalmoli Gupta, Amit Kumar, Sambuddha Roy

Increasing data volumes from scientific simulations and instruments (supercomputers, accelerators, telescopes) often exceed network, storage, and analysis capabilities. The scientific community's response to this challenge is scientific…

We present herein a scheme by which to accurately evaluate the error exponents of a lossy data compression problem, which characterize average probabilities over a code ensemble of compression failure and success above or below a critical…

Statistical Mechanics · Physics 2007-05-23 Tadaaki Hosaka, Yoshiyuki Kabashima

Compression of floating-point data will play an important role in high-performance computing as data bandwidth and storage become dominant costs. Lossy compression of floating-point data is powerful, but theoretical results are needed to…

Numerical Analysis · Mathematics 2024-07-03 James Diffenderfer, Alyson Fox, Jeffrey Hittinger, Geoffrey Sanders, Peter Lindstrom

The unfolding problem formulation for correcting experimental data distortions due to finite resolution and limited detector acceptance is discussed. A novel validation of the problem solution is proposed. Attention is drawn to the fact that…

Data Analysis, Statistics and Probability · Physics 2016-04-08 Nikolai Gagunashvili