Related papers: Exploring compression techniques for ROOT IO

ROOT I/O compression improvements for HEP analysis

We overview recent changes in the ROOT I/O system, increasing performance and enhancing it and improving its interaction with other data analysis ecosystems. Both the newly introduced compression algorithms, the much faster bulk I/O data…

Other Computer Science · Computer Science 2021-02-03 Oksana Shadura , Brian Paul Bockelman , Philippe Canal , Danilo Piparo , Zhe Zhang

ROOT I/O compression algorithms and their performance impact within Run 3

The LHCs Run3 will push the envelope on data-intensive workflows and, since at the lowest level this data is managed using the ROOT software framework, preparations for managing this data are starting already. At the beginning of LHC Run 1,…

Performance · Computer Science 2020-08-26 Oksana Shadura , Brian Paul Bockelman

Optimizing ROOT IO For Analysis

The ROOT I/O (RIO) subsystem is foundational to most HEP experiments - it provides a file format, a set of APIs/semantics, and a reference implementation in C++. It is often found at the base of an experiment's framework and is used to…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-08 Brian Bockelman , Zhe Zhang , Jim Pivarski

Increasing Parallelism in the ROOT I/O Subsystem

When processing large amounts of data, the rate at which reading and writing can take place is a critical factor. High energy physics data processing relying on ROOT is no exception. The recent parallelisation of LHC experiments' software…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-11 Guilherme Amadio , Brian Bockelman , Philippe Canal , Danilo Piparo , Enric Tejedor , Zhe Zhang

ROOT - A C++ Framework for Petabyte Data Storage, Statistical Analysis and Visualization

ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a…

Data Analysis, Statistics and Probability · Physics 2015-09-01 Ilka Antcheva , Maarten Ballintijn , Bertrand Bellenot , Marek Biskup , Rene Brun , Nenad Buncic , Philippe Canal , Diego Casadei , Olivier Couet , Valery Fine , Leandro Franco , Gerardo Ganis , Andrei Gheata , David Gonzalez Maline , Masaharu Goto , Jan Iwaszkiewicz , Anna Kreshuk , Diego Marcos Segura , Richard Maunder , Lorenzo Moneta , Axel Naumann , Eddy Offermann , Valeriy Onuchin , Suzanne Panacek , Fons Rademakers , Paul Russo , Matevz Tadel

Optimizing Frameworks Performance Using C++ Modules Aware ROOT

ROOT is a data analysis framework broadly used in and outside of High Energy Physics (HEP). Since HEP software frameworks always strive for performance improvements, ROOT was extended with experimental support of runtime C++ Modules. C++…

Programming Languages · Computer Science 2019-10-02 Yuka Takahashi , Vassil Vassilev , Oksana Shadura , Raphael Isemann

ROOT for the HL-LHC: data format

This document discusses the state, roadmap, and risks of the foundational components of ROOT with respect to the experiments at the HL-LHC (Run 4 and beyond). As foundational components, the document considers in particular the ROOT…

High Energy Physics - Experiment · Physics 2022-04-12 Axel Naumann , Philippe Canal , Enric Tejedor , Enrico Guiraud , Lorenzo Moneta , Bertrand Bellenot , Olivier Couet , Alja Mrak Tadel , Matevz Tadel , Sergey Linev , Javier Lopez Gomez , Jonas Rembser , Vincenzo Eduardo Padulano , Jakob Blomer , Jonas Hahnfeld , Bernhard Manfred Gruber , Vassil Vassilev

To Compress or Not To Compress: Energy Trade-Offs and Benefits of Lossy Compressed I/O

Modern scientific simulations generate massive volumes of data, creating significant challenges for I/O and storage systems. Error-bounded lossy compression (EBLC) offers a solution by reducing data set sizes while preserving data quality…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-16 Grant Wilkins , Sheng Di , Jon C. Calhoun , Robert Underwood , Franck Cappello

Speeding HEP Analysis with ROOT Bulk I/O

Distinct HEP workflows have distinct I/O needs; while ROOT I/O excels at serializing complex C++ objects common to reconstruction, analysis workflows typically have simpler objects and can sustain higher event rates. To meet these…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-26 Brian Bockelman , Zhe Zhang , Oksana Shadura

New Algorithms and Lower Bounds for Sequential-Access Data Compression

This thesis concerns sequential-access data compression, i.e., by algorithms that read the input one or more times from beginning to end. In one chapter we consider adaptive prefix coding, for which we must read the input character by…

Information Theory · Computer Science 2009-02-03 Travis Gagie

Challenges and Solutions in Selecting Optimal Lossless Data Compression Algorithms

The rapid growth of digital data has heightened the demand for efficient lossless compression methods. However, existing algorithms exhibit trade-offs: some achieve high compression ratios, others excel in encoding or decoding speed, and…

Information Theory · Computer Science 2025-10-01 Md. Atiqur Rahman , MM Fazle Rabbi

Access Time Tradeoffs in Archive Compression

Web archives, query and proxy logs, and so on, can all be very large and highly repetitive; and are accessed only sporadically and partially, rather than continually and holistically. This type of data is ideal for compression-based…

Information Theory · Computer Science 2016-03-01 Matthias Petri , Alistair Moffat , P. C. Nagesh , Anthony Wirth

Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing

Advancements in deep learning are often associated with increasing model sizes. The model size dramatically affects the deployment cost and latency of deep models. For instance, models like BERT cannot be deployed on edge devices and…

Machine Learning · Computer Science 2022-07-25 Aditya Desai , Keren Zhou , Anshumali Shrivastava

Compression Performance Analysis of Different File Formats

In data storage and transmission, file compression is a common technique for reducing the volume of data, reducing data storage space and transmission time and bandwidth. However, there are significant differences in the compression…

Other Computer Science · Computer Science 2023-08-24 Han Yang , Guangjun Qin , Yongqing Hu

Examining Interplay of Compression and Encryption and Applicability to 5G Teleoperations

Modern IoT and networked systems rely on fast and secure delivery of time-critical information. Use cases such as teleoperations require fast data delivery over mobile networks, which despite improvements in 5G are still quite constrained.…

Networking and Internet Architecture · Computer Science 2023-11-21 Duncan Joly , Jason Carpenter , Zhi-Li Zhang

The Energy-Throughput Trade-off in Lossless-Compressed Source Code Storage

Retrieving data from large-scale source code archives is vital for AI training, neural-based software analysis, and information retrieval, to cite a few. This paper studies and experiments with the design of a compressed key-value store for…

Data Structures and Algorithms · Computer Science 2026-01-21 Paolo Ferragina , Francesco Tosoni

Consistently faster and smaller compressed bitmaps with Roaring

Compressed bitmap indexes are used in databases and search engines. Many bitmap compression techniques have been proposed, almost all relying primarily on run-length encoding (RLE). However, on unsorted data, we can get superior performance…

Databases · Computer Science 2018-03-05 Daniel Lemire , Gregory Ssi-Yan-Kai , Owen Kaser

Prototyping a ROOT-based distributed analysis workflow for HL-LHC: the CMS use case

The challenges expected for the next era of the Large Hadron Collider (LHC), both in terms of storage and computing resources, provide LHC experiments with a strong motivation for evaluating ways of rethinking their computing models at many…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-25 Tommaso Tedeschi , Vincenzo Eduardo Padulano , Daniele Spiga , Diego Ciangottini , Mirco Tracolli , Enric Tejedor Saavedra , Enrico Guiraud , Massimo Biasotto

The Impact of Data Compression in Real-Time and Historical Data Acquisition Systems on the Accuracy of Analytical Solutions

In industrial and IoT environments, massive amounts of real-time and historical process data are continuously generated and archived. With sensors and devices capturing every operational detail, the volume of time-series data has become a…

Databases · Computer Science 2025-11-03 Reham Faqehi , Haya Alhuraib , Hamad Saiari , Zyad Bamigdad

Optimal Rate-Distortion-Leakage Tradeoff for Single-Server Information Retrieval

Private information retrieval protocols guarantee that a user can privately and losslessly retrieve a single file from a database stored across multiple servers. In this work, we propose to simultaneously relax the conditions of perfect…

Information Theory · Computer Science 2022-01-07 Yauhen Yakimenka , Hsuan-Yin Lin , Eirik Rosnes , Jörg Kliewer