English
Related papers

Related papers: Exploring compression techniques for ROOT IO

200 papers

We overview recent changes in the ROOT I/O system, increasing performance and enhancing it and improving its interaction with other data analysis ecosystems. Both the newly introduced compression algorithms, the much faster bulk I/O data…

Other Computer Science · Computer Science 2021-02-03 Oksana Shadura , Brian Paul Bockelman , Philippe Canal , Danilo Piparo , Zhe Zhang

The LHCs Run3 will push the envelope on data-intensive workflows and, since at the lowest level this data is managed using the ROOT software framework, preparations for managing this data are starting already. At the beginning of LHC Run 1,…

Performance · Computer Science 2020-08-26 Oksana Shadura , Brian Paul Bockelman

The ROOT I/O (RIO) subsystem is foundational to most HEP experiments - it provides a file format, a set of APIs/semantics, and a reference implementation in C++. It is often found at the base of an experiment's framework and is used to…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-08 Brian Bockelman , Zhe Zhang , Jim Pivarski

When processing large amounts of data, the rate at which reading and writing can take place is a critical factor. High energy physics data processing relying on ROOT is no exception. The recent parallelisation of LHC experiments' software…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-11 Guilherme Amadio , Brian Bockelman , Philippe Canal , Danilo Piparo , Enric Tejedor , Zhe Zhang

ROOT is an object-oriented C++ framework conceived in the high-energy physics (HEP) community, designed for storing and analyzing petabytes of data in an efficient way. Any instance of a C++ class can be stored into a ROOT file in a…

ROOT is a data analysis framework broadly used in and outside of High Energy Physics (HEP). Since HEP software frameworks always strive for performance improvements, ROOT was extended with experimental support of runtime C++ Modules. C++…

Programming Languages · Computer Science 2019-10-02 Yuka Takahashi , Vassil Vassilev , Oksana Shadura , Raphael Isemann

This document discusses the state, roadmap, and risks of the foundational components of ROOT with respect to the experiments at the HL-LHC (Run 4 and beyond). As foundational components, the document considers in particular the ROOT…

Modern scientific simulations generate massive volumes of data, creating significant challenges for I/O and storage systems. Error-bounded lossy compression (EBLC) offers a solution by reducing data set sizes while preserving data quality…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-16 Grant Wilkins , Sheng Di , Jon C. Calhoun , Robert Underwood , Franck Cappello

Distinct HEP workflows have distinct I/O needs; while ROOT I/O excels at serializing complex C++ objects common to reconstruction, analysis workflows typically have simpler objects and can sustain higher event rates. To meet these…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-08-26 Brian Bockelman , Zhe Zhang , Oksana Shadura

This thesis concerns sequential-access data compression, i.e., by algorithms that read the input one or more times from beginning to end. In one chapter we consider adaptive prefix coding, for which we must read the input character by…

Information Theory · Computer Science 2009-02-03 Travis Gagie

The rapid growth of digital data has heightened the demand for efficient lossless compression methods. However, existing algorithms exhibit trade-offs: some achieve high compression ratios, others excel in encoding or decoding speed, and…

Information Theory · Computer Science 2025-10-01 Md. Atiqur Rahman , MM Fazle Rabbi

Web archives, query and proxy logs, and so on, can all be very large and highly repetitive; and are accessed only sporadically and partially, rather than continually and holistically. This type of data is ideal for compression-based…

Information Theory · Computer Science 2016-03-01 Matthias Petri , Alistair Moffat , P. C. Nagesh , Anthony Wirth

Advancements in deep learning are often associated with increasing model sizes. The model size dramatically affects the deployment cost and latency of deep models. For instance, models like BERT cannot be deployed on edge devices and…

Machine Learning · Computer Science 2022-07-25 Aditya Desai , Keren Zhou , Anshumali Shrivastava

In data storage and transmission, file compression is a common technique for reducing the volume of data, reducing data storage space and transmission time and bandwidth. However, there are significant differences in the compression…

Other Computer Science · Computer Science 2023-08-24 Han Yang , Guangjun Qin , Yongqing Hu

Modern IoT and networked systems rely on fast and secure delivery of time-critical information. Use cases such as teleoperations require fast data delivery over mobile networks, which despite improvements in 5G are still quite constrained.…

Networking and Internet Architecture · Computer Science 2023-11-21 Duncan Joly , Jason Carpenter , Zhi-Li Zhang

Retrieving data from large-scale source code archives is vital for AI training, neural-based software analysis, and information retrieval, to cite a few. This paper studies and experiments with the design of a compressed key-value store for…

Data Structures and Algorithms · Computer Science 2026-01-21 Paolo Ferragina , Francesco Tosoni

Compressed bitmap indexes are used in databases and search engines. Many bitmap compression techniques have been proposed, almost all relying primarily on run-length encoding (RLE). However, on unsorted data, we can get superior performance…

Databases · Computer Science 2018-03-05 Daniel Lemire , Gregory Ssi-Yan-Kai , Owen Kaser

The challenges expected for the next era of the Large Hadron Collider (LHC), both in terms of storage and computing resources, provide LHC experiments with a strong motivation for evaluating ways of rethinking their computing models at many…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-25 Tommaso Tedeschi , Vincenzo Eduardo Padulano , Daniele Spiga , Diego Ciangottini , Mirco Tracolli , Enric Tejedor Saavedra , Enrico Guiraud , Massimo Biasotto

In industrial and IoT environments, massive amounts of real-time and historical process data are continuously generated and archived. With sensors and devices capturing every operational detail, the volume of time-series data has become a…

Databases · Computer Science 2025-11-03 Reham Faqehi , Haya Alhuraib , Hamad Saiari , Zyad Bamigdad

Private information retrieval protocols guarantee that a user can privately and losslessly retrieve a single file from a database stored across multiple servers. In this work, we propose to simultaneously relax the conditions of perfect…

Information Theory · Computer Science 2022-01-07 Yauhen Yakimenka , Hsuan-Yin Lin , Eirik Rosnes , Jörg Kliewer
‹ Prev 1 2 3 10 Next ›