English
Related papers

Related papers: Data Compression for Analytics over Large-scale In…

200 papers

Data compression is widely used in contemporary column-oriented DBMSes to lower space usage and to speed up query processing. Pioneering systems have introduced compression to tackle the disk bandwidth bottleneck by trading CPU processing…

Databases · Computer Science 2021-05-20 Alexander Slesarev , Evgeniy Klyuchikov , Kirill Smirnov , George Chernishev

The multidimensional databases often use compression techniques in order to decrease the size of the database. This paper introduces a new method called difference sequence compression. Under some conditions, this new technique is able to…

Databases · Computer Science 2011-04-28 István Szépkúti

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

One utilisation of multidimensional databases is the field of On-line Analytical Processing (OLAP). The applications in this area are designed to make the analysis of shared multidimensional information fast [9]. On one hand, speed can be…

Databases · Computer Science 2011-05-04 István Szépkúti

Today, with the growing demands of information storage and data transfer, data compression is becoming increasingly important. Data Compression is a technique which is used to decrease the size of data. This is very useful when some huge…

Information Theory · Computer Science 2025-06-13 Mohammad Hosseini

Growing main memory sizes have facilitated database management systems that keep the entire database in main memory. The drastic performance improvements that came along with these in-memory systems have made it possible to reunite the two…

Databases · Computer Science 2012-08-02 Florian Funke , Alfons Kemper , Thomas Neumann

Modern RDBMSs support the ability to compress data using methods such as null suppression and dictionary encoding. Data compression offers the promise of significantly reducing storage requirements and improving I/O performance for decision…

Databases · Computer Science 2011-09-06 Hideaki Kimura , Vivek Narasayya , Manoj Syamala

Large Language Models (LLMs) can enhance analytics systems with powerful data summarization, cleaning, and semantic transformation capabilities. However, deploying LLMs at scale -- processing millions to billions of rows -- remains…

Databases · Computer Science 2025-07-08 Bardia Mohammadi , Laurent Bindschaedler

In this research paper so as to handle Data in warehousing as well as reduce the wastage of data and provide a better results which takes more and more turn into a focal point of the data source business. Data warehousing and on-line…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-12-28 Ahmed Mateen , Lareab Chaudhary

In this thesis, we describe a new, practical approach to integrating hardware-based data compression within the memory hierarchy, including on-chip caches, main memory, and both on-chip and off-chip interconnects. This new approach is fast,…

Hardware Architecture · Computer Science 2016-09-08 Gennady Pekhimenko

Users of MapReduce often run into performance problems when they scale up their workloads. Many of the problems they encounter can be overcome by applying techniques learned from over three decades of research on parallel DBMSs. However,…

Databases · Computer Science 2011-05-24 Avrilia Floratou , Jignesh Patel , Eugene Shekita , Sandeep Tata

Multidimensional databases are a great asset for decision making. Their users express complex OLAP (On-Line Analytical Processing) queries, often returning huge volumes of facts, sometimes providing little or no information. Furthermore,…

Databases · Computer Science 2012-08-02 Saida Aissi , Mohamed Salah Gouider

Main memory column-stores have proven to be efficient for processing analytical queries. Still, there has been much less work in the context of clusters. Using only a single machine poses several restrictions: Processing power and data…

Databases · Computer Science 2017-09-18 Demian Hespe , Martin Weidner , Jonathan Dees , Peter Sanders

Compressing integer keys is a fundamental operation among multiple communities, such as database management (DB), information retrieval (IR), and high-performance computing (HPC). Recent advances in \emph{learned indexes} have inspired the…

Databases · Computer Science 2024-12-17 Qiyu Liu , Siyuan Han , Jianwei Liao , Jin Li , Jingshu Peng , Jun Du , Lei Chen

Data warehouses are the core of decision support sys- tems, which nowadays are used by all kind of enter- prises in the entire world. Although many studies have been conducted on the need of decision support systems (DSSs) for small…

Databases · Computer Science 2011-02-02 Oksana Grabova , Jérôme Darmont , Jean-Hugues Chauchat , Iryna Zolotaryova

Data is the central asset of today's dynamically operating organization and their business. This data is usually stored in database. A major consideration is applied on the security of that data from the unauthorized access and intruders.…

Databases · Computer Science 2013-08-25 Manish Sharma , Atul Chaudhary , Santosh Kumar

Databases play an essential role in our society today. Databases are embedded in sectors like corporations, institutions, and government organizations, among others. These databases are used for our video and audio streaming platforms,…

Databases · Computer Science 2018-10-02 Jasper Kyle Catapang

Read-optimized columnar databases use differential updates to handle writes by maintaining a separate write-optimized delta partition which is periodically merged with the read-optimized and compressed main partition. This merge process…

In industrial and IoT environments, massive amounts of real-time and historical process data are continuously generated and archived. With sensors and devices capturing every operational detail, the volume of time-series data has become a…

Databases · Computer Science 2025-11-03 Reham Faqehi , Haya Alhuraib , Hamad Saiari , Zyad Bamigdad

With the more and more growing demand for semantic Web services over large databases, an efficient evaluation of Datalog queries is arousing a renewed interest among researchers and industry experts. In this scenario, to reduce memory…

Artificial Intelligence · Computer Science 2020-02-19 Alessio Fiorentino , Nicola Leone , Marco Manna , Simona Perri , Jessica Zangari
‹ Prev 1 2 3 10 Next ›