Related papers: Difference Sequence Compression of Multidimensiona…

Caching in Multidimensional Databases

One utilisation of multidimensional databases is the field of On-line Analytical Processing (OLAP). The applications in this area are designed to make the analysis of shared multidimensional information fast [9]. On one hand, speed can be…

Databases · Computer Science 2011-05-04 István Szépkúti

Data Compression for Analytics over Large-scale In-memory Column Databases

Data compression schemes have exhibited their importance in column databases by contributing to the high-performance OLAP (Online Analytical Processing) query processing. Existing works mainly concentrate on evaluating compression schemes…

Databases · Computer Science 2016-07-06 Chunbin Lin, Jianguo Wang, Yannis Papakonstantinou

DNA Lossless Differential Compression Algorithm based on Similarity of Genomic Sequence Database

Modern biological science produces vast amounts of genomic sequence data. This is fuelling the need for efficient algorithms for sequence compression and analysis. Data compression and the associated techniques coming from information…

Data Structures and Algorithms · Computer Science 2011-09-05 Heba Afify, Muhammad Islam, Manal Abdel Wahed

Difference-Huffman Coding of Multidimensional Databases

A new compression method called difference-Huffman coding (DHC) is introduced in this paper. It is verified empirically that DHC results in a smaller multidimensional physical representation than those for other previously published…

Databases · Computer Science 2011-04-21 István Szépkúti

Multidimensional or Relational? / How to Organize an On-line Analytical Processing Database

In the past few years, the number of OLAP applications increased quickly. These applications use two significantly different DB structures: multidimensional (MD) and table-based. One can show that the traditional model of relational…

Databases · Computer Science 2011-04-21 István Szépkúti

Optimizations and Heuristics to improve Compression in Columnar Database Systems

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

Multi-Dimensional Data Compression and Query Processing in Array Databases

In recent times, the production of multidimensional data in various domains and their storage in array databases has witnessed a sharp increase; this rapid growth in data volumes necessitates compression in array databases. However,…

Databases · Computer Science 2022-11-14 Minsoo Kim, Hyubjin Lee, Yon Dohn Chung

On the Scalability of Multidimensional Databases

It is commonly accepted in the practice of on-line analytical processing of databases that the multidimensional database organization is less scalable than the relational one. It is easy to see that the size of the multidimensional…

Databases · Computer Science 2011-04-27 István Szépkúti

Towards the Next Generation of Data Warehouse Personalization System: A Survey and a Comparative Study

Multidimensional databases are a great asset for decision making. Their users express complex OLAP (On-Line Analytical Processing) queries, often returning huge volumes of facts, sometimes providing little or no information. Furthermore,…

Databases · Computer Science 2012-08-02 Saida Aissi, Mohamed Salah Gouider

Compressing Binary Decision Diagrams

The paper introduces a new technique for compressing Binary Decision Diagrams in those cases where random access is not required. Using this technique, compression and decompression can be done in linear time in the size of the BDD and…

Artificial Intelligence · Computer Science 2008-12-18 Esben Rune Hansen, S. Srinivasa Rao, Peter Tiedemann

Compression Aware Physical Database Design

Modern RDBMSs support the ability to compress data using methods such as null suppression and dictionary encoding. Data compression offers the promise of significantly reducing storage requirements and improving I/O performance for decision…

Databases · Computer Science 2011-09-06 Hideaki Kimura, Vivek Narasayya, Manoj Syamala

Bilateral Distribution Compression: Reducing Both Data Size and Dimensionality

Existing distribution compression methods reduce the number of observations in a dataset by minimising the Maximum Mean Discrepancy (MMD) between original and compressed sets, but modern datasets are often large in both sample size and…

Machine Learning · Statistics 2026-01-28 Dominic Broadbent, Nick Whiteley, Robert Allison, Tom Lovett

Multidimensional Datawarehouse with Combination Formula

Multidimensional in data warehouse is a compulsion and become the most important for information delivery, without multidimensional…

Databases · Computer Science 2010-06-10 Spits Warnars

Genetic Sequence compression using Machine Learning and Arithmetic Encoding Decoding Techniques

We live in a period where bio-informatics is rapidly expanding, a significant quantity of genomic data has been produced as a result of the advancement of high-throughput genome sequencing technology, raising concerns about the costs…

Quantitative Methods · Quantitative Biology 2023-03-10 Mehedi Hasan Sarkar, Adnan Ferdous Ashrafi

A Constraint Programming Approach for Mining Sequential Patterns in a Sequence Database

Constraint-based pattern discovery is at the core of numerous data mining tasks. Patterns are extracted with respect to a given set of constraints (frequency, closedness, size, etc). In the context of sequential pattern mining, a large…

Artificial Intelligence · Computer Science 2013-11-28 Jean-Philippe Métivier, Samir Loudni, Thierry Charnois

Compressing Sets and Multisets of Sequences

This article describes lossless compression algorithms for multisets of sequences, taking advantage of the multiset's unordered structure. Multisets are a generalisation of sets where members are allowed to occur multiple times. A multiset…

Information Theory · Computer Science 2014-01-27 Christian Steinruecken

Constraint-based Sequential Pattern Mining with Decision Diagrams

Constrained sequential pattern mining aims at identifying frequent patterns on a sequential database of items while observing constraints defined over the item attributes. We introduce novel techniques for constraint-based sequential…

Machine Learning · Computer Science 2019-01-01 Amin Hosseininasab, Willem-Jan van Hoeve, Andre A. Cire

Improved Distribution Matching for Dataset Condensation

Dataset Condensation aims to condense a large dataset into a smaller one while maintaining its ability to train a well-performing model, thus reducing the storage cost and training effort in deep learning applications. However, conventional…

Machine Learning · Computer Science 2023-07-20 Ganlong Zhao, Guanbin Li, Yipeng Qin, Yizhou Yu

Data Inference from Encrypted Databases: A Multi-dimensional Order-Preserving Matching Approach

Due to increasing concerns of data privacy, databases are being encrypted before they are stored on an untrusted server. To enable search operations on the encrypted data, searchable encryption techniques have been proposed. Representative…

Cryptography and Security · Computer Science 2020-01-27 Yanjun Pan, Alon Efrat, Ming Li, Boyang Wang, Hanyu Quan, Joseph Mitchell, Jie Gao, Esther Arkin

Multisize Dataset Condensation

While dataset condensation effectively enhances training efficiency, its application in on-device scenarios brings unique challenges. 1) Due to the fluctuating computational resources of these devices, there's a demand for a flexible…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yang He, Lingao Xiao, Joey Tianyi Zhou, Ivor Tsang