Related papers: Bigtable Merge Compaction

200 papers

In the burgeoning era of big data, selecting the optimal database solution has become a critical decision for organizations across every industry. Big data demands a powerful database solution. Traditionally, SQL Database, Database ruled,…

Databases · Computer Science 2024-05-10 Mohamed Hassan

The demanding requirements of the new Big Data intensive era raised the need for flexible storage systems capable of handling huge volumes of unstructured data and of tackling the challenges that traditional databases were facing. NoSQL…

Databases · Computer Science 2020-03-17 Chaimae Asaad, Karim Baïna, Mounir Ghogho

Big data systems development is full of challenges in view of the variety of application areas and domains that this technology promises to serve. Typically, fundamental design decisions involved in big data systems design include choosing…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-29 Samiya Khan, Xiufeng Liu, Syed Arshad Ali, Mansaf Alam

The digital world is growing very fast and becoming more complex in volume (terabyte to petabyte), variety (structured, unstructured, and hybrid), and velocity (high speed of growth). This is referred to as Big Data, which is a global…

Databases · Computer Science 2013-07-02 A B M Moniruzzaman, Syed Akhter Hossain

NoSQL databases have become an important component of many big data and real-time web applications. Their distributed nature and scalability make them an ideal data storage repository for a variety of use cases. While NoSQL databases are…

Databases · Computer Science 2022-12-26 Florian Eppinger, Uta Störl

Big data systems development is full of challenges in view of the variety of application areas and domains that this technology promises to serve. Typically, fundamental design decisions involved in big data systems design include choosing…

Databases · Computer Science 2019-11-27 Samiya Khan, Xiufeng Liu, Syed Arshad Ali, Mansaf Alam

In recent years, the increased need to house and process large volumes of data has prompted the need for distributed storage and querying systems. The growth of machine-readable RDF triples has prompted both industry and academia to develop…

Databases · Computer Science 2016-01-11 Albert Haque

Context: The efficient processing of Big Data is a challenging task for SQL and NoSQL Databases, where competent software architecture plays a vital role. The SQL Databases are designed for structuring data and supporting vertical…

Databases · Computer Science 2022-09-16 Wisal Khan, Teerath Kumar, Zhang Cheng, Kislay Raj, Arunabha M Roy, Bin Luo

Fragmentation leads to unpredictable and degraded application performance. While these problems have been studied in detail for desktop filesystem workloads, this study examines newer systems such as scalable object stores and multimedia…

Databases · Computer Science 2009-08-21 Russell Sears, Catharine van Ingen

Log-structured merge (LSM) trees offer efficient ingestion by appending incoming data, and thus, are widely used as the storage layer of production NoSQL data stores. To enable competitive read performance, LSM-trees periodically…

Databases · Computer Science 2022-03-01 Subhadeep Sarkar, Dimitris Staratzis, Zichen Zhu, Manos Athanassoulis

Big Data processing systems handle huge unstructured and structured data to store, process, and analyze through cluster analysis which helps in identifying unseen patterns to find the relationships between them. Clustering analysis over the…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-11 Dipesh Gyawali

In the last few years, the volume of data has grown manyfold. Data stores have been inundated by various disparate potential data outlets, led by social media such as Facebook, Twitter, etc. The existing data models are largely…

Databases · Computer Science 2015-10-08 Sugam Sharma

Growing main memory sizes have facilitated database management systems that keep the entire database in main memory. The drastic performance improvements that came along with these in-memory systems have made it possible to reunite the two…

Databases · Computer Science 2012-08-02 Florian Funke, Alfons Kemper, Thomas Neumann

Data-structure dynamization is a general approach for making static data structures dynamic. It is used extensively in geometric settings and in the guise of so-called merge (or compaction) policies in big-data databases such as Google…

Data Structures and Algorithms · Computer Science 2024-07-25 Claire Mathieu, Rajmohan Rajaraman, Neal E. Young, Arman Yousefi

The proliferation of small files in data lakes poses significant challenges, including degraded query performance, increased storage costs, and scalability bottlenecks in distributed storage systems. Log-structured table formats (LSTs) such…

Up until recently, relational databases were considered as the de-facto technology for persisting and managing large volumes of data. This came to change with the emergence of enterprises producing extremely large datasets and having…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-10-15 Sebastian Scholze, Fulya Feryal Horozal, Marie-Saphira Flug, Ana Teresa Correia

Google BigTable's scale-out design for distributed key-value storage inspired a generation of NoSQL databases. Recently the NewSQL paradigm emerged in response to analytic workloads that demand distributed computation local to data storage.…

Databases · Computer Science 2016-12-13 Dylan Hutchison, Jeremy Kepner, Vijay Gadepally, Bill Howe

The article deals with the problem which led to Big Data. Big Data information technology is the set of methods and means of processing different types of structured and unstructured dynamic large amounts of data for their analysis and use…

Databases · Computer Science 2019-05-07 Nataliya Shakhovska, Uyrii Bolubash, Oleh Veres

There is great excitement about learned index structures, but understandable skepticism about the practicality of a new method uprooting decades of research on B-Trees. In this paper, we work to remove some of that uncertainty by…

In the era of big data, conventional RDBMS models have become impractical for handling colossal workloads. Consequently, NoSQL databases have emerged as the preferred storage solutions for executing processing-intensive Online Analytical…
