English
Related papers

Related papers: Demystifying Object-based Big Data Storage Systems

200 papers

Distributed File Systems (DFS) have emerged as sophisticated solutions for efficient file storage and management across interconnected computer nodes. The main objective of DFS is to achieve flexible, scalable, and resilient file storage…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-09 Xueting Pan , Ziqian Luo , Lisang Zhou

Big data systems development is full of challenges in view of the variety of application areas and domains that this technology promises to serve. Typically, fundamental design decisions involved in big data systems design include choosing…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-29 Samiya Khan , Xiufeng Liu , Syed Arshad Ali , Mansaf Alam

This paper analyzes various distributed storage systems that use data fragmentation and dispersal as a way of protection.Existing solutions have been organized into two categories: bitwise and structurewise. Systems from the bitwise…

Cryptography and Security · Computer Science 2017-06-20 Katarzyna Kapusta , Gerard Memmi

Big Data is defined as high volume of variety of data with an exponential data growth rate. Data are amalgamated to generate revenue, which results a large data silo. Data are the oils of modern IT industries. Therefore, the data are…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-15 Ripon Patgiri , Sabuzima Nayak

The exponential growth of big data has transformed how large organisations leverage information to drive innovation, optimise processes, and maintain competitive advantages. However, managing and extracting insights from vast, heterogeneous…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-05-09 Fathima Nuzla Ismail , Abira Sengupta , Shanika Amarasoma

Big data storage management is one of the most challenging issues for Grid computing environments, since large amount of data intensive applications frequently involve a high degree of data access locality. Grid applications typically deal…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-07-13 Ajay Kumar , Seema Bawa

Data availability is one of the most important features in distributed storage systems, made possible by data replication. Nowadays data are generated rapidly and the goal to develop efficient, scalable and reliable storage systems has…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-02-01 Chryssis Georgiou , Nicolas Nicolaou , Andria Trigeorgi

This work examines strategies to handle large shared data objects in distributed storage systems (DSS), while boosting the number of concurrent accesses, maintaining strong consistency guarantees, and ensuring good operation performance. To…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-03-09 Antonio Fernandez Anta , Chryssis Georgiou , Theophanis Hadjistasi , Nicolas Nicolaou , Efstathios Stavrakis , Andria Trigeorgi

Big Data technology is described. Big data is a popular term used to describe the exponential growth and availability of data, both structured and unstructured. There is constructed dataspace architecture. Dataspace has focused solely - and…

Databases · Computer Science 2019-05-07 Nataliya Shakhovska , Yurii Bolubash

Data storage systems serve as the foundation of digital society. The enormous data generated by people on a daily basis make the fault tolerance of data storage systems increasingly important. Unfortunately, modern storage systems consist…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-08 Mai Zheng , Duo Zhang , Ahmed Dajani

This paper presents an architecture, based on Distributed Ledger Technologies (DLTs) and Decentralized File Storage (DFS) systems, to support the use of Personal Information Management Systems (PIMS). DLT and DFS are used to manage data…

Cryptography and Security · Computer Science 2020-07-08 Mirko Zichichi , Stefano Ferretti , Gabriele D'Angelo

Recent developments in the industry of personal computing led to a greater number of the so-called edge devices. Such devices typically do not collaborate or foresee the possibility of collaboration to offer aggregated storage and computing…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-17 R. Copstein , F. Dotti

One of the purposes of Big Data systems is to support analysis of data gathered from heterogeneous data sources. Since data warehouses have been used for several decades to achieve the same goal, they could be leveraged also to provide…

Databases · Computer Science 2018-09-13 Darja Solodovnikova , Laila Niedrite

Many research questions can be answered quickly and efficiently using data already collected for previous research. This practice is called secondary data analysis (SDA), and has gained popularity due to lower costs and improved research…

Digital Libraries · Computer Science 2020-04-07 Yasith Jayawardana , Sampath Jayarathna

This article dwells on the basic characteristic features of the Big Data technologies. It is analyzed the existing definition of the "big data" term. The article proposes and describes the elements of the generalized formal model of big…

Databases · Computer Science 2019-05-09 Shakhovska Nataliya , Veres Oleh , Hirnyak Mariia

Big data management is a reality for an increasing number of organizations in many areas and represents a set of challenges involving big data modeling, storage and retrieval, analysis and visualization. However, technological resources,…

Databases · Computer Science 2022-03-15 Rogerio Rossi , Kechi Hirama

In this paper we look at the growth of distributed object stores (DOS) and examine the underlying mechanisms that guide their use and development. Our focus is on the fundamental principles of operation that define this class of system, how…

Software Engineering · Computer Science 2013-08-09 Robert Primmer

Recently, we have been witnessing huge advancements in the scale of data we routinely generate and collect in pretty much everything we do, as well as our ability to exploit modern technologies to process, analyze and understand this data.…

Databases · Computer Science 2017-09-25 Radwa Elshawi , Sherif Sakr

Companies are using machine learning to solve real-world problems and are developing hundreds to thousands of features in the process. They are building feature engineering pipelines as part of MLOps life cycle to transform data from…

Machine Learning · Computer Science 2023-06-01 Anya Li , Bhala Ranganathan , Feng Pan , Mickey Zhang , Qianjun Xu , Runhan Li , Sethu Raman , Shail Paragbhai Shah , Vivienne Tang

Big Data processing systems handle huge unstructured and structured data to store, process, and analyze through cluster analysis which helps in identifying unseen patterns to find the relationships between them. Clustering analysis over the…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-11 Dipesh Gyawali
‹ Prev 1 2 3 10 Next ›