Related papers: Using Regression Techniques to Predict Large Data …

A Comparative Study of Replication Techniques in Grid Computing Systems

Grid Computing is a type of parallel and distributed systems that is designed to provide reliable access to data and computational resources in wide area networks. These resources are distributed in different geographical locations, however…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-09-27 Sheida Dayyani , Mohammad Reza Khayyambashi

Secure, Efficient Data Transport and Replica Management for High-Performance Data-Intensive Computing

An emerging class of data-intensive applications involve the geographically dispersed extraction of complex scientific information from very large collections of measured or computed data. Such applications arise, for example, in…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Bill Allcock , Joe Bester , John Bresnahan , Ann L. Chervenak , Ian Foster , Carl Kesselman , Sam Meder , Veronika Nefedova , Darcy Quesnel , Steven Tuecke

On Efficient Data Transfers Across Geographically Dispersed Datacenters

As applications become more distributed to improve user experience and offer higher availability, businesses rely on geographically dispersed datacenters that host such applications more than ever. Dedicated inter-datacenter networks have…

Networking and Internet Architecture · Computer Science 2019-08-30 Mohammad Noormohammadpour

On Distributed Storage Allocations of Large Files for Maximum Service Rate

Allocation of (redundant) file chunks throughout a distributed storage system affects important performance metrics such as the probability of file recovery, data download time, or the service rate of the system under a given data access…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-08-24 Pei Peng , Emina Soljanin

Analysis of a Stochastic Model of Replication in Large Distributed Storage Systems: A Mean-Field Approach

Distributed storage systems such as Hadoop File System or Google File System (GFS) ensure data availability and durability using replication. This paper is focused on the analysis of the efficiency of replication mechanism that determines…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-03-28 Wen Sun , Véronique Simon , Sébastien Monnet , Philippe Robert , Pierre Sens

A Taxonomy of Data Grids for Distributed Data Sharing, Management and Processing

Data Grids have been adopted as the platform for scientific communities that need to share, access, transport, process and manage large data collections distributed worldwide. They combine high-end computing technologies with…

Distributed, Parallel, and Cluster Computing · Computer Science 2007-05-23 Srikumar Venugopal , Rajkumar Buyya , Kotagiri Ramamohanarao

A Flexible Bandwidth Reservation Framework for Bulk Data Transfers in Grid Networks

In grid networks, distributed resources are interconnected by wide area network to support compute and data-intensive applications, which require reliable and efficient transfer of gigabits (even terabits) of data. Different from…

Networking and Internet Architecture · Computer Science 2016-08-16 Bin Bin Chen , Pascale Primet

Heterogeneous Strong Computation Migration

The continuous increase in performance requirements, for both scientific computation and industry, motivates the need of a powerful computing infrastructure. The Grid appeared as a solution for inexpensive execution of heavy applications in…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-10-05 Anolan Milanés , Noemi Rodriguez , Bruno Schulze

Towards a Centralized Scheduling Framework for Communication Flows in Distributed Systems

The overall performance of a distributed system is highly dependent on the communication efficiency of the system. Although network resources (links, bandwidth) are becoming increasingly more available, the communication performance of data…

Data Structures and Algorithms · Computer Science 2009-06-02 Mugurel Ionut Andreica , Eliana-Dina Tirsa , Nicolae Tapus , Florin Pop , Ciprian Mihai Dobre

Improving Grid Computing Performance by Optimally Reducing Checkpointing Effect

Grid computing is a collection of computer resources that are gathered together from various areas to give computational resources such as storage, data or application services. This is to permit clients to access this huge measure of…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-05 Garba Aliyu , Kana A. F. D. , Abdullahi Mohammed , Idris Abdulmumin , Shehu Adamu , Fatsuma Jauro

Replication in Data Grids: Metrics and Strategies

We focus in this report on two main axes. The first is dedicated to the study of the effect of replicas distribution on data grid performances. In this respect, our main contributions are as follows: 1) An overview of replication strategies…

Databases · Computer Science 2019-12-24 Tarek Hamrouni

Accelerating Large-scale Data Exploration through Data Diffusion

Data-intensive applications often require exploratory analysis of large datasets. If analysis is performed on distributed resources, data locality can be crucial to high throughput and performance. We propose a "data diffusion" approach…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Ioan Raicu , Yong Zhao , Ian Foster , Alex Szalay

GRID Storage Optimization in Transparent and User-Friendly Way for LHCb Datasets

The LHCb collaboration is one of the four major experiments at the Large Hadron Collider at CERN. Many petabytes of data are produced by the detectors and Monte-Carlo simulations. The LHCb Grid interware LHCbDIRAC is used to make data…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-12-06 Mikhail Hushchyn , Andrey Ustyuzhanin , Philippe Charpentier , Christophe Haen

Grid Capacity Expansion under Data Centers and Electrified Manufacturing Large Loads

In this paper, we consider the expansion of power grids under emerging large loads from data centers and electrified manufacturing. We develop a multi-period grid capacity expansion model to determine optimal investment profiles for power…

Systems and Control · Electrical Eng. & Systems 2026-05-29 Jiyong Lee , Melody Agustin , Joanne Langsdorf , Erhan Kutanolgu , Michael Baldea , Ilias Mitrai

A Security Based Data Mining Approach in Data Grid

Grid computing is the next logical step to distributed computing. Main objective of grid computing is an innovative approach to share resources such as CPU usage; memory sharing and software sharing. Data Grids provide transparent access to…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-03-23 S. Vidhya , S. Karthikeyan

Dynamic Adaptation in Data Storage: Real-Time Machine Learning for Enhanced Prefetching

The exponential growth of data storage demands has necessitated the evolution of hierarchical storage management strategies [1]. This study explores the application of streaming machine learning [3] to revolutionize data prefetching within…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-30 Chiyu Cheng , Chang Zhou , Yang Zhao , Jin Cao

Adaptive Scheduling for Machine Learning Tasks over Networks

A key functionality of emerging connected autonomous systems such as smart transportation systems, smart cities, and the industrial Internet-of-Things, is the ability to process and learn from data collected at different physical locations.…

Machine Learning · Computer Science 2021-01-26 Konstantinos Gatsis

End-to-End Trajectory Distribution Prediction Based on Occupancy Grid Maps

In this paper, we aim to forecast a future trajectory distribution of a moving agent in the real world, given the social scene images and historical trajectories. Yet, it is a challenging task because the ground-truth distribution is…

Computer Vision and Pattern Recognition · Computer Science 2022-04-01 Ke Guo , Wenxi Liu , Jia Pan

Efficient Transmission and Reconstruction of Dependent Data Streams via Edge Sampling

Data stream processing is an increasingly important topic due to the prevalence of smart devices and the demand for real-time analytics. Geo-distributed streaming systems, where cloud-based queries utilize data streams from multiple…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-22 Joel Wolfrath , Abhishek Chandra

On Storage Allocation for Maximum Service Rate in Distributed Storage Systems

Storage allocation affects important performance measures of distributed storage systems. Most previous studies on the storage allocation consider its effect separately either on the success of the data recovery or on the service rate…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Moslem Noori , Emina Soljanin , Masoud Ardakani