English
Related papers

Related papers: Scheduling Data Intensive Workloads through Virtua…

200 papers

Cloud Computing is emerging as a new computational paradigm shift. Hadoop-MapReduce has become a powerful Computation Model for processing large data on distributed commodity hardware clusters such as Clouds. In all Hadoop implementations,…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-07-04 B. Thirumala Rao , L. S. S. Reddy

We consider a natural scheduling problem which arises in many distributed computing frameworks. Jobs with diverse resource requirements (e.g. memory requirements) arrive over time and must be served by a cluster of servers, each with a…

Networking and Internet Architecture · Computer Science 2019-01-21 Konstantinos Psychas , Javad Ghaderi

MapReduce, the popular programming paradigm for large-scale data processing, has traditionally been deployed over tightly-coupled clusters where the data is already locally available. The assumption that the data and compute resources are…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-07-31 Benjamin Heintz , Abhishek Chandra , Ramesh K. Sitaraman

The primary motivation for uptake of virtualization has been resource isolation, capacity management and resource customization allowing resource providers to consolidate their resources in virtual machines. Various approaches have been…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-09-27 Omer Khalid , Ivo Maljevic , Richard Anthony , Miltos Petridis , Kevin Parrot , Markus Schulz

It is cost-efficient for a tenant with a limited budget to establish a virtual MapReduce cluster by renting multiple virtual private servers (VPSs) from a VPS provider. To provide an appropriate scheduling scheme for this type of computing…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-09-12 Ming-Chang Lee , Jia-Chun Lin , Ramin Yahyapour

In hadoop, the job scheduling is an independent module, users can design their own job scheduler based on their actual application requirements, thereby meet their specific business needs. Currently, hadoop has three schedulers: FIFO,…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-06-02 Bo Jiang , Jiaying Wu , Xiuyu Shi , Ruhuan Huang

Undoubtedly, the MapReduce is the most powerful programming paradigm in distributed computing. The enhancement of the MapReduce is essential and it can lead the computing faster. Therefore, here are many scheduling algorithms to discuss…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-04-11 Rajdeep Das , Rohit Pratap Singh , Ripon Patgiri

MapReduce framework is the de facto standard in Hadoop. Considering the data locality in data centers, the load balancing problem of map tasks is a special case of affinity scheduling problem. There is a huge body of work on affinity…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-05-10 Mohammadamir Kavousi

Distributed cloud environments hosting data-intensive applications often experience slowdowns due to network congestion, asymmetric bandwidth, and inter-node data shuffling. These factors are typically not captured by traditional host-level…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-21 Sankalpa Timilsina , Susmit Shannigrahi

MapReduce is emerged as a prominent programming model for data-intensive computation. In this work, we study power-aware MapReduce scheduling in the speed scaling setting first introduced by Yao et al. [FOCS 1995]. We focus on the…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-13 Evripidis Bampis , Vincent Chau , Dimitrios Letsios , Giorgio Lucarelli , Ioannis Milis , Georgios Zois

With the rapid development of cloud computing, virtual machine scheduling has become one of the most important but challenging issues for the cloud computing community, especially for practical heterogeneous request sequences. By analyzing…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-30 Haochuan Cui , Junjie Sheng , Bo Jin , Yiqiu Hu , Li Su , Lei Zhu , Wenli Zhou , Xiangfeng Wang

Nowadays many companies have available large amounts of raw, unstructured data. Among Big Data enabling technologies, a central place is held by the MapReduce framework and, in particular, by its open source implementation, Apache Hadoop.…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-01-18 Eugenio Gianniti , Danilo Ardagna , Michele Ciavotta , Mauro Passacantando

In this paper, a method for efficient scheduling to obtain optimum job throughput in a distributed campus grid environment is presented; Traditional job schedulers determine job scheduling using user and job resource attributes. User…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-07-15 Srirangam V Addepallil , Per Andersen , George L Barnes

A queue is required when a service provider is not able to handle jobs arriving over the time. In a highly flexible and dynamic environment, some jobs might demand for faster execution at run-time especially when the resources are limited…

Performance · Computer Science 2015-03-24 Yash Gupta , Kamalakar Karlapalem

Modern industry-scale data centers need to manage a large number of virtual machines (VMs). Due to the continual creation and release of VMs, many small resource fragments are scattered across physical machines (PMs). To handle these…

Machine Learning · Computer Science 2025-05-26 Xianzhong Ding , Yunkai Zhang , Binbin Chen , Donghao Ying , Tieying Zhang , Jianjun Chen , Lei Zhang , Alberto Cerpa , Wan Du

Advance reservation is important to guarantee the quality of services of jobs by allowing exclusive access to resources over a defined time interval on resources. It is a challenge for the scheduler to organize available resources…

Distributed, Parallel, and Cluster Computing · Computer Science 2012-03-06 Bo Li , Yijian Pei , Bin Shen , Hao Wu , Min He , Jundong Yang

We propose a novel job scheduling approach for homogeneous cluster computing platforms. Its key feature is the use of virtual machine technology to share fractional node resources in a precise and controlled manner. Other VM-based…

Distributed, Parallel, and Cluster Computing · Computer Science 2011-06-27 Henri Casanova , Mark Stillwell , Frédéric Vivien

Virtualization technology has enabled applications to be decoupled from the underlying hardware providing the benefits of portability, better control over execution environment and isolation. It has been widely adopted in scientific grids…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-09-27 Omer Khalid , Ivo Maljevic , Richard Anthony , Miltos Petridis , Kevin Parrot , Markus Schulz

With the rapid growth in computing power demand, cloud native networks have emerged as a promising solution to address the challenges of efficient resource coordination, particularly in coping with the dynamic fluctuations of network…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-02-11 Hao Jiang , Meng Qin , Ruijie Kuai , Dandan Liang , Yue Gao

Load balance is important for MapReduce to reduce job duration, increase parallel efficiency, etc. Previous work focuses on coarse-grained scheduling. This study concerns fine-grained scheduling on MapReduce operations. Each operation…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-04-15 Liya Fan , Bo Gao , Xi Sun , Fa Zhang , Zhiyong Liu
‹ Prev 1 2 3 10 Next ›