English
Related papers

Related papers: MapReplay: Trace-Driven Benchmark Generation for J…

200 papers

MapReduce is a popular programming paradigm for developing large-scale, data-intensive computation. Many frameworks that implement this paradigm have recently been developed. To leverage these frameworks, however, developers must become…

Databases · Computer Science 2018-06-20 Maaz Bin Safeer Ahmad , Alvin Cheung

MapReduce is a technique used to vastly improve distributed processing of data and can massively speed up computation. Hadoop and its MapReduce relies on JVM and Java which is expensive on memory. High Performance Computing based MapReduce…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-06-29 Vignesh S. , Muthumanikandan V. , Siddarth S. , Sainath G

Analyzing large scale data has emerged as an important activity for many organizations in the past few years. This large scale data analysis is facilitated by the MapReduce programming and execution model and its implementations, most…

Databases · Computer Science 2012-03-02 Iman Elghandour , Ashraf Aboulnaga

Developers often use microbenchmarks to choose the most performant implementation of a method or a class. On the Java Virtual Machine (JVM), this is commonly done using the Java Microbenchmark Harness (JMH) which addresses common pitfalls…

Programming Languages · Computer Science 2026-05-25 Filippo Schiavio , Lubomír Bulej , Walter Binder

To efficiently exploit the resources of new many-core architectures, integrating dozens or even hundreds of cores per chip, parallel programming models have evolved to expose massive amounts of parallelism, often in the form of fine-grained…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-05-14 Andi Drebes , Karine Heydemann , Antoniu Pop , Albert Cohen , Nathalie Drach

Due to the high computational demands executing a rigorous comparison between hyperparameter optimization (HPO) methods is often cumbersome. The goal of this paper is to facilitate a better empirical evaluation of HPO methods by providing…

Machine Learning · Computer Science 2019-05-14 Aaron Klein , Frank Hutter

This is the era of High Performance Computing (HPC). There is a great demand of the best performance evaluation techniques for the file and storage systems. The task of evaluation is both necessary and hard. It gives in depth analysis of…

Operating Systems · Computer Science 2013-12-09 Brijender Kahanwal , Tejinder Pal Singh

An application's performance regressions can be detected by both application or microbenchmarks. While application benchmarks stress the system under test by sending synthetic but realistic requests which, e.g., simulate real user traffic,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-08 Nils Japke , Christoph Witzko , Martin Grambow , David Bermbach

Many graphics rendering algorithms used in both real-time games and virtual reality applications can get performance boosts by temporally reusing previous computations. However, algorithms based on temporal reuse are typically measured…

Graphics · Computer Science 2023-05-09 Erfan Momeni Yazdi , Markku Mäkitalo , Julius Ikkala , Pekka Jääskeläinen

The recently proposed Renaissance suite is composed of modern, real-world, concurrent, and object-oriented workloads that exercise various concurrency primitives of the JVM. Renaissance was used to compare performance of two…

Production deployment of AI coding agents requires fast, reproducible evaluation signals. Existing industrial practices trade off speed and fidelity: online A/B testing takes weeks and risks user experience, shadow deployment yields signals…

Software Engineering · Computer Science 2026-05-12 Smriti Jha , Matteo Paltenghi , Chandra Maddila , Vijayaraghavan Murali , Shubham Ugare , Satish Chandra

Many parallel data frameworks have been proposed in recent years that let sequential programs access parallel processing. To capitalize on the benefits of such frameworks, existing code must often be rewritten to the domain-specific…

Programming Languages · Computer Science 2016-11-24 Maaz Bin Safeer Ahmad , Alvin Cheung

In this paper, we study CPU utilization time patterns of several MapReduce applications. After extracting running patterns of several applications, they are saved in a reference database to be later used to tweak system parameters to…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-01-30 Nikzad Babaii Rizvandi , Javid Taheri , Albert Y. Zomaya

With the growing reliance on automated code completion tools in software development, the need for comprehensive evaluation benchmarks has become critical. Existing benchmarks focus more on code completion in function and class level by…

Software Engineering · Computer Science 2025-11-03 Qinyun Wu , Chao Peng , Pengfei Gao , Ruida Hu , Haoyu Gan , Bo Jiang , Jinhe Tang , Zhiwen Deng , Zhanming Guan , Cuiyun Gao , Xia Liu , Ping Yang

Understanding and predicting the performance of big data applications running in the cloud or on-premises could help minimise the overall cost of operations and provide opportunities in efforts to identify performance bottlenecks. The…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-26 Sheriffo Ceesay , Adam Barker , Yuhui Lin

Mobile application performance is a vital factor for user experience. Yet, performance issues are notoriously difficult to detect in development environments, where they often manifest less conspicuously, making their diagnosis more…

Software Engineering · Computer Science 2026-03-05 Zhengquan Li , Zhenhao Li , Zishuo Ding

Most of the popular Big Data analytics tools evolved to adapt their working environment to extract valuable information from a vast amount of unstructured data. The ability of data mining techniques to filter this helpful information from…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-09-23 Taha Tekdogan , Ali Cakmak

MapReduce (MR) is the most popular solution to build applications for large-scale data processing. These applications are often deployed on large clusters of commodity machines, where failures happen constantly due to bugs, hardware…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-02-11 João Eugenio Marynowski , Michel Albonico , Eduardo Cunha de Almeida , Gerson Sunyé

Long-running service workloads (e.g. web search engine) and short-term data analysis workloads (e.g. Hadoop MapReduce jobs) co-locate in today's data centers. Developing realistic benchmarks to reflect such practical scenario of mixed…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-12-07 Rui Han , Shulin Zhan , Chenrong Shao , Junwei Wang , Lizy K. John , Jiangtao Xu , Gang Lu , Lei Wang

WebAssembly (Wasm for short) brings a new, powerful capability to the web as well as Edge, IoT, and embedded systems. Wasm is a portable, compact binary code format with high performance and robust sandboxing properties. As Wasm…

Programming Languages · Computer Science 2024-09-04 Doehyun Baek , Jakob Getz , Yusung Sim , Daniel Lehmann , Ben L. Titzer , Sukyoung Ryu , Michael Pradel
‹ Prev 1 2 3 10 Next ›