Related papers: A Linear Combination-based Method to Construct Pro…

Data Motif-based Proxy Benchmarks for Big Data and AI Workloads

For the architecture community, reasonable simulation time is a strong requirement in addition to performance data accuracy. However, emerging big data and AI workloads are too huge at binary size level and prohibitively expensive to run on…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-23 Wanling Gao , Jianfeng Zhan , Lei Wang , Chunjie Luo , Zhen Jia , Daoyi Zheng , Chen Zheng , Xiwen He , Hainan Ye , Haibin Wang , Rui Ren

A Dwarf-based Scalable Big Data Benchmarking Methodology

Different from the traditional benchmarking methodology that creates a new benchmark or proxy for every possible workload, this paper presents a scalable big data benchmarking methodology. Among a wide variety of big data analytics…

Hardware Architecture · Computer Science 2017-11-10 Wanling Gao , Lei Wang , Jianfeng Zhan , Chunjie Luo , Daoyi Zheng , Zhen Jia , Biwei Xie , Chen Zheng , Qiang Yang , Haibin Wang

Crossing the Architectural Barrier: Evaluating Representative Regions of Parallel HPC Applications

Exascale computing will get mankind closer to solving important social, scientific and engineering problems. Due to high prototyping costs, High Performance Computing (HPC) system architects make use of simulation models for design space…

Performance · Computer Science 2018-03-28 Alexandra Ferreron , Radhika Jagtap , Sascha Bischoff , Roxana Rusitoru

A linear-time benchmarking tool for generalized surface codes

Quantum information processors need to be protected against errors and faults. One of the most widely considered fault-tolerant architecture is based on surface codes. While the general principles of these codes are well understood and…

Quantum Physics · Physics 2016-11-15 Nicolas Delfosse , Pavithran Iyer , David Poulin

BigDataBench: a Big Data Benchmark Suite from Internet Services

As architecture, systems, and data management communities pay greater attention to innovative big data systems and architectures, the pressure of benchmarking and evaluating these systems rises. Considering the broad use of big data…

Databases · Computer Science 2016-11-17 Lei Wang , Jianfeng Zhan , Chunjie Luo , Yuqing Zhu , Qiang Yang , Yongqiang He , Wanling Gao , Zhen Jia , Yingjie Shi , Shujie Zhang , Chen Zheng , Gang Lu , Kent Zhan , Xiaona Li , Bizhu Qiu

Can Small Training Runs Reliably Guide Data Curation? Rethinking Proxy-Model Practice

Data teams at frontier AI companies routinely train small proxy models to make critical decisions about pretraining data recipes for full-scale training runs. However, the community has a limited understanding of whether and when…

Machine Learning · Computer Science 2026-04-14 Jiachen T. Wang , Tong Wu , Kaifeng Lyu , James Zou , Dawn Song , Ruoxi Jia , Prateek Mittal

100x Cost & Latency Reduction: Performance Analysis of AI Query Approximation using Lightweight Proxy Models

Several data warehouse and database providers have recently introduced extensions to SQL called AI Queries, enabling users to specify functions and conditions in SQL that are evaluated by LLMs, thereby broadening significantly the kinds of…

Databases · Computer Science 2026-04-16 Yeounoh Chung , Rushabh Desai , Jian He , Yu Xiao , Thibaud Hottelier , Yves-Laurent Kom Samo , Pushkar Khadilkar , Xianshun Chen , Sam Idicula , Fatma Özcan , Alon Halevy , Yannis Papakonstantinou

Using Small Proxy Datasets to Accelerate Hyperparameter Search

One of the biggest bottlenecks in a machine learning workflow is waiting for models to train. Depending on the available computing resources, it can take days to weeks to train a neural network on a large dataset with many classes such as…

Machine Learning · Computer Science 2019-06-13 Sam Shleifer , Eric Prokop

On Benchmarking the Capability of Symbolic Execution Tools with Logic Bombs

Symbolic execution now becomes an indispensable technique for software testing and program analysis. There are several symbolic execution tools available off-the-shelf, and we need a practical benchmark approach to learn their capabilities.…

Software Engineering · Computer Science 2018-05-28 Hui Xu , Zirui Zhao , Yangfan Zhou , Michael R. Lyu

Fake Runs, Real Fixes -- Analyzing xPU Performance Through Simulation

As models become larger, ML accelerators are a scarce resource whose performance must be continually optimized to improve efficiency. Existing performance analysis tools are coarse grained, and fail to capture model performance at the…

Performance · Computer Science 2025-03-20 Ioannis Zarkadas , Amanda Tomlinson , Asaf Cidon , Baris Kasikci , Ofir Weisse

Characterizing and Subsetting Big Data Workloads

Big data benchmark suites must include a diversity of data and workloads to be useful in fairly evaluating big data systems and architectures. However, using truly comprehensive benchmarks poses great challenges for the architecture…

Performance · Computer Science 2016-11-15 Zhen Jia , Jianfeng Zhan , Lei Wang , Rui Han , Sally A. McKee , Qiang Yang , Chunjie Luo , Jingwei Li

PAI: Fast, Accurate, and Full Benchmark Performance Projection with AI

The exponential increase in complex IPs within modern SoCs, driven by Moore's Law, has created a pressing need for fast and accurate hardware-software power-performance analysis. Traditional performance simulators (such as cycle accurate…

Hardware Architecture · Computer Science 2026-03-23 Avery Johnson , Mohammad Majharul Islam , Riad Akram , Abdullah Muzahid

Towards CPU Performance Prediction: New Challenge Benchmark Dataset and Novel Approach

The server central processing unit (CPU) market continues to exhibit robust demand due to the rising global need for computing power. Against this backdrop, CPU benchmark performance prediction is crucial for architecture designers. It…

Performance · Computer Science 2024-10-29 Xiaoman Liu

DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended)

The past few years have seen a surge of applying Deep Learning (DL) models for a wide array of tasks such as image classification, object detection, machine translation, etc. While DL models provide an opportunity to solve otherwise…

Machine Learning · Computer Science 2021-03-02 Cheng Li , Abdul Dakkak , Jinjun Xiong , Wen-mei Hwu

Benchmarking Simulation-Based Inference

Recent advances in probabilistic modelling have led to a large number of simulation-based inference algorithms which do not require numerical evaluation of likelihoods. However, a public benchmark with appropriate performance metrics for…

Machine Learning · Statistics 2021-04-12 Jan-Matthis Lueckmann , Jan Boelts , David S. Greenberg , Pedro J. Gonçalves , Jakob H. Macke

PageRank Pipeline Benchmark: Proposal for a Holistic System Benchmark for Big-Data Platforms

The rise of big data systems has created a need for benchmarks to measure and compare the capabilities of these systems. Big data benchmarks present unique scalability challenges. The supercomputing community has wrestled with these…

Performance · Computer Science 2016-12-13 Patrick Dreher , Chansup Byun , Chris Hill , Vijay Gadepally , Bradley Kuszmaul , Jeremy Kepner

Forecasting Downstream Performance of LLMs With Proxy Metrics

Progress in language model development is often driven by comparative decisions: which architecture to adopt, which pretraining corpus to use, or which training recipe to apply. Making these decisions well requires reliable performance…

Computation and Language · Computer Science 2026-05-19 Arkil Patel , Siva Reddy , Marius Mosbach , Dzmitry Bahdanau

Novel Power and Completion Time Models for Virtualized Environments

Power consumption costs takes upto half of operational expenses of datacenters making power management a critical concern. Advances in processor technology provide fine-grained control over operating frequency and voltage of processors and…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-11-13 Swetha P. T. Srinivasan , Umesh Bellur

MD-Bench: A generic proxy-app toolbox for state-of-the-art molecular dynamics algorithms

Proxy-apps, or mini-apps, are simple self-contained benchmark codes with performance-relevant kernels extracted from real applications. Initially used to facilitate software-hardware co-design, they are a crucial ingredient for serious…

Computational Physics · Physics 2022-07-28 Rafael Ravedutti Lucio Machado , Jan Eitzinger , Harald Köstler , Gerhard Wellein

Simulation-Based Performance Prediction of HPC Applications: A Case Study of HPL

We propose a simulation-based approach for performance modeling of parallel applications on high-performance computing platforms. Our approach enables full-system performance modeling: (1) the hardware platform is represented by an abstract…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-11-06 Gen Xu , Huda Ibeid , Xin Jiang , Vjekoslav Svilan , Zhaojuan Bian