English
Related papers

Related papers: A Linear Combination-based Method to Construct Pro…

200 papers

For the architecture community, reasonable simulation time is a strong requirement in addition to performance data accuracy. However, emerging big data and AI workloads are too huge at binary size level and prohibitively expensive to run on…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-23 Wanling Gao , Jianfeng Zhan , Lei Wang , Chunjie Luo , Zhen Jia , Daoyi Zheng , Chen Zheng , Xiwen He , Hainan Ye , Haibin Wang , Rui Ren

Different from the traditional benchmarking methodology that creates a new benchmark or proxy for every possible workload, this paper presents a scalable big data benchmarking methodology. Among a wide variety of big data analytics…

Hardware Architecture · Computer Science 2017-11-10 Wanling Gao , Lei Wang , Jianfeng Zhan , Chunjie Luo , Daoyi Zheng , Zhen Jia , Biwei Xie , Chen Zheng , Qiang Yang , Haibin Wang

Exascale computing will get mankind closer to solving important social, scientific and engineering problems. Due to high prototyping costs, High Performance Computing (HPC) system architects make use of simulation models for design space…

Performance · Computer Science 2018-03-28 Alexandra Ferreron , Radhika Jagtap , Sascha Bischoff , Roxana Rusitoru

Quantum information processors need to be protected against errors and faults. One of the most widely considered fault-tolerant architecture is based on surface codes. While the general principles of these codes are well understood and…

Quantum Physics · Physics 2016-11-15 Nicolas Delfosse , Pavithran Iyer , David Poulin

As architecture, systems, and data management communities pay greater attention to innovative big data systems and architectures, the pressure of benchmarking and evaluating these systems rises. Considering the broad use of big data…

Data teams at frontier AI companies routinely train small proxy models to make critical decisions about pretraining data recipes for full-scale training runs. However, the community has a limited understanding of whether and when…

Machine Learning · Computer Science 2026-04-14 Jiachen T. Wang , Tong Wu , Kaifeng Lyu , James Zou , Dawn Song , Ruoxi Jia , Prateek Mittal

Several data warehouse and database providers have recently introduced extensions to SQL called AI Queries, enabling users to specify functions and conditions in SQL that are evaluated by LLMs, thereby broadening significantly the kinds of…

One of the biggest bottlenecks in a machine learning workflow is waiting for models to train. Depending on the available computing resources, it can take days to weeks to train a neural network on a large dataset with many classes such as…

Machine Learning · Computer Science 2019-06-13 Sam Shleifer , Eric Prokop

Symbolic execution now becomes an indispensable technique for software testing and program analysis. There are several symbolic execution tools available off-the-shelf, and we need a practical benchmark approach to learn their capabilities.…

Software Engineering · Computer Science 2018-05-28 Hui Xu , Zirui Zhao , Yangfan Zhou , Michael R. Lyu

As models become larger, ML accelerators are a scarce resource whose performance must be continually optimized to improve efficiency. Existing performance analysis tools are coarse grained, and fail to capture model performance at the…

Performance · Computer Science 2025-03-20 Ioannis Zarkadas , Amanda Tomlinson , Asaf Cidon , Baris Kasikci , Ofir Weisse

Big data benchmark suites must include a diversity of data and workloads to be useful in fairly evaluating big data systems and architectures. However, using truly comprehensive benchmarks poses great challenges for the architecture…

Performance · Computer Science 2016-11-15 Zhen Jia , Jianfeng Zhan , Lei Wang , Rui Han , Sally A. McKee , Qiang Yang , Chunjie Luo , Jingwei Li

The exponential increase in complex IPs within modern SoCs, driven by Moore's Law, has created a pressing need for fast and accurate hardware-software power-performance analysis. Traditional performance simulators (such as cycle accurate…

Hardware Architecture · Computer Science 2026-03-23 Avery Johnson , Mohammad Majharul Islam , Riad Akram , Abdullah Muzahid

The server central processing unit (CPU) market continues to exhibit robust demand due to the rising global need for computing power. Against this backdrop, CPU benchmark performance prediction is crucial for architecture designers. It…

Performance · Computer Science 2024-10-29 Xiaoman Liu

The past few years have seen a surge of applying Deep Learning (DL) models for a wide array of tasks such as image classification, object detection, machine translation, etc. While DL models provide an opportunity to solve otherwise…

Machine Learning · Computer Science 2021-03-02 Cheng Li , Abdul Dakkak , Jinjun Xiong , Wen-mei Hwu

Recent advances in probabilistic modelling have led to a large number of simulation-based inference algorithms which do not require numerical evaluation of likelihoods. However, a public benchmark with appropriate performance metrics for…

Machine Learning · Statistics 2021-04-12 Jan-Matthis Lueckmann , Jan Boelts , David S. Greenberg , Pedro J. Gonçalves , Jakob H. Macke

The rise of big data systems has created a need for benchmarks to measure and compare the capabilities of these systems. Big data benchmarks present unique scalability challenges. The supercomputing community has wrestled with these…

Performance · Computer Science 2016-12-13 Patrick Dreher , Chansup Byun , Chris Hill , Vijay Gadepally , Bradley Kuszmaul , Jeremy Kepner

Progress in language model development is often driven by comparative decisions: which architecture to adopt, which pretraining corpus to use, or which training recipe to apply. Making these decisions well requires reliable performance…

Computation and Language · Computer Science 2026-05-19 Arkil Patel , Siva Reddy , Marius Mosbach , Dzmitry Bahdanau

Power consumption costs takes upto half of operational expenses of datacenters making power management a critical concern. Advances in processor technology provide fine-grained control over operating frequency and voltage of processors and…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-11-13 Swetha P. T. Srinivasan , Umesh Bellur

Proxy-apps, or mini-apps, are simple self-contained benchmark codes with performance-relevant kernels extracted from real applications. Initially used to facilitate software-hardware co-design, they are a crucial ingredient for serious…

Computational Physics · Physics 2022-07-28 Rafael Ravedutti Lucio Machado , Jan Eitzinger , Harald Köstler , Gerhard Wellein

We propose a simulation-based approach for performance modeling of parallel applications on high-performance computing platforms. Our approach enables full-system performance modeling: (1) the hardware platform is represented by an abstract…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-11-06 Gen Xu , Huda Ibeid , Xin Jiang , Vjekoslav Svilan , Zhaojuan Bian
‹ Prev 1 2 3 10 Next ›