Related papers: MLPerf Inference Benchmark

MLPerf Mobile Inference Benchmark

This paper presents the first industry-standard open-source machine learning (ML) benchmark to allow perfor mance and accuracy evaluation of mobile devices with different AI chips and software stacks. The benchmark draws from the expertise…

Machine Learning · Computer Science 2022-04-07 Vijay Janapa Reddi , David Kanter , Peter Mattson , Jared Duke , Thai Nguyen , Ramesh Chukka , Ken Shiring , Koan-Sin Tan , Mark Charlebois , William Chou , Mostafa El-Khamy , Jungwook Hong , Tom St. John , Cindy Trinh , Michael Buch , Mark Mazumder , Relia Markovic , Thomas Atta , Fatih Cakir , Masoud Charkhabi , Xiaodong Chen , Cheng-Ming Chiang , Dave Dexter , Terry Heo , Gunther Schmuelling , Maryam Shabani , Dylan Zika

MLPerf Training Benchmark

Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges…

Machine Learning · Computer Science 2020-03-03 Peter Mattson , Christine Cheng , Cody Coleman , Greg Diamos , Paulius Micikevicius , David Patterson , Hanlin Tang , Gu-Yeon Wei , Peter Bailis , Victor Bittorf , David Brooks , Dehao Chen , Debojyoti Dutta , Udit Gupta , Kim Hazelwood , Andrew Hock , Xinyuan Huang , Atsushi Ike , Bill Jia , Daniel Kang , David Kanter , Naveen Kumar , Jeffery Liao , Guokai Ma , Deepak Narayanan , Tayo Oguntebi , Gennady Pekhimenko , Lillian Pentecost , Vijay Janapa Reddi , Taylor Robie , Tom St. John , Tsuguchika Tabaru , Carole-Jean Wu , Lingjie Xu , Masafumi Yamazaki , Cliff Young , Matei Zaharia

MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI

Rapid adoption of machine learning (ML) technologies has led to a surge in power consumption across diverse systems, from tiny IoT devices to massive datacenter clusters. Benchmarking the energy efficiency of these systems is crucial for…

Hardware Architecture · Computer Science 2025-02-07 Arya Tschand , Arun Tejusve Raghunath Rajan , Sachin Idgunji , Anirban Ghosh , Jeremy Holleman , Csaba Kiraly , Pawan Ambalkar , Ritika Borkar , Ramesh Chukka , Trevor Cockrell , Oliver Curtis , Grigori Fursin , Miro Hodak , Hiwot Kassa , Anton Lokhmotov , Dejan Miskovic , Yuechao Pan , Manu Prasad Manmathan , Liz Raymond , Tom St. John , Arjun Suresh , Rowan Taubitz , Sean Zhan , Scott Wasson , David Kanter , Vijay Janapa Reddi

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

Scientific communities are increasingly adopting machine learning and deep learning models in their applications to accelerate scientific insights. High performance computing systems are pushing the frontiers of performance with a rich…

Machine Learning · Computer Science 2021-10-28 Steven Farrell , Murali Emani , Jacob Balma , Lukas Drescher , Aleksandr Drozd , Andreas Fink , Geoffrey Fox , David Kanter , Thorsten Kurth , Peter Mattson , Dawei Mu , Amit Ruhela , Kento Sato , Koichi Shirahata , Tsuguchika Tabaru , Aristeidis Tsaris , Jan Balewski , Ben Cumming , Takumi Danjo , Jens Domke , Takaaki Fukai , Naoto Fukumoto , Tatsuya Fukushi , Balazs Gerofi , Takumi Honda , Toshiyuki Imamura , Akihiko Kasagi , Kentaro Kawakami , Shuhei Kudo , Akiyoshi Kuroda , Maxime Martinasso , Satoshi Matsuoka , Henrique Mendonça , Kazuki Minami , Prabhat Ram , Takashi Sawada , Mallikarjun Shankar , Tom St. John , Akihiro Tabuchi , Venkatram Vishwanath , Mohamed Wahib , Masafumi Yamazaki , Junqi Yin

MLPerf Automotive

We present MLPerf Automotive, the first standardized public benchmark for evaluating Machine Learning systems that are deployed for AI acceleration in automotive systems. Developed through a collaborative partnership between MLCommons and…

Machine Learning · Computer Science 2025-11-03 Radoyeh Shojaei , Predrag Djurdjevic , Mostafa El-Khamy , James Goel , Kasper Mecklenburg , John Owens , Pınar Muyan-Özçelik , Tom St. John , Jinho Suh , Arjun Suresh

Benchmarking Contemporary Deep Learning Hardware and Frameworks:A Survey of Qualitative Metrics

This paper surveys benchmarking principles, machine learning devices including GPUs, FPGAs, and ASICs, and deep learning software frameworks. It also reviews these technologies with respect to benchmarking from the perspectives of a…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-19 Wei Dai , Daniel Berleant

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges,…

Machine Learning · Computer Science 2024-11-04 Krishna Teja Chitty-Venkata , Siddhisanket Raskar , Bharat Kale , Farah Ferdaus , Aditya Tanikanti , Ken Raffenetti , Valerie Taylor , Murali Emani , Venkatram Vishwanath

Benchmarking TinyML Systems: Challenges and Direction

Recent advancements in ultra-low-power machine learning (TinyML) hardware promises to unlock an entirely new class of smart applications. However, continued progress is limited by the lack of a widely accepted benchmark for these systems.…

Performance · Computer Science 2021-02-02 Colby R. Banbury , Vijay Janapa Reddi , Max Lam , William Fu , Amin Fazel , Jeremy Holleman , Xinyuan Huang , Robert Hurtado , David Kanter , Anton Lokhmotov , David Patterson , Danilo Pau , Jae-sun Seo , Jeff Sieracki , Urmish Thakker , Marian Verhelst , Poonam Yadav

Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

Tremendous success of machine learning (ML) and the unabated growth in ML model complexity motivated many ML-specific designs in both CPU and accelerator architectures to speed up the model inference. While these architectures are diverse,…

Machine Learning · Computer Science 2021-05-27 Zhaoxia , Deng , Jongsoo Park , Ping Tak Peter Tang , Haixin Liu , Jie , Yang , Hector Yuen , Jianyu Huang , Daya Khudia , Xiaohan Wei , Ellie Wen , Dhruv Choudhary , Raghuraman Krishnamoorthi , Carole-Jean Wu , Satish Nadathur , Changkyu Kim , Maxim Naumov , Sam Naghshineh , Mikhail Smelyanskiy

MLHarness: A Scalable Benchmarking System for MLCommons

With the society's growing adoption of machine learning (ML) and deep learning (DL) for various intelligent solutions, it becomes increasingly imperative to standardize a common set of measures for ML/DL models with large scale open…

Machine Learning · Computer Science 2025-04-24 Yen-Hsiang Chang , Jianhao Pu , Wen-mei Hwu , Jinjun Xiong

Demystifying the MLPerf Benchmark Suite

MLPerf, an emerging machine learning benchmark suite strives to cover a broad range of applications of machine learning. We present a study on its characteristics and how the MLPerf benchmarks differ from some of the previous deep learning…

Machine Learning · Computer Science 2019-08-27 Snehil Verma , Qinzhe Wu , Bagus Hanindhito , Gunjan Jha , Eugene B. John , Ramesh Radhakrishnan , Lizy K. John

MLPerf Tiny Benchmark

Advancements in ultra-low-power tiny machine learning (TinyML) systems promise to unlock an entirely new class of smart applications. However, continued progress is limited by the lack of a widely accepted and easily reproducible benchmark…

Machine Learning · Computer Science 2021-08-26 Colby Banbury , Vijay Janapa Reddi , Peter Torelli , Jeremy Holleman , Nat Jeffries , Csaba Kiraly , Pietro Montino , David Kanter , Sebastian Ahmed , Danilo Pau , Urmish Thakker , Antonio Torrini , Peter Warden , Jay Cordaro , Giuseppe Di Guglielmo , Javier Duarte , Stephen Gibellini , Videet Parekh , Honson Tran , Nhan Tran , Niu Wenxu , Xu Xuesong

The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization

As the adoption of Generative AI in real-world services grow explosively, energy has emerged as a critical bottleneck resource. However, energy remains a metric that is often overlooked, under-explored, or poorly understood in the context…

Machine Learning · Computer Science 2025-10-17 Jae-Won Chung , Jeff J. Ma , Ruofan Wu , Jiachen Liu , Oh Jun Kweon , Yuxuan Xia , Zhiyu Wu , Mosharaf Chowdhury

A Survey of LLM Inference Systems

The past few years has witnessed specialized large language model (LLM) inference systems, such as vLLM, SGLang, Mooncake, and DeepFlow, alongside rapid LLM adoption via services like ChatGPT. Driving these system design efforts is the…

Databases · Computer Science 2025-06-30 James Pan , Guoliang Li

A Holistic Assessment of the Reliability of Machine Learning Systems

As machine learning (ML) systems increasingly permeate high-stakes settings such as healthcare, transportation, military, and national security, concerns regarding their reliability have emerged. Despite notable progress, the performance of…

Machine Learning · Computer Science 2023-08-01 Anthony Corso , David Karamadian , Romeo Valentin , Mary Cooper , Mykel J. Kochenderfer

InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System

Deep learning (DL) models have become core modules for many applications. However, deploying these models without careful performance benchmarking that considers both hardware and software's impact often leads to poor service and costly…

Machine Learning · Computer Science 2021-01-06 Huaizheng Zhang , Yizheng Huang , Yonggang Wen , Jianxiong Yin , Kyle Guan

Benchmarking Safety Monitors for Image Classifiers with Machine Learning

High-accurate machine learning (ML) image classifiers cannot guarantee that they will not fail at operation. Thus, their deployment in safety-critical applications such as autonomous vehicles is still an open issue. The use of fault…

Artificial Intelligence · Computer Science 2021-10-05 Raul Sena Ferreira , Jean Arlat , Jeremie Guiochet , Hélène Waeselynck

The Silent Hyperparameter: Quantifying the Impact of Inference Backends on LLM Reproducibility

Progress in LLMs is increasingly measured through standardized benchmarks, where state-of-the-art improvements are often separated by fractions of a percentage point. At the same time, the computational cost of evaluating modern LLMs has…

Machine Learning · Computer Science 2026-05-21 David Pape , Jonathan Evertz , Lea Schönherr

Towards Perspective-Based Specification of Machine Learning-Enabled Systems

Machine learning (ML) teams often work on a project just to realize the performance of the model is not good enough. Indeed, the success of ML-enabled systems involves aligning data with business problems, translating them into ML tasks,…

Software Engineering · Computer Science 2022-06-22 Hugo Villamizar , Marcos Kalinowski , Helio Lopes

AIPerf: Automated machine learning as an AI-HPC benchmark

The plethora of complex artificial intelligence (AI) algorithms and available high performance computing (HPC) power stimulates the expeditious development of AI components with heterogeneous designs. Consequently, the need for cross-stack…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-03-16 Zhixiang Ren , Yongheng Liu , Tianhui Shi , Lei Xie , Yue Zhou , Jidong Zhai , Youhui Zhang , Yunquan Zhang , Wenguang Chen