Related papers: MLPerf Mobile Inference Benchmark

MLPerf Inference Benchmark

Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that…

Machine Learning · Computer Science · 2020-05-12 · Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee, Jeffery Liao, Anton Lokhmotov, Francisco Massa, Peng Meng, Paulius Micikevicius, Colin Osborne, Gennady Pekhimenko, Arun Tejusve Raghunath Rajan, Dilip Sequeira, Ashish Sirasao, Fei Sun, Hanlin Tang, Michael Thomson, Frank Wei, Ephrem Wu, Lingjie Xu, Koichi Yamada, Bing Yu, George Yuan, Aaron Zhong, Peizhao Zhang, Yuchen Zhou

MLPerf Training Benchmark

Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges…

Machine Learning · Computer Science · 2020-03-03 · Peter Mattson, Christine Cheng, Cody Coleman, Greg Diamos, Paulius Micikevicius, David Patterson, Hanlin Tang, Gu-Yeon Wei, Peter Bailis, Victor Bittorf, David Brooks, Dehao Chen, Debojyoti Dutta, Udit Gupta, Kim Hazelwood, Andrew Hock, Xinyuan Huang, Atsushi Ike, Bill Jia, Daniel Kang, David Kanter, Naveen Kumar, Jeffery Liao, Guokai Ma, Deepak Narayanan, Tayo Oguntebi, Gennady Pekhimenko, Lillian Pentecost, Vijay Janapa Reddi, Taylor Robie, Tom St. John, Tsuguchika Tabaru, Carole-Jean Wu, Lingjie Xu, Masafumi Yamazaki, Cliff Young, Matei Zaharia

MLPerf Automotive

We present MLPerf Automotive, the first standardized public benchmark for evaluating Machine Learning systems that are deployed for AI acceleration in automotive systems. Developed through a collaborative partnership between MLCommons and…

Machine Learning · Computer Science · 2025-11-03 · Radoyeh Shojaei, Predrag Djurdjevic, Mostafa El-Khamy, James Goel, Kasper Mecklenburg, John Owens, Pınar Muyan-Özçelik, Tom St. John, Jinho Suh, Arjun Suresh

AIPerf: Automated machine learning as an AI-HPC benchmark

The plethora of complex artificial intelligence (AI) algorithms and available high performance computing (HPC) power stimulates the expeditious development of AI components with heterogeneous designs. Consequently, the need for cross-stack…

Distributed, Parallel, and Cluster Computing · Computer Science · 2021-03-16 · Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen

MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases

The deployment of Large Language Models (LLMs) and Large Multimodal Models (LMMs) on mobile devices has gained significant attention due to the benefits of enhanced privacy, stability, and personalization. However, the hardware constraints…

Computation and Language · Computer Science · 2024-06-18 · Rithesh Murthy, Liangwei Yang, Juntao Tan, Tulika Manoj Awalgaonkar, Yilun Zhou, Shelby Heinecke, Sachin Desai, Jason Wu, Ran Xu, Sarah Tan, Jianguo Zhang, Zhiwei Liu, Shirley Kokane, Zuxin Liu, Ming Zhu, Huan Wang, Caiming Xiong, Silvio Savarese

Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark

Rapid advancements in large language models (LLMs) have increased interest in deploying them on mobile devices for on-device AI applications. Mobile users interact differently with LLMs compared to desktop users, creating unique…

Computation and Language · Computer Science · 2025-03-27 · Sondos Mahmoud Bsharat, Mukul Ranjan, Aidar Myrzakhan, Jiacheng Liu, Bowei Guo, Shengkun Tang, Zhuang Liu, Yuanzhi Li, Zhiqiang Shen

PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms

Deploying large language models (LLMs) locally on mobile devices is advantageous in scenarios where transmitting data to remote cloud servers is either undesirable due to privacy concerns or impractical due to network connection. Recent…

Machine Learning · Computer Science · 2025-01-10 · Yilong Li, Jingyu Liu, Hao Zhang, M Badri Narayanan, Utkarsh Sharma, Shuai Zhang, Pan Hu, Yijing Zeng, Jayaram Raghuram, Suman Banerjee

MLHarness: A Scalable Benchmarking System for MLCommons

With society's growing adoption of machine learning (ML) and deep learning (DL) for various intelligent solutions, it becomes increasingly imperative to standardize a common set of measures for ML/DL models with large scale open…

Machine Learning · Computer Science · 2025-04-24 · Yen-Hsiang Chang, Jianhao Pu, Wen-mei Hwu, Jinjun Xiong

MLPerf HPC: A Holistic Benchmark Suite for Scientific Machine Learning on HPC Systems

Scientific communities are increasingly adopting machine learning and deep learning models in their applications to accelerate scientific insights. High performance computing systems are pushing the frontiers of performance with a rich…

Machine Learning · Computer Science · 2021-10-28 · Steven Farrell, Murali Emani, Jacob Balma, Lukas Drescher, Aleksandr Drozd, Andreas Fink, Geoffrey Fox, David Kanter, Thorsten Kurth, Peter Mattson, Dawei Mu, Amit Ruhela, Kento Sato, Koichi Shirahata, Tsuguchika Tabaru, Aristeidis Tsaris, Jan Balewski, Ben Cumming, Takumi Danjo, Jens Domke, Takaaki Fukai, Naoto Fukumoto, Tatsuya Fukushi, Balazs Gerofi, Takumi Honda, Toshiyuki Imamura, Akihiko Kasagi, Kentaro Kawakami, Shuhei Kudo, Akiyoshi Kuroda, Maxime Martinasso, Satoshi Matsuoka, Henrique Mendonça, Kazuki Minami, Prabhat Ram, Takashi Sawada, Mallikarjun Shankar, Tom St. John, Akihiro Tabuchi, Venkatram Vishwanath, Mohamed Wahib, Masafumi Yamazaki, Junqi Yin

lm-Meter: Unveiling Runtime Inference Latency for On-Device Language Models

Large Language Models (LLMs) are increasingly integrated into everyday applications, but their prevalent cloud-based deployment raises growing concerns around data privacy and long-term sustainability. Running LLMs locally on mobile and…

Machine Learning · Computer Science · 2025-10-08 · Haoxin Wang, Xiaolong Tu, Hongyu Ke, Huirong Chai, Dawei Chen, Kyungtae Han

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges,…

Machine Learning · Computer Science · 2024-11-04 · Krishna Teja Chitty-Venkata, Siddhisanket Raskar, Bharat Kale, Farah Ferdaus, Aditya Tanikanti, Ken Raffenetti, Valerie Taylor, Murali Emani, Venkatram Vishwanath

Demystifying the MLPerf Benchmark Suite

MLPerf, an emerging machine learning benchmark suite, strives to cover a broad range of applications of machine learning. We present a study on its characteristics and how the MLPerf benchmarks differ from some of the previous deep learning…

Machine Learning · Computer Science · 2019-08-27 · Snehil Verma, Qinzhe Wu, Bagus Hanindhito, Gunjan Jha, Eugene B. John, Ramesh Radhakrishnan, Lizy K. John

MLPerf Tiny Benchmark

Advancements in ultra-low-power tiny machine learning (TinyML) systems promise to unlock an entirely new class of smart applications. However, continued progress is limited by the lack of a widely accepted and easily reproducible benchmark…

Machine Learning · Computer Science · 2021-08-26 · Colby Banbury, Vijay Janapa Reddi, Peter Torelli, Jeremy Holleman, Nat Jeffries, Csaba Kiraly, Pietro Montino, David Kanter, Sebastian Ahmed, Danilo Pau, Urmish Thakker, Antonio Torrini, Peter Warden, Jay Cordaro, Giuseppe Di Guglielmo, Javier Duarte, Stephen Gibellini, Videet Parekh, Honson Tran, Nhan Tran, Niu Wenxu, Xu Xuesong

HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device Scenarios

Evaluating the performance of LLMs in multi-turn human-agent interactions presents significant challenges, particularly due to the complexity and variability of user behavior. In this paper, we introduce HammerBench, a novel benchmark…

Computation and Language · Computer Science · 2025-02-18 · Jun Wang, Jiamu Zhou, Muning Wen, Xiaoyun Mo, Haoyu Zhang, Qiqiang Lin, Cheng Jin, Xihuai Wang, Weinan Zhang, Qiuying Peng, Jun Wang

MobileVLM: A Fast, Strong and Open Vision Language Assistant for Mobile Devices

We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted to run on mobile devices. It is an amalgamation of a myriad of architectural designs and techniques that are mobile-oriented, which comprises a set of…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-02 · Xiangxiang Chu, Limeng Qiao, Xinyang Lin, Shuang Xu, Yang Yang, Yiming Hu, Fei Wei, Xinyu Zhang, Bo Zhang, Xiaolin Wei, Chunhua Shen

MobileAgentBench: An Efficient and User-Friendly Benchmark for Mobile LLM Agents

Large language model (LLM)-based mobile agents are increasingly popular due to their capability to interact directly with mobile phone Graphic User Interfaces (GUIs) and their potential to autonomously manage daily tasks. Despite their…

Artificial Intelligence · Computer Science · 2024-06-13 · Luyuan Wang, Yongyu Deng, Yiwei Zha, Guodong Mao, Qinmin Wang, Tianchen Min, Wei Chen, Shoufa Chen

BenchML: an extensible pipelining framework for benchmarking representations of materials and molecules at scale

We introduce a machine-learning (ML) framework for high-throughput benchmarking of diverse representations of chemical systems against datasets of materials and molecules. The guiding principle underlying the benchmarking approach is to…

Machine Learning · Computer Science · 2021-12-07 · Carl Poelking, Felix A. Faber, Bingqing Cheng

Insights into Performance Fitness and Error Metrics for Machine Learning

Machine learning (ML) is the field of training machines to achieve a high level of cognition and perform human-like analysis. Since ML is a data-driven approach, it seemingly fits into our daily lives and operations as well as complex and…

Machine Learning · Computer Science · 2021-11-25 · M. Z. Naser, Amir Alavi

MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices

Large language models (LLMs) have demonstrated exceptional performance across a variety of tasks. However, their substantial scale leads to significant computational resource consumption during inference, resulting in high costs.…

Machine Learning · Computer Science · 2025-06-13 · Zhaode Wang, Jingbang Yang, Xinyu Qian, Shiwen Xing, Xiaotang Jiang, Chengfei Lv, Shengyu Zhang

Neural Network Inference on Mobile SoCs

The ever-increasing demand from mobile Machine Learning (ML) applications calls for evermore powerful on-chip computing resources. Mobile devices are empowered with heterogeneous multi-processor Systems-on-Chips (SoCs) to process ML…

Machine Learning · Computer Science · 2021-02-03 · Siqi Wang, Anuj Pathania, Tulika Mitra