Related papers: MLPerf Mobile Inference Benchmark

200 papers

Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that…

Machine learning (ML) needs industry-standard performance benchmarks to support design and competitive evaluation of the many emerging software and hardware solutions for ML. But ML training presents three unique benchmarking challenges…

We present MLPerf Automotive, the first standardized public benchmark for evaluating Machine Learning systems that are deployed for AI acceleration in automotive systems. Developed through a collaborative partnership between MLCommons and…

The plethora of complex artificial intelligence (AI) algorithms and available high performance computing (HPC) power stimulates the expeditious development of AI components with heterogeneous designs. Consequently, the need for cross-stack…

Distributed, Parallel, and Cluster Computing · Computer Science · 2021-03-16 · Zhixiang Ren, Yongheng Liu, Tianhui Shi, Lei Xie, Yue Zhou, Jidong Zhai, Youhui Zhang, Yunquan Zhang, Wenguang Chen

The deployment of Large Language Models (LLMs) and Large Multimodal Models (LMMs) on mobile devices has gained significant attention due to the benefits of enhanced privacy, stability, and personalization. However, the hardware constraints…

Rapid advancements in large language models (LLMs) have increased interest in deploying them on mobile devices for on-device AI applications. Mobile users interact differently with LLMs compared to desktop users, creating unique…

Computation and Language · Computer Science · 2025-03-27 · Sondos Mahmoud Bsharat, Mukul Ranjan, Aidar Myrzakhan, Jiacheng Liu, Bowei Guo, Shengkun Tang, Zhuang Liu, Yuanzhi Li, Zhiqiang Shen

Deploying large language models (LLMs) locally on mobile devices is advantageous in scenarios where transmitting data to remote cloud servers is either undesirable due to privacy concerns or impractical due to network connection. Recent…

With society's growing adoption of machine learning (ML) and deep learning (DL) for various intelligent solutions, it becomes increasingly imperative to standardize a common set of measures for ML/DL models with large scale open…

Machine Learning · Computer Science · 2025-04-24 · Yen-Hsiang Chang, Jianhao Pu, Wen-mei Hwu, Jinjun Xiong

Large Language Models (LLMs) are increasingly integrated into everyday applications, but their prevalent cloud-based deployment raises growing concerns around data privacy and long-term sustainability. Running LLMs locally on mobile and…

Machine Learning · Computer Science · 2025-10-08 · Haoxin Wang, Xiaolong Tu, Hongyu Ke, Huirong Chai, Dawei Chen, Kyungtae Han

Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges,…

MLPerf, an emerging machine learning benchmark suite, strives to cover a broad range of applications of machine learning. We present a study on its characteristics and how the MLPerf benchmarks differ from some of the previous deep learning…

Machine Learning · Computer Science · 2019-08-27 · Snehil Verma, Qinzhe Wu, Bagus Hanindhito, Gunjan Jha, Eugene B. John, Ramesh Radhakrishnan, Lizy K. John

Advancements in ultra-low-power tiny machine learning (TinyML) systems promise to unlock an entirely new class of smart applications. However, continued progress is limited by the lack of a widely accepted and easily reproducible benchmark…

Evaluating the performance of LLMs in multi-turn human-agent interactions presents significant challenges, particularly due to the complexity and variability of user behavior. In this paper, we introduce HammerBench, a novel benchmark…

Computation and Language · Computer Science · 2025-02-18 · Jun Wang, Jiamu Zhou, Muning Wen, Xiaoyun Mo, Haoyu Zhang, Qiqiang Lin, Cheng Jin, Xihuai Wang, Weinan Zhang, Qiuying Peng, Jun Wang

We present MobileVLM, a competent multimodal vision language model (MMVLM) targeted to run on mobile devices. It is an amalgamation of a myriad of architectural designs and techniques that are mobile-oriented, which comprises a set of…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-02 · Xiangxiang Chu, Limeng Qiao, Xinyang Lin, Shuang Xu, Yang Yang, Yiming Hu, Fei Wei, Xinyu Zhang, Bo Zhang, Xiaolin Wei, Chunhua Shen

Large language model (LLM)-based mobile agents are increasingly popular due to their capability to interact directly with mobile phone Graphical User Interfaces (GUIs) and their potential to autonomously manage daily tasks. Despite their…

Artificial Intelligence · Computer Science · 2024-06-13 · Luyuan Wang, Yongyu Deng, Yiwei Zha, Guodong Mao, Qinmin Wang, Tianchen Min, Wei Chen, Shoufa Chen

We introduce a machine-learning (ML) framework for high-throughput benchmarking of diverse representations of chemical systems against datasets of materials and molecules. The guiding principle underlying the benchmarking approach is to…

Machine Learning · Computer Science · 2021-12-07 · Carl Poelking, Felix A. Faber, Bingqing Cheng

Machine learning (ML) is the field of training machines to achieve a high level of cognition and perform human-like analysis. Since ML is a data-driven approach, it seemingly fits into our daily lives and operations as well as complex and…

Machine Learning · Computer Science · 2021-11-25 · M. Z. Naser, Amir Alavi

Large language models (LLMs) have demonstrated exceptional performance across a variety of tasks. However, their substantial scale leads to significant computational resource consumption during inference, resulting in high costs.…

Machine Learning · Computer Science · 2025-06-13 · Zhaode Wang, Jingbang Yang, Xinyu Qian, Shiwen Xing, Xiaotang Jiang, Chengfei Lv, Shengyu Zhang

The ever-increasing demand from mobile Machine Learning (ML) applications calls for ever more powerful on-chip computing resources. Mobile devices are empowered with heterogeneous multi-processor Systems-on-Chips (SoCs) to process ML…

Machine Learning · Computer Science · 2021-02-03 · Siqi Wang, Anuj Pathania, Tulika Mitra