MLPerf Inference Benchmark

Vijay Janapa Reddi; Christine Cheng; David Kanter; Peter Mattson; Guenther Schmuelling; Carole-Jean Wu; Brian Anderson; Maximilien Breughe; Mark Charlebois; William Chou; Ramesh Chukka; Cody Coleman; Sam Davis; Pan Deng; Greg Diamos; Jared Duke; Dave Fick; J. Scott Gardner; Itay Hubara; Sachin Idgunji; Thomas B. Jablin; Jeff Jiao; Tom St. John; Pankaj Kanwar; David Lee; Jeffery Liao; Anton Lokhmotov; Francisco Massa; Peng Meng; Paulius Micikevicius; Colin Osborne; Gennady Pekhimenko; Arun Tejusve Raghunath Rajan; Dilip Sequeira; Ashish Sirasao; Fei Sun; Hanlin Tang; Michael Thomson; Frank Wei; Ephrem Wu; Lingjie Xu; Koichi Yamada; Bing Yu; George Yuan; Aaron Zhong; Peizhao Zhang; Yuchen Zhou

MLPerf Inference Benchmark

Machine Learning 2020-05-12 v2 Performance Machine Learning

Authors: Vijay Janapa Reddi , Christine Cheng , David Kanter , Peter Mattson , Guenther Schmuelling , Carole-Jean Wu , Brian Anderson , Maximilien Breughe , Mark Charlebois , William Chou , Ramesh Chukka , Cody Coleman , Sam Davis , Pan Deng , Greg Diamos , Jared Duke , Dave Fick , J. Scott Gardner , Itay Hubara , Sachin Idgunji , Thomas B. Jablin , Jeff Jiao , Tom St. John , Pankaj Kanwar , David Lee , Jeffery Liao , Anton Lokhmotov , Francisco Massa , Peng Meng , Paulius Micikevicius , Colin Osborne , Gennady Pekhimenko , Arun Tejusve Raghunath Rajan , Dilip Sequeira , Ashish Sirasao , Fei Sun , Hanlin Tang , Michael Thomson , Frank Wei , Ephrem Wu , Lingjie Xu , Koichi Yamada , Bing Yu , George Yuan , Aaron Zhong , Peizhao Zhang , Yuchen Zhou

View on arXiv ↗ PDF ↗

Abstract

Machine-learning (ML) hardware and software system demand is burgeoning. Driven by ML applications, the number of different ML inference systems has exploded. Over 100 organizations are building ML inference chips, and the systems that incorporate existing models span at least three orders of magnitude in power consumption and five orders of magnitude in performance; they range from embedded devices to data-center solutions. Fueling the hardware are a dozen or more software frameworks and libraries. The myriad combinations of ML hardware and ML software make assessing ML-system performance in an architecture-neutral, representative, and reproducible manner challenging. There is a clear need for industry-wide standard ML benchmarking and evaluation criteria. MLPerf Inference answers that call. In this paper, we present our benchmarking method for evaluating ML inference systems. Driven by more than 30 organizations as well as more than 200 ML engineers and practitioners, MLPerf prescribes a set of rules and best practices to ensure comparability across systems with wildly differing architectures. The first call for submissions garnered more than 600 reproducible inference-performance measurements from 14 organizations, representing over 30 systems that showcase a wide range of capabilities. The submissions attest to the benchmark's flexibility and adaptability.

Keywords

unified modeling language large language model inference benchmark evaluation

Cite

@article{arxiv.1911.02549,
  title  = {MLPerf Inference Benchmark},
  author = {Vijay Janapa Reddi and Christine Cheng and David Kanter and Peter Mattson and Guenther Schmuelling and Carole-Jean Wu and Brian Anderson and Maximilien Breughe and Mark Charlebois and William Chou and Ramesh Chukka and Cody Coleman and Sam Davis and Pan Deng and Greg Diamos and Jared Duke and Dave Fick and J. Scott Gardner and Itay Hubara and Sachin Idgunji and Thomas B. Jablin and Jeff Jiao and Tom St. John and Pankaj Kanwar and David Lee and Jeffery Liao and Anton Lokhmotov and Francisco Massa and Peng Meng and Paulius Micikevicius and Colin Osborne and Gennady Pekhimenko and Arun Tejusve Raghunath Rajan and Dilip Sequeira and Ashish Sirasao and Fei Sun and Hanlin Tang and Michael Thomson and Frank Wei and Ephrem Wu and Lingjie Xu and Koichi Yamada and Bing Yu and George Yuan and Aaron Zhong and Peizhao Zhang and Yuchen Zhou},
  journal= {arXiv preprint arXiv:1911.02549},
  year   = {2020}
}

Comments

ISCA 2020

MLPerf Inference Benchmark

Abstract

Keywords

Cite

Comments

Related papers