Related papers: Accelerating TinyML Inference on Microcontrollers …

On-Sensor Convolutional Neural Networks with Early-Exits

Tiny Machine Learning (TinyML) is a novel research field aiming at integrating Machine Learning (ML) within embedded devices with limited memory, computation, and energy. Recently, a new branch of TinyML has emerged, focusing on integrating…

Machine Learning · Computer Science 2025-06-03 Hazem Hesham Yousef Shalby , Arianna De Vecchi , Alice Scandelli , Pietro Bartoli , Diana Trojaniello , Manuel Roveri , Federica Villa

Co-Design of CNN Accelerators for TinyML using Approximate Matrix Decomposition

The paradigm shift towards local and on-device inference under stringent resource constraints is represented by the tiny machine learning (TinyML) domain. The primary goal of TinyML is to integrate intelligence into tiny, low-cost devices…

Hardware Architecture · Computer Science 2026-04-24 José Juan Hernández Morales , Georgios Mentzos , Frank Hannig , Konstantinos Balaskas , Georgios Zervakis , Jörg Henkel , Jürgen Teich

Split CNN Inference on Networked Microcontrollers

Running deep neural networks on microcontroller units (MCUs) is severely constrained by limited memory resources. While TinyML techniques reduce model size and computation, they often fail in practice due to excessive peak Random Access…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-12 Junyu Lu , Shashwath Suresh , Hao Liu , Qi Hong , Qing Wang

vMCU: Coordinated Memory Management and Kernel Optimization for DNN Inference on MCUs

IoT devices based on microcontroller units (MCU) provide ultra-low power consumption and ubiquitous computation for near-sensor deep learning models (DNN). However, the memory of MCU is usually 2-3 orders of magnitude smaller than mobile…

Hardware Architecture · Computer Science 2024-06-12 Size Zheng , Renze Chen , Meng Li , Zihao Ye , Luis Ceze , Yun Liang

MinUn: Accurate ML Inference on Microcontrollers

Running machine learning inference on tiny devices, known as TinyML, is an emerging research area. This task requires generating inference code that uses memory frugally, a task that standard ML frameworks are ill-suited for. A deployment…

Machine Learning · Computer Science 2022-12-01 Shikhar Jaiswal , Rahul Kiran Kranti Goli , Aayan Kumar , Vivek Seshadri , Rahul Sharma

Measuring what Really Matters: Optimizing Neural Networks for TinyML

With the surge of inexpensive computational and memory resources, neural networks (NNs) have experienced an unprecedented growth in architectural and computational complexity. Introducing NNs to resource-constrained devices enables…

Machine Learning · Computer Science 2021-04-22 Lennart Heim , Andreas Biri , Zhongnan Qu , Lothar Thiele

Tiny Machine Learning: Progress and Futures

Tiny Machine Learning (TinyML) is a new frontier of machine learning. By squeezing deep learning models into billions of IoT devices and microcontrollers (MCUs), we expand the scope of AI applications and enable ubiquitous intelligence.…

Machine Learning · Computer Science 2024-04-02 Ji Lin , Ligeng Zhu , Wei-Ming Chen , Wei-Chen Wang , Song Han

Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review

The field of Tiny Machine Learning (TinyML) has gained significant attention due to its potential to enable intelligent applications on resource-constrained devices. This review provides an in-depth analysis of the advancements in efficient…

Machine Learning · Statistics 2023-11-21 Minh Tri Lê , Pierre Wolinski , Julyan Arbel

Neural Network Quantization for Microcontrollers: A Comprehensive Survey of Methods, Platforms, and Applications

The deployment of Quantized Neural Networks (QNNs) on resource-constrained edge devices, such as microcontrollers (MCUs), introduces fundamental challenges in balancing model performance, computational complexity, and memory constraints.…

Machine Learning · Computer Science 2026-01-08 Hamza A. Abushahla , Dara Varam , Ariel Justine N. Panopio , Mohamed I. AlHajri

MCUNet: Tiny Deep Learning on IoT Devices

Machine learning on tiny IoT devices based on microcontroller units (MCU) is appealing but challenging: the memory of microcontrollers is 2-3 orders of magnitude smaller even than mobile phones. We propose MCUNet, a framework that jointly…

Computer Vision and Pattern Recognition · Computer Science 2020-11-20 Ji Lin , Wei-Ming Chen , Yujun Lin , John Cohn , Chuang Gan , Song Han

Optimizing TinyML: The Impact of Reduced Data Acquisition Rates for Time Series Classification on Microcontrollers

Tiny Machine Learning (TinyML) enables efficient, lowcost, and privacy preserving machine learning inference directly on microcontroller units (MCUs) connected to sensors. Optimizing models for these constrained environments is crucial.…

Machine Learning · Computer Science 2024-09-18 Riya Samanta , Bidyut Saha , Soumya K. Ghosh , Ram Babu Roy

TinyOL: TinyML with Online-Learning on Microcontrollers

Tiny machine learning (TinyML) is a fast-growing research area committed to democratizing deep learning for all-pervasive microcontrollers (MCUs). Challenged by the constraints on power, memory, and computation, TinyML has achieved…

Machine Learning · Computer Science 2021-04-13 Haoyu Ren , Darko Anicic , Thomas Runkler

DEX: Data Channel Extension for Efficient CNN Inference on Tiny AI Accelerators

Tiny machine learning (TinyML) aims to run ML models on small devices and is increasingly favored for its enhanced privacy, reduced latency, and low cost. Recently, the advent of tiny AI accelerators has revolutionized the TinyML field by…

Machine Learning · Computer Science 2024-12-10 Taesik Gong , Fahim Kawsar , Chulhong Min

Kernel Approximation using Analog In-Memory Computing

Kernel functions are vital ingredients of several machine learning algorithms, but often incur significant memory and computational costs. We introduce an approach to kernel approximation in machine learning algorithms suitable for…

Machine Learning · Computer Science 2024-11-07 Julian Büchel , Giacomo Camposampiero , Athanasios Vasilopoulos , Corey Lammie , Manuel Le Gallo , Abbas Rahimi , Abu Sebastian

Benchmarking Energy and Latency in TinyML: A Novel Method for Resource-Constrained AI

The rise of IoT has increased the need for on-edge machine learning, with TinyML emerging as a promising solution for resource-constrained devices such as MCU. However, evaluating their performance remains challenging due to diverse…

Machine Learning · Computer Science 2025-12-01 Pietro Bartoli , Christian Veronesi , Andrea Giudici , David Siorpaes , Diana Trojaniello , Franco Zappa

An Ultra-low Power TinyML System for Real-time Visual Processing at Edge

Tiny machine learning (TinyML), executing AI workloads on resource and power strictly restricted systems, is an important and challenging topic. This brief firstly presents an extremely tiny backbone to construct high efficiency CNN models…

Image and Video Processing · Electrical Eng. & Systems 2023-06-02 Kunran Xu , Huawei Zhang , Yishi Li , Yuhao Zhang , Rui Lai , Yi Liu

Rethinking Temporal Models for TinyML: LSTM versus 1D-CNN in Resource-Constrained Devices

Time series classification underpins applications such as human activity recognition, healthcare monitoring, and gesture detection in the IoT domain. Tiny Machine Learning enables models to run directly on low-power microcontroller units,…

Performance · Computer Science 2026-03-06 Bidyut Saha , Riya Samanta

Benchmarking Ultra-Low-Power $\mu$NPUs

Efficient on-device neural network (NN) inference offers predictable latency, improved privacy and reliability, and lower operating costs for vendors than cloud-based inference. This has sparked recent development of microcontroller-scale…

Machine Learning · Computer Science 2025-11-03 Josh Millar , Yushan Huang , Sarab Sethi , Hamed Haddadi , Anil Madhavapeddy

Evaluating the Energy Efficiency of NPU-Accelerated Machine Learning Inference on Embedded Microcontrollers

The deployment of machine learning (ML) models on microcontrollers (MCUs) is constrained by strict energy, latency, and memory requirements, particularly in battery-operated and real-time edge devices. While software-level optimizations…

Emerging Technologies · Computer Science 2025-09-29 Anastasios Fanariotis , Theofanis Orphanoudakis , Vasilis Fotopoulos

MLonMCU: TinyML Benchmarking with Fast Retargeting

While there exist many ways to deploy machine learning models on microcontrollers, it is non-trivial to choose the optimal combination of frameworks and targets for a given application. Thus, automating the end-to-end benchmarking flow is…

Machine Learning · Computer Science 2024-07-08 Philipp van Kempen , Rafael Stahl , Daniel Mueller-Gritschneder , Ulf Schlichtmann