Blog

May 6, 2021

TinyML Brings AI to Smallest Arm Devices

TinyML focuses on optimizing machine learning (ML) workloads so that they can be processed on microcontrollers no bigger than a grain of rice and consuming only milliwatts of power.

By Arm Editorial Team

TinyML focuses on the optimization of machine learning (ML) workloads so that they can be processed on microcontrollers no bigger than a grain of rice and consuming only a few milliwatts of power.

TinyML gives tiny devices intelligence. We mean tiny in every sense of the word: as tiny as a grain of rice and consuming tiny amounts of power. Supported by Arm, Google, Qualcomm and others, tinyML has the potential to transform the Internet of Things (IoT), where billions of tiny devices, based on Arm chips, are already being used to provide greater insight and efficiency in sectors including consumer, medical, automotive and industrial.

Why target microcontrollers with tinyML?

Microcontrollers such as the Arm Cortex-M family are an ideal platform for ML because they’re already used everywhere. They perform real-time calculations quickly and efficiently, so they’re reliable and responsive, and because they use very little power, can be deployed in places where replacing the battery is difficult or inconvenient. Perhaps even more importantly, they’re cheap enough to be used just about anywhere. The market analyst IDC reports that 28.1 billion microcontrollers were sold in 2018, and forecasts that annual shipment volume will grow to 38.2 billion by 2023.

TinyML on microcontrollers gives us new techniques for analyzing and making sense of the massive amount of data generated by the IoT. In particular, deep learning methods can be used to process information and make sense of the data from sensors that do things like detect sounds, capture images, and track motion.

Advanced pattern recognition in a very compact format

Looking at the math involved in machine learning, data scientists found they could reduce complexity by making certain changes, such as replacing floating-point calculations with simple 8-bit operations. These changes created machine learning models that work much more efficiently and require far fewer processing and memory resources.

TinyML technology is evolving rapidly thanks to new technology and an engaged base of committed developers. Only a few years ago, we were celebrating our ability to run a speech-recognition model capable of waking the system if it detects certain words on a constrained Arm Cortex-M3 microcontroller using just 15 kilobytes (KB) of code and 22KB of data.

Since then, Arm has launched new machine learning (ML) processors, called the Ethos-U55 and Ethos-U65, a microNPU specifically designed to accelerate ML inference in embedded and IoT devices.

The Ethos-U55, combined with the AI-capable Cortex-M55 processor, will provide a significant uplift in ML performance and improvement in energy efficiency over the already impressive examples we are seeing today.

Arm Cortex-M55 and Ethos-U55 in action

TinyML takes endpoint devices to the next level

The potential use cases of tinyML are almost unlimited. Developers are already working with tinyML to explore all sorts of new ideas: responsive traffic lights that change signaling to reduce congestion, industrial machines that can predict when they’ll need service, sensors that can monitor crops for the presence of damaging insects, in-store shelves that can request restocking when inventory gets low, healthcare monitors that track vitals while maintaining privacy. The list goes on.

TinyML can make endpoint devices more consistent and reliable, since there’s less need to rely on busy, crowded internet connections to send data back and forth to the cloud. Reducing or even eliminating interactions with the cloud has major benefits including reduced energy use, significantly reduced latency in processing data and security benefits, since data that doesn’t travel is far less exposed to attack.

It’s worth nothing that these tinyML models, which perform inference on the microcontroller, aren’t intended to replace the more sophisticated inference that currently happens in the cloud. What they do instead is bring specific capabilities down from the cloud to the endpoint device. That way, developers can save cloud interactions for if and when they’re needed.

TinyML also gives developers a powerful new set of tools for solving problems. ML makes it possible to detect complex events that rule-based systems struggle to identify, so endpoint AI devices can start contributing in new ways. Also, since ML makes it possible to control devices with words or gestures, instead of buttons or a smartphone, endpoint devices can be built more rugged and deployable in more challenging operating environments.

TinyML gaining momentum with an expanding ecosystem

Industry players have been quick to recognize the value of tinyML and have moved rapidly to create a supportive ecosystem. Developers at every level, from enthusiastic hobbyists to experienced professionals, can now access tools that make it easy to get started. All that’s needed is a laptop, an open-source software library and a USB cable to connect the laptop to one of several inexpensive development boards priced as low as a few dollars.

In fact, at the start of 2021, Raspberry Pi released its very first microcontroller board, one of the most affordable development board available in the market at just $4. Named Raspberry Pi Pico, it’s powered by the RP2040 SoC, a surprisingly powerful dual Arm Cortex-M0+ processor. The RP2040 MCU is able to run TensorFlow Lite Micro and we’re expecting to see a wide range of ML use cases for this board over the coming months.

Arm is a strong proponent of tinyML because our microcontroller architectures are so central to the IoT, and because we see the potential of on-device inference. Arm’s collaboration with Google is making it even easier for developers to deploy endpoint machine learning in power-conscious environments.

The combination of Arm CMSIS-NN libraries with Google’s TensorFlow Lite Micro (TFLu) framework, allows data scientists and software developers to take advantage of Arm’s hardware optimizations without needing to become experts in embedded programming.

On top of this, Arm is investing in new tools derived from Keil MDK to help developers get from prototype to production when deploying ML applications.

How to accelerate tinyML performance on Arm

TinyML would not be possible without a number of early influencers. Pete Warden, a “founding father” of tinyML and a technical lead of TensorFlow Lite Micro at Google, Arm Innovator, Kwabena Agyeman, who developed OpenMV, a project dedicated to low-cost, extensible, Python-powered machine-vision modules that support machine learning algorithms, and Arm Innovator, Daniel Situnayake a founding tinyML engineer and developer from Edge Impulse, a company that offers a full tinyML pipeline that covers data collection, model training and model optimization. Also, Arm partners such as Cartesiam.ai, a company that offers NanoEdge AI, a tool that creates software models on the endpoint based on the sensor behavior observed in real conditions have been pushing the possibilities of tinyML to another level.

Arm, is also a partner of the TinyML Foundation, an open community that coordinates meet-ups to help people connect, share ideas, and get involved. There are many localised tinyML meet-ups covering UK, Israel and Seattle to name a few, as well as a global series of tinyML Summits. For more information, visit the tinyML foundation website.

Learn more about TinyML development

Watch our latest video to see what’s possible with tinyML. Plus, if you’re a developer, you can follow the tutorial afterwards to build a person detection demo using the Arduino Portenta H7, TensorFlow Lite for Microcontrollers and Mbed OS.

Watch on YouTube

By Arm Editorial Team

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Brian Fuller and Jack Melling

editorial@arm.com

Stay informed with Arm's top stories, insights, and conversations.

Media Information

Latest on X

; Arm @Arm ·

18h 2082562635061461323

We kicked off FYE27 with record first-quarter revenue. 🎉

As AI becomes part of every cloud, every device and every sector, the industry is increasingly converging on Arm—through IP, CSS or silicon—as the common compute platform we believe will define the next decade of

Reply on Twitter 2082562635061461323 Retweet on Twitter 2082562635061461323 10 Like on Twitter 2082562635061461323 42 Twitter 2082562635061461323

; Arm @Arm ·

27 Jul 2081862230878888311

From launch to the OCP show floor earlier this year.

ASRock Rack was among the first to offer servers powered by the Arm AGI CPU.

At OCP EMEA, we saw that collaboration in the rack—bringing high-density, energy-efficient compute to cloud and AI workloads:

Reply on Twitter 2081862230878888311 Retweet on Twitter 2081862230878888311 8 Like on Twitter 2081862230878888311 44 Twitter 2081862230878888311

; Arm @Arm ·

27 Jul 2081752118931689927

As robots take on more real-world tasks, the conversation is shifting from capability to scalability. 🤖

@AGIBOTofficial’s progress shows how to make that possible: efficient AI, real-time control and scalable compute from cloud to edge, built on Arm. https://okt.to/1t8unb

Reply on Twitter 2081752118931689927 Retweet on Twitter 2081752118931689927 0 Like on Twitter 2081752118931689927 24 Twitter 2081752118931689927

; Arm @Arm ·

24 Jul 2080711637720326322

Robots are now heading to the Moon. 🌕🤖

@LunarOutpostInc is working with @nvidia to bring AI to robots designed for upcoming lunar missions using NVIDIA Jetson, built on the Arm compute platform.

The technology will support LiDAR, sensor processing, mapping and autonomous

Reply on Twitter 2080711637720326322 Retweet on Twitter 2080711637720326322 6 Like on Twitter 2080711637720326322 50 Twitter 2080711637720326322

; Arm @Arm ·

24 Jul 2080685017739493722

'@databricks is increasing its use of Arm-based Microsoft @Azure Cobalt. 🚀

Building on Cobalt 100, it plans to adopt Cobalt 200 to further improve performance and efficiency for agentic AI and data-intensive workloads.

The future of AI is being built on Arm.

Reply on Twitter 2080685017739493722 Retweet on Twitter 2080685017739493722 4 Like on Twitter 2080685017739493722 35 Twitter 2080685017739493722

; Arm @Arm ·

23 Jul 2080416448346837287

Agentic AI changes the performance question.

It’s no longer only ➡️ “How fast did the model generate the response?”

It’s ➡️ “How efficiently did the system complete the task?”

That puts the CPU—coordinating retrieval, tools, execution and verification—at the heart of the

Reply on Twitter 2080416448346837287 Retweet on Twitter 2080416448346837287 5 Like on Twitter 2080416448346837287 23 Twitter 2080416448346837287

; Arm @Arm ·

23 Jul 2080366060130177180

There's a reason all four major US hyperscalers are building on Arm.

@GoogleCloud's Arm-based Axion shows why. By building on Arm Neoverse and our software ecosystem, Google could tailor its silicon to its own workloads without rebuilding the underlying architecture.

As the Arm

Reply on Twitter 2080366060130177180 Retweet on Twitter 2080366060130177180 4 Like on Twitter 2080366060130177180 36 Twitter 2080366060130177180

TinyML Brings AI to Smallest Arm Devices

Why target microcontrollers with tinyML?

Advanced pattern recognition in a very compact format

Arm Cortex-M55 and Ethos-U55 in action

TinyML takes endpoint devices to the next level

TinyML gaining momentum with an expanding ecosystem

How to accelerate tinyML performance on Arm

Learn more about TinyML development

Editorial Contact

Stay informed with Arm's top stories, insights, and conversations.

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X