Choosing the Right Arm Edge AI Solution for Your AI Application

With the launch of the Cortex-A320, Arm’s smallest implementation of the Armv9-A architecture, developers now have even more options for processing IoT edge AI workloads. But with so many choices, how do you determine the right processor for your specific AI application? As a system developer, you can navigate that decision by comparing Cortex-A, Cortex-M, and Ethos-U NPU-based devices, along with their potential combinations. Beyond cost, discover how each processor impacts AI functionality and which software development flows are available to streamline your project.
Efficiency of AI compute in embedded devices
The efficiency of AI computation in embedded devices has grown by leaps and bounds in recent years. Improvements in Arm’s M- and A-profile architectures deliver multi-fold increases in machine learning (ML) inferences per unit of energy consumed. On the M-profile side, the Cortex-M52, Cortex-M55, and Cortex-M85 CPUs (all based on the Armv8.1-M architecture) integrate the programmable Helium vector extension, unlocking new AI-enabled use cases on microcontroller-class devices. Cortex-A processors based on Armv9, such as the recently announced Cortex-A320, in turn boost AI performance over earlier generations thanks to the Scalable Vector Extension 2 (SVE2). The Ethos-U family of neural processing units (NPUs) has improved processing efficiency, especially with transformer networks, in its latest generation, the Ethos-U85.
How to choose?
Each architecture offers advantages on different fronts. When considering which hardware is best suited, raw performance should be weighed against design flexibility. Additionally, the software development flow, including CI/CD requirements, needs to be considered.
Performance
Meeting the required AI processing performance is, of course, mandatory.
By nature, Cortex-A CPUs are general-purpose programmable processors that can target a wide range of end uses. Integrated into Cortex-A is the Neon/SVE2 vector engine, designed to accelerate neural networks and any vectorized code, and the natively supported data types are numerous. Cortex-M processors with the Helium vector engine present the same characteristics, though tuned for more cost- and energy-constrained targets. By contrast, Ethos-U NPUs (up to and including the Ethos-U85) are purpose-built to process neural network operators, specifically with quantized 8-bit integer weights. They are very efficient at this task, for network operators that can be mapped to the hardware present in those NPUs.
The latest generation of Cortex-A CPUs based on the Armv9 architecture supports a broad set of data types, including BF16. In addition, new matrix-multiply instructions have been introduced that significantly increase performance on neural network processing. A good explanation of how matrix multiply is implemented with SVE2 can be found in this blog.
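To build intuition for what these matrix-multiply instructions compute, the following NumPy sketch models the int8 semantics conceptually: two 2x8 blocks of int8 operands are multiplied and accumulated into a 2x2 block of int32 results, mirroring how instructions such as SMMLA widen int8 products into int32 accumulators. This is an illustrative model of the arithmetic, not the actual intrinsic-level implementation.

```python
import numpy as np

# Conceptual model of an int8 matrix-multiply-accumulate instruction
# (e.g. SMMLA): two 2x8 int8 operand blocks produce a 2x2 int32
# result, accumulated on top of the existing accumulator block.
def int8_mmla(acc: np.ndarray, a: np.ndarray, b: np.ndarray) -> np.ndarray:
    assert a.shape == (2, 8) and b.shape == (2, 8) and acc.shape == (2, 2)
    # Widen to int32 before multiplying so products do not overflow int8.
    return acc + a.astype(np.int32) @ b.astype(np.int32).T

rng = np.random.default_rng(0)
a = rng.integers(-128, 128, size=(2, 8), dtype=np.int8)
b = rng.integers(-128, 128, size=(2, 8), dtype=np.int8)
acc = np.zeros((2, 2), dtype=np.int32)

print(int8_mmla(acc, a, b))  # 2x2 int32 partial result of a larger GEMM
```

A full matrix multiplication is tiled into many such block operations, which is why these instructions translate directly into higher neural network throughput.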
The Cortex-M55 was the first Cortex-M processor to integrate Helium vector technology, followed later by the Cortex-M85. Both processors implement the dual-beat Helium configuration, which delivers up to eight 8-bit integer multiply-accumulate (MAC) operations per clock cycle: a 128-bit Helium vector holds sixteen int8 lanes, and a dual-beat implementation executes each vector instruction over two clock cycles. Helium also natively supports other data types, such as FP16 and FP32.
Finally, Ethos-U NPUs deliver very efficient neural network (NN) processing, but only on models with quantized data types: specifically, int8 weights and int8 or int16 activation data. This design choice enhances the NPU’s execution efficiency, but restricts its usage to these data types.
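In practice, this means a model must be fully quantized before it can target an Ethos-U NPU. The sketch below shows one common route, post-training int8 quantization with the TensorFlow Lite (LiteRT) converter; the toy Keras model and random representative dataset are placeholders you would replace with your own trained model and real calibration samples.

```python
import numpy as np
import tensorflow as tf

# Placeholder model; substitute your own trained Keras model.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(96, 96, 1)),
    tf.keras.layers.Conv2D(8, 3, activation="relu"),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10),
])

def representative_dataset():
    # A few hundred real input samples are typical; random data here
    # just keeps the example self-contained.
    for _ in range(100):
        yield [np.random.rand(1, 96, 96, 1).astype(np.float32)]

converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
converter.representative_dataset = representative_dataset
# Force full-integer quantization: int8 weights and activations,
# as required for Ethos-U offload.
converter.target_spec.supported_ops = [tf.lite.OpsSet.TFLITE_BUILTINS_INT8]
converter.inference_input_type = tf.int8
converter.inference_output_type = tf.int8

with open("model_int8.tflite", "wb") as f:
    f.write(converter.convert())
```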
One way to assess a processor’s performance for real-world AI workloads is by analyzing its theoretical MAC execution capability per data type and per clock cycle. Since neural network processing uses large datasets, memory access performance is another crucial factor. However, in this instance, we will focus specifically on processor-bound performance rather than memory-bound performance.
Neural network processing rate is often limited by the MAC operation rate of the underlying hardware. While the actual network processing performance varies based on the network structure, the theoretical MAC processing rate shown below provides one indicator of the hardware’s capabilities.
| MACs/core/clock cycle | Int8 | Int16 | Int32 | BF16 | FP16 | FP32 |
| --- | --- | --- | --- | --- | --- | --- |
| Cortex-M55 & Cortex-M85 | 8 | 4 | 2 | N/A | 4 | 2 |
| Ethos-U85 (128 MACs) | 128 | 64 | N/A | N/A | N/A | N/A |
| Ethos-U85 (2048 MACs) | 2048 | 1024 | N/A | N/A | N/A | N/A |
| Cortex-A320 | 32 | 8 | 4 | 8 | 8 | 4 |
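As a worked example of reading this table, the snippet below converts int8 MACs per clock cycle into peak theoretical throughput. The 1 GHz clock is an assumed round number for illustration, not a product specification, and each MAC is counted as two operations (multiply plus add) when quoting TOPS.

```python
# Peak theoretical int8 throughput = MACs/cycle x clock frequency.
# The 1 GHz clock below is an illustrative assumption, not a spec.
CLOCK_HZ = 1_000_000_000

int8_macs_per_cycle = {
    "Cortex-M55/M85": 8,
    "Cortex-A320": 32,
    "Ethos-U85 (128 MACs)": 128,
    "Ethos-U85 (2048 MACs)": 2048,
}

for name, macs in int8_macs_per_cycle.items():
    macs_per_s = macs * CLOCK_HZ
    tops = macs_per_s * 2 / 1e12  # 2 ops (multiply + add) per MAC
    print(f"{name}: {macs_per_s / 1e9:.0f} GMAC/s, ~{tops:.2f} TOPS")
```

Real workloads land below these peaks once memory bandwidth, operator coverage, and scheduling overheads come into play, which is why the theoretical rate is only one indicator.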
Software
Another aspect to consider is the software support for each hardware solution. Arm offers a comprehensive set of open-source runtime support software for all AI hardware solutions: Cortex-A, Cortex-M, and Ethos-U. Arm supports hardware acceleration for various ML frameworks and runtimes, including PyTorch, ExecuTorch, llama.cpp, TensorFlow, and LiteRT via XNNPACK. Any ML framework can be optimized to leverage Arm AI features, with runtimes executing on Arm processors and utilizing software acceleration libraries such as CMSIS-NN for Cortex-M/Helium, and Arm Compute Library or KleidiAI for int8 and BF16 on Neon/SVE2. For Ethos-U, the Vela compiler is an offline tool that optimizes a quantized model for efficient deployment, further refining the deployed binary for maximum hardware performance.
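To make the Ethos-U flow concrete, here is a minimal sketch that invokes Vela on a fully quantized model, wrapped in Python to keep a single language across the examples in this post. The accelerator configuration string, file names, and output directory are assumptions to adapt to your project; consult "vela --help" for the options your installed version supports.

```python
import subprocess

# Hedged sketch: run the Vela compiler on a fully quantized TFLite model.
# The accelerator configuration and paths are assumptions, not fixed values.
subprocess.run(
    [
        "vela",
        "model_int8.tflite",                       # e.g. from the earlier quantization sketch
        "--accelerator-config", "ethos-u85-256",   # assumed target NPU configuration
        "--output-dir", "vela_out",
    ],
    check=True,
)
# The optimized model in vela_out/ embeds the Ethos-U command stream;
# operators Vela cannot map to the NPU fall back to the host CPU.
```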
When should you use Ethos-U?
Some edge AI use cases with well-defined AI workloads can benefit from offloading NN processing to a dedicated NPU, thereby freeing the host processor from such compute-intensive tasks. As discussed, the Ethos-U NPU is very efficient at processing neural networks with quantized 8-bit integer weights, and transformer networks are especially well suited to the Ethos-U85. However, the Ethos-U85 NPU must be driven by a host processor, which can be either a Cortex-M or a Cortex-A.
Various host processor and Ethos-U configurations are possible. Ethos-U can be driven by a Helium-enabled Cortex-M processor, such as the Cortex-M55, and some examples of this system-on-chip configuration are available on the market today. Recently, running generative AI workloads on small language models (SLMs) has been gaining interest in the industry; the Ethos-U combined with a Helium-enabled Cortex-M is well suited to such use-cases.
There are also system-on-chips based on Cortex-A processors that integrate an ML island consisting of a Cortex-M with an Ethos-U NPU. Typically, these SoCs are geared to run rich operating systems, such as Linux, and support a larger and more flexible memory system. Cortex-M CPUs have a 32-bit addressable memory space with direct memory address mapping, while recent Cortex-A processors, such as the Cortex-A320, have a 40-bit addressable memory space that also benefits from virtual memory addressing through a memory management unit (MMU).
As large language model (LLM) execution gravitates to edge AI devices, a larger and more flexible memory system eases the execution of models with larger parameter counts, for example LLMs beyond 1 billion parameters. With growing interest in SLMs, Cortex-M with Ethos-U85 is a great fit: Cortex-M processors have 4 GB of addressing space, with some reserved for system functions. As LLM model sizes grow, however, Cortex-A systems, with their larger and more flexible memory, may become essential.
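A quick back-of-the-envelope calculation illustrates why: weight storage alone for a 1-billion-parameter model is roughly 1 GB at int8 (2 GB at FP16), before accounting for activations, the KV cache, and the system’s own reservations within a 4 GB address space. The parameter counts below are illustrative assumptions, not measurements of specific models.

```python
# Back-of-the-envelope weight footprint for SLM/LLM deployment.
# Parameter counts and data types are illustrative assumptions.
GIB = 1024**3

models = {
    "SLM, 250M params": 250e6,
    "LLM, 1B params": 1e9,
    "LLM, 3B params": 3e9,
}
bytes_per_param = {"int8": 1, "fp16": 2}

for name, params in models.items():
    for dtype, nbytes in bytes_per_param.items():
        print(f"{name} @ {dtype}: {params * nbytes / GIB:.2f} GiB of weights")
# A 32-bit (4 GiB) address space gets tight well before 3B int8 params,
# motivating 40-bit addressing and an MMU on Cortex-A based systems.
```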
Recently, we announced another configuration called ‘direct drive’, in which the Cortex-A processor drives the Ethos-U NPU directly. This configuration removes the need for a dedicated Cortex-M ‘driver’ processor; a Linux driver for the Ethos-U85 runs on the host Cortex-A instead.

Meeting Generative AI demands with Cortex-A320
Edge AI system developers now have more options to optimize the last-mile AI in IoT. Whether choosing a Cortex-M, Cortex-A, or an Ethos-U-accelerated system, each serves different needs. With the Cortex-A320 processor’s ability to directly drive Ethos-U85, designers gain even more flexibility. As Arm’s smallest and most efficient Armv9-A Cortex-A processor, Cortex-A320 enhances edge AI efficiency while adapting to the evolving demands of generative AI on embedded systems.
Learn how the future of IoT is being shaped with transformative edge AI solutions from Arm.