Blog

October 9, 2024

Why Arm is the Compute Platform for All AI Workloads

The Arm CPU can be seamlessly augmented and integrated with AI accelerator technologies as part of a flexible heterogeneous computing approach to AI.

By Arm Editorial Team

For AI, no individual piece of hardware or computing component will be the “one size fits-all” solution for all workloads. AI needs to be distributed across the entire modern topography of computing, from cloud to edge – and that requires a heterogeneous computing platform that offers the flexibility to use different computational engines, including the CPU, GPU and NPU, for different AI use cases and demands.

The Arm CPU already provides a foundation for accelerated AI everywhere, from the smallest embedded device to the largest datacenter. This is due to its performance and efficiency capabilities, pervasiveness, ease of programmability and flexibility.

Focusing on flexibility, there are three key reasons why this is hugely beneficial to the ecosystem. Firstly, it means the Arm CPU can process a broad range of AI inference use cases, many of which are commonly used across billions of devices, like today’s smartphones, and in cloud and data centers worldwide – and not only that, because beyond inference the CPU is often used for additional tasks in the stack, such as data pre-processing and orchestration. Secondly, developers can run a broader range of software in a greater variety of data formats without needing to build multiple versions of the code. And, thirdly, CPU’s flexibility makes it the perfect partner for accelerated AI workloads.

Delivering diversity and choice to enable the industry to deploy AI compute their way

Alongside the CPU portfolio, the Arm compute platform includes AI accelerator technologies, such as GPUs and NPUs, which are being integrated with the CPU across various markets.

In mobile, Arm Compute Subsystems (CSS) for Client features the Armv9.2 CPU cluster integrated with the Arm Immortalis-G925 GPU to offer acceleration capabilities for various AI use cases, including image segmentation, object detection, natural language processing, and speech-to-text. In IoT, the Arm Ethos-U85 NPU is designed to run with Cortex-A-based systems that require accelerated AI performance, such as factory automation.

Also, in addition to Arm’s own accelerator technologies, our CPUs give our partners the flexibility to create their own customized, differentiated silicon solutions. For example, NVIDIA’s Grace Blackwell and Grace Hopper superchips for AI-based infrastructure both incorporate Arm CPUs alongside NVIDIA’s AI accelerator technologies to deliver significant uplifts in AI performance.

The Grace Blackwell superchip combines NVIDIA’s Blackwell GPU architecture with the Arm Neoverse-based Grace CPU. Arm’s unique offering enabled NVIDIA to make system-level design optimizations, reducing energy consumption by 25 times and providing a 30 times increase in performance per GPU compared to NVIDIA H100 GPUs. Specifically, NVIDIA was able to implement their own high-bandwidth NVLink interconnect technology, improving data bandwidth and latency between the CPU, GPU and memory – an optimization made possible thanks to the flexibility of the Arm Neoverse platform.

Click to read Accelerate Your AI Data Center Dreams

Arm is committed to bringing these AI acceleration opportunities across the ecosystem through Arm Total Design. The program provides faster access to Arm’s CSS technology, unlocking hardware and software advancements to drive AI and silicon innovation and enabling the quicker development and deployment of AI-optimized silicon solutions.

Flexibility of the Arm CPU: Delivering the architecture AI demands

Central to the flexibility of the Arm CPU designs is our industry-leading architecture. It offers a foundational platform that can be closely integrated with AI accelerator technologies and supports various vector lengths, from 128 bit to 2048 bit, which allows for multiple neural networks to be executed easily across many different data points.

The flexibility of the Arm’s architecture enables diverse customization opportunities for the entire silicon ecosystem, with our heritage built on enabling partners to build their own differentiated silicon solutions as quickly as possible. This unique flexibility also allows Arm to continuously innovate the architecture, introducing critical instructions and features on a regular cadence that accelerate AI computation to benefit the entire ecosystem, from leading silicon partners to the 20 million plus software developers building on the Arm compute platform.

This started with the Armv7 architecture, which introduced advanced Single Instruction Multiple Data (SIMD) extensions, such as NEON technology, as Arm’s initial venture into machine learning (ML) workloads. It has been enhanced over the past few years, with additions focused on vector dot product and matrix multiplication as part of Armv8, before the introduction of Arm Scalable Vector Extensions 2 (SVE2) and the new Arm Scalable Matrix Extension (SME) as key elements of Armv9 that drive higher compute performance and reduced power consumption for a range of generative AI workloads and use cases.

Seamless integration with AI accelerator technologies

Arm is the compute platform for the age of AI, driving ongoing architectural innovation that directly corresponds with the evolution of AI-based applications that are becoming faster, more interactive, and more immersive. The Arm CPU can be seamlessly augmented and integrated with AI accelerator technologies, such as GPUs and NPUs, as part of a flexible heterogeneous computing approach to AI workloads.

While the Arm CPU is the practical choice for processing many AI inference workloads, its flexibility means it is the perfect companion for accelerator technologies where more powerful and performant AI is needed to deliver certain use cases and computation demands. For our technology partners, this helps to deliver endless customization options to enable them to build complete silicon solutions for their AI workloads.

By Arm Editorial Team

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Arm Editorial Team

editorial@arm.com

Stay informed with Arm's top stories, insights, and conversations.

Blog

Jan 08, 2024

Arm: The Technology Foundation for AI Everywhere

Arm Editorial Team

News

Sep 25, 2024

Accelerating and Scaling AI Inference Everywhere with New Llama 3.2 LLMs on Arm

Ian Bratt, VP of ML Technology and Fellow, Arm

News

Sep 16, 2024

Arm Accelerates AI From Cloud to Edge With New PyTorch and ExecuTorch Integrations to Deliver Immediate Performance Improvements for Developers

Alex Spinelli, SVP, AI and Developer Platforms, Arm

Blog

Sep 10, 2024

Unlocking New Real-world Generative AI Use Cases on the Mobile CPU

Ronan Naughton, Director, Product Management, Client Line of Business, Arm

News

May 29, 2024

Redefining Mobile Experiences with AI-Optimized Arm CSS for Client and New Arm Kleidi Software

Chris Bergey, Executive Vice President, Edge AI Business Unit, Arm

News

Apr 09, 2024

Arm Accelerates Edge AI with Latest Generation Ethos-U NPU and New IoT Reference Design Platform

Paul Williamson, SVP and GM of the IoT Business, Arm

Media Information

Latest on X

; Arm @Arm ·

2h 2069193867459494327

At Web Summit Rio, Will Abbey, EVP Chief Commercial Officer at Arm, explored what comes next for AI infrastructure. As agentic AI drives new demands on compute, performance and power efficiency will be critical to delivering meaningful outcomes at scale. https://okt.to/QrsZvB

Reply on Twitter 2069193867459494327 Retweet on Twitter 2069193867459494327 0 Like on Twitter 2069193867459494327 2 Twitter 2069193867459494327

; Arm @Arm ·

4h 2069167255720485057

Neural Super Sampling and Denoising (NSSD) is helping introduce a new era of gaming on mobile with Neural Dawn. By bringing AI-powered upscaling and denoising together, NSSD supports advanced graphics experiences on mobile. 📱

Reply on Twitter 2069167255720485057 Retweet on Twitter 2069167255720485057 2 Like on Twitter 2069167255720485057 13 Twitter 2069167255720485057

; Arm @Arm ·

12h 2069042931256668429

Agentic AI requires a new approach to infrastructure.

At #COMPUTEX2026, @Rebellions_inc showcased RebelCard™ and discussed how Arm AGI CPU and AI accelerators can work together to deliver scalable, energy-efficient AI inference.

Hear from Co-founder & CTO Jinwook Oh ⬇️

Reply on Twitter 2069042931256668429 Retweet on Twitter 2069042931256668429 6 Like on Twitter 2069042931256668429 35 Twitter 2069042931256668429

; Arm @Arm ·

19 Jun 2068067327824875953

"We have almost a control tower view of the entire industry. We see everything. We talk to everyone."

On @TBPN, Arm CEO Rene Haas shared how our position at the center of the world's compute ecosystem gives us a unique perspective on the technology and trends shaping the next

Reply on Twitter 2068067327824875953 Retweet on Twitter 2068067327824875953 9 Like on Twitter 2068067327824875953 53 Twitter 2068067327824875953

; Arm @Arm ·

19 Jun 2067969475094192602

A new addition to Arm Cambridge. 🏢

We recently opened doors to E&F, the sixth building on our Cambridge campus, designed to support collaboration, innovation and growth for years to come.

Here's a look inside 📸

Reply on Twitter 2067969475094192602 Retweet on Twitter 2067969475094192602 10 Like on Twitter 2067969475094192602 71 Twitter 2067969475094192602

; Arm @Arm ·

18 Jun 2067575912783093887

The future of AI infrastructure will be built as a system.

As AI workloads become more complex, performance alone isn't enough. Infrastructure must also be efficient, scalable and designed to work seamlessly across the stack.

At #COMPUTEX2026 @Rebellions_inc Co-founder & CTO

Reply on Twitter 2067575912783093887 Retweet on Twitter 2067575912783093887 12 Like on Twitter 2067575912783093887 89 Twitter 2067575912783093887

; Arm @Arm ·

17 Jun 2067380305623728312

Today Arm hosted the Women in @TechworksHub: Engineering Intelligently event, bringing together innovators from across the technology industry to discuss leadership, transformation, and the future of talent in the AI era.

Alongside insights from Arm's Chief People Officer,

Reply on Twitter 2067380305623728312 Retweet on Twitter 2067380305623728312 3 Like on Twitter 2067380305623728312 29 Twitter 2067380305623728312

Why Arm is the Compute Platform for All AI Workloads

Delivering diversity and choice to enable the industry to deploy AI compute their way

Flexibility of the Arm CPU: Delivering the architecture AI demands

Seamless integration with AI accelerator technologies

Editorial Contact

Related

Arm: The Technology Foundation for AI Everywhere

Accelerating and Scaling AI Inference Everywhere with New Llama 3.2 LLMs on Arm

Arm Accelerates AI From Cloud to Edge With New PyTorch and ExecuTorch Integrations to Deliver Immediate Performance Improvements for Developers

Unlocking New Real-world Generative AI Use Cases on the Mobile CPU

Redefining Mobile Experiences with AI-Optimized Arm CSS for Client and New Arm Kleidi Software

Arm Accelerates Edge AI with Latest Generation Ethos-U NPU and New IoT Reference Design Platform

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X