Why the Right Software Approach is Vital to AI Innovation

As Mark Hambleton, Arm’s SVP of Software, says in the Arm Silicon Reimagined report: “The future of AI development relies on the synergy between software and hardware.”
However, the big challenge, as set out in a new Arm-sponsored CIO report, is that developer workflows are often fragmented. This means developers are unable to move as quickly as they would like when creating and scaling new AI applications.
At Arm, we recognize the importance of software in fulfilling AI’s true potential. We work from the foundational architecture and across the stack to simplify AI development and enable seamless performance acceleration for new AI applications and workloads.
How Armv9 Architecture Accelerates AI and ML Workloads
Arm continuously evolves its architecture, which acts as the interface between hardware and software. Today, the Armv9 architecture is the modern technology foundation for markets from cloud to edge, including smartphones, datacenters, high-performance computing, and automotive applications.
Arm updates the architecture with new features in each generation; recent additions include the Scalable Matrix Extension (SME) and Scalable Vector Extension 2 (SVE2), which are critical for accelerating generative AI and common machine learning (ML) workloads across all applications. SME brings advanced matrix processing capabilities into the common instruction set, allowing developers to achieve strong performance in their AI applications and then seamlessly migrate across ecosystems. This creates greater possibilities to run more AI workloads on more hardware, while offering an improved user experience.
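To make the matrix-processing idea concrete, here is a minimal, illustrative sketch in plain Python of the kind of kernel SME is designed to accelerate: a matrix multiply expressed as a sum of outer products, the formulation that maps naturally onto SME's tile-based outer-product instructions. The code itself is ordinary Python for clarity; on Armv9 hardware, it is compilers and optimized libraries that emit the actual SME/SVE2 instructions for this pattern.

```python
def matmul_outer_product(A, B):
    """Multiply A (m x k) by B (k x n) as a sum of k rank-1 outer
    products -- the accumulation pattern that SME's outer-product
    instructions compute in hardware over a tile accumulator."""
    m, k, n = len(A), len(B), len(B[0])
    # Accumulator, analogous in spirit to SME's tile storage.
    C = [[0.0] * n for _ in range(m)]
    for p in range(k):
        # One outer product: column p of A with row p of B.
        for i in range(m):
            for j in range(n):
                C[i][j] += A[i][p] * B[p][j]
    return C

# Small demo: (2x3) @ (3x2)
A = [[1.0, 2.0, 3.0],
     [4.0, 5.0, 6.0]]
B = [[7.0,  8.0],
     [9.0, 10.0],
     [11.0, 12.0]]
print(matmul_outer_product(A, B))  # [[58.0, 64.0], [139.0, 154.0]]
```

The point of the outer-product formulation is that each step touches one column of A and one row of B, updating the whole accumulator at once, which is exactly the shape of work a matrix engine can parallelize.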
Why the CPU Remains the Preferred Platform for AI Development
These architectural features are built into Arm CPUs, which have emerged as the target platform of choice for software developers. CPUs are widely adopted from cloud to edge, and they serve as an immediate target for most AI inference workloads, which already run across billions of devices, from today’s smartphones to cloud datacenters worldwide. By targeting the CPU, developers can run a broader range of software, in a greater variety of data formats, without building multiple versions of their code for specialist NPUs.
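As an illustration of the "variety of data formats" point, the sketch below shows symmetric int8 quantization, one common way model weights are shrunk from 32-bit floats so inference runs efficiently on general-purpose CPUs. This is plain Python with hypothetical function names, not an Arm API; it only demonstrates the format conversion itself.

```python
def quantize_int8(weights):
    """Symmetric int8 quantization: map floats in [-max_abs, max_abs]
    onto integers in [-127, 127] using a single scale factor."""
    max_abs = max(abs(w) for w in weights) or 1.0
    scale = max_abs / 127.0
    q = [round(w / scale) for w in weights]
    return q, scale

def dequantize(q, scale):
    """Recover approximate float values from the int8 codes."""
    return [v * scale for v in q]

weights = [0.52, -1.3, 0.07, 0.9]
q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
# q holds small integers; `restored` approximates the originals
# to within half a quantization step (scale / 2).
```

A CPU that supports narrow integer arithmetic can process such int8 tensors directly, which is part of why one CPU target can serve models stored in many different formats.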
CPUs offer developers the consistency that they value, avoiding the fragmentation and inefficiencies associated with bespoke hardware solutions. As Hambleton noted in the Silicon Reimagined report: “Interoperability across AI frameworks is a critical concern for developers. This is why developers frequently default to CPU back-ends, as their ubiquity ensures broader compatibility.”
There are other factors beyond architectural advancement that are helping to scale AI workloads. In the CIO report, Nick Horne, Arm’s VP of ML Engineering, states that AI has evolved from requiring enormous models in the cloud to smaller, more efficient models that can run at the edge – on devices. He says: “Now you can get excellent models that provide great results running on the device in your pocket, and in some cases entirely on the CPU.”
How Arm’s Open Source Collaboration Empowers AI Developers
Arm works extensively with the open source community to democratize AI and create opportunities for developers to easily access the latest architectural features and performance across hardware from a broad range of Arm ecosystem partners.
Horne highlights the benefits of this approach to developers in the CIO report. He says: “Working with open source AI frameworks with good hardware abstraction minimizes the loss of flexibility.” This helps developers to avoid being tied down to a specific piece of hardware, cloud service provider or software platform.
What Is Arm Kleidi and How Does It Accelerate AI Workloads?
Arm Kleidi is a great example of these benefits in action. Kleidi includes developer enablement technologies, resources and micro-kernel libraries providing effortless AI workload acceleration for models running on Arm CPUs. As Kleidi libraries are integrated into the most popular open source AI frameworks and runtimes, including MediaPipe from Google, ExecuTorch and PyTorch from Meta, and llama.cpp, these performance optimizations require no additional work from developers, saving time, effort and costs. Kleidi is now integrated across all markets that Arm covers, including mobile, cloud, datacenter, automotive, and IoT.
How Arm’s Ecosystem Partnerships Enable Scalable AI Deployment
On a broader level, Arm works across our industry-leading software ecosystem with various partners to deploy AI securely and safely at scale. For example, our partnership with GitHub on GitHub runners enables developers to test and deploy trained models more efficiently in the cloud. More recently, the Arm extension for GitHub Copilot offers developers a fully integrated, native Arm workflow, with accurate code generation, test case creation and bug fixing.
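As a sketch of what targeting Arm in CI can look like, the hypothetical GitHub Actions workflow below runs a project's tests on an Arm-based hosted runner. The runner label shown follows GitHub's publicly documented Arm runner images; the job steps and file names are illustrative assumptions, and you should check current GitHub documentation for the labels available to your plan.

```yaml
# Illustrative workflow: run tests on an Arm64 GitHub-hosted runner.
name: arm-ci
on: [push]
jobs:
  test:
    runs-on: ubuntu-24.04-arm   # Arm64 (aarch64) hosted runner label
    steps:
      - uses: actions/checkout@v4
      - run: uname -m           # expect "aarch64" on an Arm runner
      - run: python -m pip install -r requirements.txt
      - run: python -m pytest
```

Because the build and test steps are the same as on any other runner, switching CI to Arm is largely a matter of changing the `runs-on` label.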
Arm is also committed to frictionless software development through various initiatives that simplify and accelerate the deployment of low-level software and firmware. Initiatives such as Linaro OneLab, Trusted Firmware and PSA Certified foster collaboration and provide blueprints for secure software deployment and support in the rapidly advancing spheres of edge AI and high-performance IoT. In the automotive industry, the Arm-founded SOAFEE (Scalable Open Architecture for Embedded Edge) initiative delivers a standards-based framework for software re-use at scale to accelerate development cycles. This supports the unprecedented demand for AI in software-defined vehicle (SDV) applications, while enhancing driver experiences.
Why Open Standards Are Key to AI Innovation
Finally, a lack of standardized practices can hinder innovation and create future complexity for developers. Open standards let developers and researchers transition seamlessly between platforms, and allow them to focus on the training, quantization, and deployment work that adds value to the ongoing innovation of models.
How Arm is Accelerating and Future-Proofing AI Development
For AI to reach its full potential, the software development process needs to be accelerated, open and streamlined. Arm’s technologies and supporting ecosystem help to future-proof AI development by focusing on open standards, hardware abstraction and compatibility with evolving frameworks. This approach allows developers to seamlessly create and deploy their AI applications, models and workloads at scale across diverse hardware with enhanced performance, building better software on Arm for the age of AI.
Takeaways
- SME and SVE2 are Armv9 architecture extensions designed to accelerate generative AI and ML workloads across cloud, edge, and device environments.
- SME enables efficient matrix processing, allowing developers to achieve strong AI performance and easily migrate workloads across different hardware ecosystems.
- SME2 builds on SME with higher throughput, real-time processing improvements, and better power efficiency for advanced mobile AI use cases.
Frequently Asked Questions
What are SME and SVE2 in the Armv9 architecture?
They are extensions that boost AI and ML performance by accelerating matrix and vector operations directly on the CPU.
How does SME improve AI application performance?
SME delivers scalable matrix processing and efficient memory use, enabling AI workloads to run smoothly across different hardware platforms.
What advantages does SME2 add over SME?
SME2 increases throughput, improves real-time AI task performance, and enhances energy efficiency for mobile and edge devices.