Summary: What does it take to scale AI from the cloud to the edge? Arm makes it possible by enabling consistent AI development across datacenters, edge infrastructure, and devices. A unified architecture and a mature software ecosystem help developers deploy AI efficiently, wherever it runs.
As AI rapidly expands across datacenters, devices, and everything in between, the real challenge isn’t building intelligent computing; it’s building the infrastructure required to scale it.
AI is not a single-layer problem; it’s a sprawling ecosystem shaped by the world’s biggest technology leaders. Across this trillion-dollar transformation, one architecture keeps emerging: Arm.
How industry leaders are building AI datacenters on Arm
Alongside Arm, the biggest names in AI (NVIDIA, AWS, Microsoft, Google, Oracle, and OpenAI) are collectively driving next-generation datacenter buildouts. Estimates place AI infrastructure investments in the trillions of dollars, fueled by demands for training, inference, and cost-efficient scale.
By 2025, half of the compute shipped to top hyperscalers is projected to be Arm-based. AWS (Graviton), Google Cloud (Axion), and Microsoft Azure (Cobalt) all now deploy Arm-based chips for cloud infrastructure, enabling significant energy and cost savings as well as scalability. NVIDIA’s Grace CPU, built on Arm Neoverse, anchors its Grace Blackwell AI superchip, which has seen 3.6 million units ordered by the top four U.S. hyperscale cloud providers alone. In fact, over 1 billion Arm Neoverse CPUs have now been shipped into datacenters, underscoring the architecture’s central role in this global buildout.
Across the most advanced AI datacenter stacks, Arm is the common denominator, enabling scalability, efficiency, and adaptable performance where older architectures fall short.
Arm delivers unmatched price-performance and power efficiency:
- NVIDIA’s Grace Hopper Superchip yields up to 8x faster model training and 4.5x higher LLM inference performance versus x86 systems¹.
- Google’s Axion offers up to 3x better recommender performance², 2.5x higher inference performance, and 64% cost savings compared to x86³.
- As of December 2024, over 50% of EC2 capacity is built on AWS Graviton⁴.
Moreover, recent analysis from the consulting firm Signal65 shows that Arm Neoverse-based AWS Graviton4 chips are not only leading the competition on price-performance but also significantly outpacing comparable x86 offerings from AMD and Intel on overall performance across enterprise workloads. For example, Signal65’s benchmarking tests showed Graviton4 delivering up to 168% better large language model (LLM) inference performance and 220% higher price-performance than AMD, while also beating Intel in networking throughput by 53% and machine learning (ML) training speeds by 34%. These results underscore Arm’s architectural advantage across both AI and general compute tasks.
Why scaling AI from cloud to edge requires new compute architectures
AI isn’t confined to datacenters; it’s expanding outward. Smartphones, PCs, and IoT devices, from low-power sensors to high-performance industrial applications, now demand on-device generative AI, reshaping user experiences.
Arm is uniquely positioned here, too. The new Arm Lumex Compute Subsystem (CSS) platform for consumer devices unlocks real-time on-device AI use cases like assistants, voice translation, and personalization, with the new SME2-enabled Arm CPUs delivering up to 5x faster AI performance. Meanwhile, the world’s first Armv9 edge AI platform, which is optimized for edge AI workloads across IoT applications, enables on-device AI models of over one billion parameters.
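Curious whether a given system exposes these newer matrix extensions? On aarch64 Linux, the kernel advertises CPU feature flags that can be checked directly. The sketch below is illustrative only: flag names such as sme and sme2 depend on kernel version and silicon, and it assumes the standard /proc/cpuinfo layout.

```python
# Illustrative sketch (assumes an aarch64 Linux host): recent kernels
# advertise CPU features such as "sme" and "sme2" in /proc/cpuinfo;
# exact flag names vary by kernel version and silicon.
def arm_cpu_features(path="/proc/cpuinfo"):
    with open(path) as f:
        for line in f:
            if line.startswith("Features"):
                # Format: "Features : fp asimd sve sme sme2 ..."
                return set(line.split(":", 1)[1].split())
    return set()

features = arm_cpu_features()
print("SME advertised: ", "sme" in features)
print("SME2 advertised:", "sme2" in features)
```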
Arm is powering a cloud-to-edge revolution, and it’s built to scale across that continuum.
Why software is the real differentiator in scaling AI
In AI, hardware provides the foundation, but itβs software that defines the experience. As AI workloads scale in complexity and reach, developers need an ecosystem that can move as fast as their ambitions. This is where Arm’s unique advantage shines: a unified architecture supported by a robust, optimized software ecosystem that spans from cloud to edge.
Arm’s massive developer base, now 22 million strong, benefits from an ecosystem where the same code, tools, and frameworks run seamlessly across devices, whether it’s datacenter-scale model training or real-time inference at the edge. This architectural consistency enables faster development, streamlined optimization, and wider deployment without redundant engineering effort.
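As a minimal illustration of that consistency, the PyTorch snippet below runs unchanged on an Arm-based cloud instance or an Arm edge device; only the reported machine string differs. The model is a trivial stand-in, not any particular production workload.

```python
# Minimal portability sketch: identical PyTorch code runs unchanged on
# aarch64 cloud instances (e.g. Graviton) and Arm edge devices.
import platform
import torch

print("Machine:", platform.machine())  # 'aarch64' on 64-bit Arm Linux

model = torch.nn.Linear(128, 10)       # trivial stand-in for a real model
x = torch.randn(1, 128)
with torch.inference_mode():
    y = model(x)
print("Output shape:", tuple(y.shape))
```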
Key frameworks like PyTorch ExecuTorch, TensorFlow Lite, and MediaPipe are now deeply integrated and optimized for Arm-based systems via Arm KleidiAI, a lightweight, open-source optimization layer that activates Arm-optimized microkernels under the hood. That means developers can tap into performance enhancements automatically without modifying code, across everything from hyperscale cloud platforms to smartphones and embedded devices.
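To see what "without modifying code" means in practice, consider the generic TensorFlow Lite snippet below. Nothing in it references Arm; on supported Arm hardware, the interpreter's default XNNPACK delegate can route operations to KleidiAI-optimized kernels on its own. The model file name is a placeholder for any converted .tflite model.

```python
# Hedged sketch: a completely generic TFLite inference script. On
# supported Arm hardware, KleidiAI-optimized kernels can be picked up
# automatically via the XNNPACK delegate; no Arm-specific code needed.
import numpy as np
import tensorflow as tf

interpreter = tf.lite.Interpreter(model_path="model.tflite")  # placeholder file
interpreter.allocate_tensors()

inp = interpreter.get_input_details()[0]
out = interpreter.get_output_details()[0]

# Feed a dummy input matching the model's expected shape and dtype.
interpreter.set_tensor(inp["index"], np.zeros(inp["shape"], dtype=inp["dtype"]))
interpreter.invoke()

print("Output shape:", interpreter.get_tensor(out["index"]).shape)
```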
For example, on Graviton4, KleidiAI improves time-to-first-token for Llama 3 by up to 2.5x over the baseline, while mobile implementations leveraging MediaPipe see performance boosts of up to 30% on models like Gemma 2B. Whether managing AI factories or deploying chatbots at the edge, the software experience is predictable, performant, and power efficient.
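Time-to-first-token is also easy to measure for yourself. The sketch below uses the third-party llama-cpp-python bindings as one convenient way to stream tokens from a locally downloaded Llama 3 GGUF file; the model path and quantization level are assumptions, and absolute numbers will vary by instance type and build options.

```python
# Hedged sketch: measuring time-to-first-token (TTFT) with the
# third-party llama-cpp-python bindings on an Arm host.
import time
from llama_cpp import Llama

# Placeholder path: any locally downloaded Llama 3 GGUF file works.
llm = Llama(model_path="llama-3-8b-instruct.Q4_0.gguf")

start = time.perf_counter()
stream = llm.create_completion(
    "Summarize cloud-to-edge AI in one sentence.",
    max_tokens=64,
    stream=True,
)
next(stream)  # blocks until the first token arrives
print(f"TTFT: {time.perf_counter() - start:.3f}s")

for _ in stream:  # drain the remaining tokens
    pass
```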
This kind of seamless, system-aware software enablement is what differentiates Arm’s approach. Developers aren’t left navigating fragmented stacks or doing backend rework. Instead, they inherit the benefits of an ecosystem that’s co-designed, hardware and software together, for AI performance and efficiency.
In the AI era, where performance-per-watt is everything, Arm’s software ecosystem isn’t just keeping up; it’s meeting developers where they are and accelerating innovation.
Where cloud-to-edge AI is being deployed today
AI is being forged at an unprecedented scale, from trillion-dollar datacenters to next-gen smartphones and in-vehicle systems. The architecture bridging these worlds is Arm.
With hyperscaler adoption, flexible edge compute, and a vibrant, AI-ready software ecosystem, Arm stands as the backbone of AI infrastructure, today and tomorrow.
Ready to learn more? Explore how Arm is powering the AI era at scale.
Frequently asked questions
Q: What is cloud to edge AI?
A: Cloud-to-edge AI refers to AI workloads that start in cloud datacenters and extend out to edge devices, placing compute closer to where data is collected.
Q: How does Arm support scalable AI infrastructure?
A: The Arm architecture powers both hyperscaler cloud chips and efficient edge processors, with consistent software tooling that lets AI workloads scale end-to-end.
Q: What AI developer frameworks does Arm support?
A: Arm supports PyTorch (via ExecuTorch), TensorFlow Lite, MediaPipe, and optimized libraries via KleidiAI.
Q: Why build on Arm for edge AI?
A: Edge AI on Arm minimizes latency, boosts privacy, and reduces energy usage compared to cloud-only deployments, while letting developers reuse the same code and tools they use in the cloud.
References
¹ NVIDIA GH200 Grace Hopper Superchip Architecture
² Unpacking Axion: Google Cloud’s Custom Arm-based Processor Built for the AI Age
⁴ AWS re:Invent 2024 – Monday Night Live with Peter DeSantis