Blog

April 17, 2024

Arm’s Mission to Help Tackle AI’s Insatiable Energy Needs

The challenge (and opportunity) of powering workloads in the AI datacenter

By Rene Haas, Chief Executive Officer, Arm

AI has the potential to exceed all the transformative innovations created in the past century. The benefits to society around health care, productivity, education and many other areas will be beyond our imagination. To run these complex AI workloads, the amount of compute required in the world’s data centers needs to exponentially scale. However, this insatiable need for compute has exposed a critical challenge: The immense power data centers require to fuel this groundbreaking technology.

Today’s data centers already consume lots of power: Globally 460 terawatt-hours (TWh) of electricity are needed annually. That’s equivalent to the entire country of Germany. In the United States, data center electricity consumption was 2.5% of the U.S. total (~130 TWh) in 2022 and is expected to triple to 7.5% (~390 TWh) by 2030, according to the Boston Consulting Group. That’s the equivalent of the electricity used by about 40 million U.S. houses – almost a third of the total homes in the U.S.

Future AI models will continue to become larger and smarter, fueling the need for more compute, which increases demand for power as part of a virtuous cycle. Finding ways to reduce the power requirements for these large data centers is paramount to achieving the societal breakthroughs and realizing the AI promise.

In other words, no electricity, no AI.

Companies need to rethink everything to tackle energy efficiency.

Reimagining the future of AI – a future powered by Arm

The power efficiency DNA of Arm – a company whose initial products were designed to run off batteries and sparked the mobile-phone revolution – allows the industry to rethink how chips are built to accommodate these growing demands of AI.

In a typical server rack, the compute chip alone can consume more than 50 percent of the power budget. Engineers are looking for any method to find ways to reduce this number, every watt counts.

It’s no surprise that in this search, the world’s largest AI hyperscalers have turned to Arm to reduce power. Arm’s latest Neoverse CPU is the most high-performant, power-efficient processor for cloud data centers versus the competition. Neoverse offers hyperscalers the flexibility to customize their silicon to optimize for their demanding workloads, all while delivering leading performance and energy efficiency. Every watt saved enables more compute. This is why Amazon, Microsoft, Google, and Oracle have now all adopted Arm Neoverse technology to solve both general-purpose compute and CPU-based AI inference and training. Arm Neoverse is on the path to being the de-facto standard across cloud data centers.

Consider the data from recent announcements:

AWS Arm-based Graviton: 25 percent faster performance for Amazon Sagemaker for AI inference, 30 percent faster for web applications, 40 percent faster for databases, and 60 percent more efficient than competition.

Google Cloud Arm-based Axion: 50 percent more performance and 60 percent better energy efficiency compared to legacy competition architectures, powering CPU-based AI inference and training, YouTube, Google Earth, among others.

Microsoft Azure Arm-based Cobalt: 40 percent performance improvement over competition, powering services such as Microsoft Teams and coupling with Maia accelerators to drive Azure’s end-to end AI architecture.

Oracle Cloud Arm-based Ampere Altra Max: 2.5 times more performance per rack of servers at 2.8 times less power versus traditional competition and being used for generative AI inference models – summarization, tokenization of data for LLM training, and batched inference use cases.

It’s evident that Arm Neoverse has enabled vast improvements on performance and power-efficiency for general-purpose compute in the cloud. However, customers are now finding the same benefits for accelerated computing. Large-scale AI training requires unique accelerated computing architectures, like the NVIDIA Grace Blackwell platform (GB200), which combines NVIDIA’s Blackwell GPU architecture with the Arm-based Grace CPU. This Arm-based computing architecture enables system-level design optimizations that reduce energy consumption by 25x and provide a 30x increase in performance per GPU compared to NVIDIA H100 GPUs using competitive architectures for LLMs. These optimizations, which deliver game-changing performance and power savings, are only possible thanks to the unprecedented flexibility for silicon customization that Arm Neoverse enables.

As Arm deployments broaden, these companies could save upwards of 15% the total data center power. Those enormous savings could then be used to drive additional AI capacity within the same power envelope and not add to the energy problem. To put it in perspective, these energy savings could run 2 billion additional ChatGPT queries, power a quarter of all daily web search traffic, light 20 percent of American households, or power a country the size of Costa Rica.

That’s a staggering impact on both energy consumption and environmental sustainability.

At a foundational level, Arm CPUs are powering the AI revolution while benefiting the planet.

The future of AI compute is built on Arm.

By Rene Haas, Chief Executive Officer, Arm

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Brian Fuller & Jack Melling

editorial@arm.com

Subscribe to Blogs and Podcasts

Get the latest blogs & podcasts direct from Arm

Media Information

Latest on X

; Arm @Arm ·

7h 2028443982125035933

Hello, Barcelona 👋

#MWC26 is underway, and Arm is at the center of the conversations shaping the next era of intelligent compute, from devices, data centers and networks.

From cloud to edge to physical devices, AI runs on Arm.

See you on the show floor. 📍

Reply on Twitter 2028443982125035933 Retweet on Twitter 2028443982125035933 6 Like on Twitter 2028443982125035933 20 Twitter 2028443982125035933

; Arm @Arm ·

22h 2028225109266207146

The architectural decisions we make now will shape AI economics for years.

At #MWC26, Chris Bergey will discuss why efficiency — across data, compute and energy — is central to scaling AI from cloud to billions of devices.

Join the conversation on Wednesday in Barcelona:

Reply on Twitter 2028225109266207146 Retweet on Twitter 2028225109266207146 9 Like on Twitter 2028225109266207146 33 Twitter 2028225109266207146

; Arm @Arm ·

28 Feb 2027561421165256777

AI doesn’t run on accelerators alone. Arm CPUs form the foundation of modern AI infrastructure, orchestrating workloads, managing data movement and delivering the scalable performance that makes AI systems work.
https://okt.to/shncLo

Reply on Twitter 2027561421165256777 Retweet on Twitter 2027561421165256777 7 Like on Twitter 2027561421165256777 33 Twitter 2027561421165256777

; Arm @Arm ·

27 Feb 2027373898384183604

Marco built Reachy Phone Home so Reachy Mini can detect when you’re on your phone, using @Ultralytics YOLO26 vision, and respond in real time with voice + motion.

Built on Arm (Apple Mac / Raspberry Pi 5) with @huggingface 🤗 + @pollenrobotics 🦾, it’s now an award-winning

Reply on Twitter 2027373898384183604 Retweet on Twitter 2027373898384183604 26 Like on Twitter 2027373898384183604 136 Twitter 2027373898384183604

; Arm @Arm ·

26 Feb 2027142771748782259

Access to healthcare starts with being seen.

For over a decade, we've worked alongside @Simprints to support digital identity solutions that help connect patients to the care they need.

In collaboration with @gavi, this approach enables clearer visibility across immunization

Reply on Twitter 2027142771748782259 Retweet on Twitter 2027142771748782259 2 Like on Twitter 2027142771748782259 9 Twitter 2027142771748782259

; Arm @Arm ·

26 Feb 2027085540152033331

AI systems don’t just run models. They schedule, retrieve context, coordinate services and operate continuously. 🧠

Those responsibilities fall to the CPU, the engine that makes modern AI systems viable at scale across heterogeneous infrastructure.

https://okt.to/dt618F

Reply on Twitter 2027085540152033331 Retweet on Twitter 2027085540152033331 4 Like on Twitter 2027085540152033331 20 Twitter 2027085540152033331

; Arm @Arm ·

26 Feb 2027079494197039514

AI is converging the worlds of devices, data centers and networks — and it’s all coming together at #MWC26 next week.

At the center of this shift is the Arm compute platform, powering the products and services that define modern digital life. 📍 Join us in Barcelona to see how.

Reply on Twitter 2027079494197039514 Retweet on Twitter 2027079494197039514 5 Like on Twitter 2027079494197039514 28 Twitter 2027079494197039514

Arm’s Mission to Help Tackle AI’s Insatiable Energy Needs

Reimagining the future of AI – a future powered by Arm

Editorial Contact

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X