Blog

June 10, 2026

Supermicro and Arm advance compute for the agentic AI era

By Dilip Ramachandran, Senior Director of Segment Marketing, Cloud AI, Arm

At COMPUTEX, Supermicro announced a new class of servers designed to meet the rapidly growing compute demands of the Agentic AI era. Powered by Arm’s recently introduced AGI CPU, these systems deliver industry-leading compute density and power efficiency for next-generation AI inference and agentic workloads.

AI infrastructure is entering its inference era

Since the launch of ChatGPT in late 2022, AI infrastructure conversations have largely centered around GPUs. Data center expansion over the past several years has been driven by the race to deploy more accelerated compute for large-scale model training. However, the AI landscape is evolving quickly. Unlike first-generation AI deployments that focused primarily on model training, agentic AI workloads are persistent, distributed, and inference-driven. They require systems capable of handling orchestration, retrieval, reasoning, and real-time decision making at scale.

This shift is driving a new wave of infrastructure requirements where efficient CPU compute plays a foundational role in maximizing overall AI system performance. As workloads shift from training to inference — and increasingly toward autonomous, multi-step agentic AI systems — CPUs are becoming a critical component of modern AI infrastructure.

Agentic AI introduces a fundamentally different compute profile. Unlike traditional chatbot-style interactions, agentic systems continuously orchestrate reasoning, memory access, retrieval, planning, and communication across multiple services and models. These workflows generate massive demand for highly efficient general-purpose compute, memory bandwidth, and I/O scalability alongside GPU acceleration.

To address this shift, Arm introduced the AGI CPU in March 2026. Built with up to 136 Arm Neoverse V3 cores, 12 DDR5 memory channels at up to 8800 MT/s, and PCIe Gen6 connectivity within a 300W power envelope, the AGI CPU is designed to deliver exceptional compute density and energy efficiency for AI-first data centers. Arm AGI CPU with leading performance per core combined with high core density, high memory bandwidth per core and industry leading power efficiency enables up to 2x higher performance per rack to comparable x86-based solutions, according to Arm estimates.

Purpose-built infrastructure for next-generation AI workloads

Supermicro’s new server and rack-scale portfolio brings the AGI CPU capabilities to market across cloud, enterprise, and edge deployments.

For hyperscale and neocloud AI infrastructure, Supermicro unveiled the liquid-cooled Open Rack Wide (ORW) platform, the ARS-142TP-QNR-LCC. A fully populated ORW rack can support up to 336 AGI CPUs, enabling massive compute density for cloud-scale agentic AI and inference workloads.

For customers adopting Open Rack V3 (ORV3) environments, Supermicro also introduced the liquid-cooled 2U4N ORV3 ARS-242TP-QNR-LCC server, enabling up to 168 AGI CPUs per rack while maintaining deployment flexibility for modern data centers. Both the ORW and ORV3 systems are targeted for sampling in Q1 2027, with production availability in Q2 2027.

Supermicro is also extending AGI CPU support into air-cooled environments. For edge deployments with constrained power and space requirements, the single-socket ARS-212HE-FNR short depth server provides an optimized platform for distributed AI inference and edge computing applications. The system is targeted to sample in Q4 2026 and reach production in Q1 2027.

For general-purpose compute workloads, the dual-socket 2U ARS-222H-NR server supports up to 8 NVMe drives and additional accelerator expansion in a standard 19-inch form factor. These servers are ideally suited for a wide variety of data center workloads such as web and application serving, databases and analytics, virtualization and cloud infrastructure, and media and content processing applications.

Meanwhile, the 5U ARS-522GP-NR platform targets high-performance AI inference deployments with support for up to eight accelerator cards alongside dual AGI CPUs and high-density NVMe storage. These platforms are targeted to sample during Q3 ’26 and released to production in Q1 ’27.

Together, these platforms highlight an important industry transition: the future of AI infrastructure will not be defined by GPU performance alone. As agentic AI scales across enterprises and cloud providers, balanced architectures that combine high-performance CPUs, accelerators, memory bandwidth, and efficient system design will become essential.

At the same time, power efficiency and data center scalability are becoming increasingly critical. As enterprises look to deploy AI broadly across cloud, enterprise, and edge environments, infrastructure must deliver higher compute density without unsustainable increases in power and cooling requirements. This is where platforms built around the AGI CPU can provide a significant advantage by enabling scalable AI compute with improved performance-per-watt.

With this portfolio based on the AGI CPU, Supermicro is helping customers build AI infrastructure optimized for the realities of agentic computing — from hyperscale inference clusters to enterprise and edge deployments. As the industry moves toward AI systems that can autonomously reason, collaborate, and act, the combination of efficient CPU compute and accelerated AI infrastructure will form the backbone of the next generation of data centers.

By Dilip Ramachandran, Senior Director of Segment Marketing, Cloud AI, Arm

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Arm Editorial Team

editorial@arm.com

Stay informed with Arm's top stories, insights, and conversations.

Blog

Mar 24, 2026

Announcing Arm AGI CPU: The silicon foundation for the agentic AI cloud era

Mohamed Awad, Executive Vice President, Cloud AI, Arm

News

Jun 02, 2026

Oracle Cloud Infrastructure joins the Arm AGI CPU ecosystem as agentic AI accelerates

Mohamed Awad, Executive Vice President, Cloud AI, Arm

Blog

Mar 24, 2026

A comprehensive guide to understanding Arm Neoverse

Arm Editorial Team

Podcast

May 19, 2026

Inside Arm’s AGI CPU: The journey from IP to silicon

Media Information

Latest on X

; Arm @Arm ·

4h 2079633202465927308

Congratulations to @NVIDIA on a major Vera Rubin milestone. 👏

Backed by 300 global partners and built around the Arm-based Vera CPU, the platform shows how compute designed for modern AI infrastructure can move beyond the limits of legacy, off-the-shelf CPUs.

NVIDIA @nvidia

🚀 The NVIDIA Vera Rubin platform is here, with 10x better performance per watt.

➡️ The NVIDIA ecosystem, including @CoreWeave, @GoogleCloud, @Microsoft, and @Oracle Cloud, are standing up NVIDIA Vera Rubin NVL72 to deliver the lowest token cost for the agentic era.
➡️ NVIDIA

Reply on Twitter 2079633202465927308 Retweet on Twitter 2079633202465927308 5 Like on Twitter 2079633202465927308 31 Twitter 2079633202465927308

; Arm @Arm ·

5h 2079624221920055544

For Jason Child, joining Arm was a “once-in-a-lifetime opportunity.”

Jason joins the Secrets of Rockstar CFOs podcast by @stratcfo360 to discuss his journey, going public and our extension into silicon.

Listen to the full conversation on Spotify: https://okt.to/e9NbfM

Reply on Twitter 2079624221920055544 Retweet on Twitter 2079624221920055544 2 Like on Twitter 2079624221920055544 9 Twitter 2079624221920055544

; Arm @Arm ·

10h 2079534826357473587

Congratulations to @XPENG_Global on the launch of the all-new #XPENGL03. 👏

The next-generation AI SUV coupe brings new intelligent driving and connected in-car experiences to XPENG’s global portfolio.

We’re excited to be part of the ecosystem enabling the next generation of

XPENG @XPENG_Global

XPENG evolves into a global Physical AI company.
Meet the all-new XPENG L03.
Featuring the European debut of XPENG NGP.
In Europe, For Europe.
$XPEV

Reply on Twitter 2079534826357473587 Retweet on Twitter 2079534826357473587 7 Like on Twitter 2079534826357473587 26 Twitter 2079534826357473587

; Arm @Arm ·

20 Jul 2079327518742442187

In an ecosystem as broad as ours, there's no single path to innovation.

We understand that different workloads, business goals, and stages of development call for different approaches - which is why we're continuing to expand the ways our partners can build on Arm. 💡

Reply on Twitter 2079327518742442187 Retweet on Twitter 2079327518742442187 48 Like on Twitter 2079327518742442187 81 Twitter 2079327518742442187

; Arm @Arm ·

20 Jul 2079131490869194806

Congratulations are in order for Arm CEO Rene Haas on being named to the Observer's 2026 AI Power Index.

As AI demand continues to grow, so does the need for compute platforms and infrastructure that enable AI at scale.

This recognition highlights the leaders shaping the

Reply on Twitter 2079131490869194806 Retweet on Twitter 2079131490869194806 5 Like on Twitter 2079131490869194806 43 Twitter 2079131490869194806

; Arm @Arm ·

17 Jul 2078253437255336430

Organizations need the flexibility to build, deploy, and scale AI in the way that works best for them. That means supporting different business goals, different deployment models, and different levels of integration.

A strong compute ecosystem should enable organizations to:

Reply on Twitter 2078253437255336430 Retweet on Twitter 2078253437255336430 3 Like on Twitter 2078253437255336430 27 Twitter 2078253437255336430

; Arm @Arm ·

17 Jul 2078194550456140028

Autonomous robots are beginning to reshape how construction work gets done. 🧱🤖

As physical AI takes on more complex tasks, the compute beneath these systems becomes critical, delivering the performance, efficiency and software ecosystem needed to scale.

Reply on Twitter 2078194550456140028 Retweet on Twitter 2078194550456140028 2 Like on Twitter 2078194550456140028 18 Twitter 2078194550456140028

Supermicro and Arm advance compute for the agentic AI era

AI infrastructure is entering its inference era

Purpose-built infrastructure for next-generation AI workloads

Editorial Contact

Stay informed with Arm's top stories, insights, and conversations.

Related

Announcing Arm AGI CPU: The silicon foundation for the agentic AI cloud era

Oracle Cloud Infrastructure joins the Arm AGI CPU ecosystem as agentic AI accelerates

A comprehensive guide to understanding Arm Neoverse

Inside Arm’s AGI CPU: The journey from IP to silicon

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X