Accelerating Cloud Innovation with AWS Graviton4 Processors, Powered by Arm Neoverse
The cloud computing landscape is undergoing a dramatic transformation, driven by the explosive growth of AI. As AI applications become increasingly sophisticated and demanding, the need for powerful, efficient, and cost-effective computing solutions has never been greater. Customers deploying their workloads in the cloud are rethinking what infrastructure they need to meet the requirements of these modern workloads. Their requirements range from achieving better performance and reduced costs to achieving new benchmarks in energy efficiency for regulatory or sustainability goals.
Arm and AWS have a long-standing collaboration aimed at providing specialized silicon and compute, paving the way for a more efficient, sustainable, and powerful cloud. This week at AWS re:Invent 2024, you’ll see more evidence for how Graviton4 marks a significant leap forward, empowering developers and businesses to unlock the full potential of their cloud workloads.
Exceptional Performance Benefits
The latest Arm Neoverse V2 based AWS Graviton4 processors provide up to 30% better compute performance, 50% more cores, and 75% more memory bandwidth than previous generation Graviton3 processors. Thanks to these advantages, we are now seeing a significant adoption of AWS Graviton processors in the ecosystem and by customers.
The Arm Neoverse V2 platform includes new capabilities of the Armv9 architecture, such as high-performance floating-point and vector instruction support, with features like SVE/SVE2, Bfloat16, and Int8 MatMul delivering strong performance for AI/ML and HPC workloads.
AI/ML Workloads
To further drive adoption of AI workloads Arm launched Arm Kleidi earlier this year, collaborating with leading AI frameworks and the software ecosystem to ensure the full ML stack can benefit from out-of-the-box inference performance optimizations on Arm, allowing developers to build their workloads without needing extra Arm-specific expertise. We’ve showcased how these optimizations in Pytorch enable running LLMs such as Llama 3 80B and Llama 3.1 8B on AWS Graviton4 with significantly improved tokens/sec and time-to-first-token metrics.
The performance metrics have been documented in details in these blogs for LLM Inferencing with PyTorch and LLM3 on Graviton4.
HPC and EDA Workloads
For HPC workloads, Graviton4 marks a significant leap forward in capability compared to Graviton3E providing 16% more main-memory bandwidth per core, and a doubling of L2 cache per vCPU. These are significant for HPC application performance which is often memory-bandwidth bound, and AWS has managed to achieve benefits across these areas as shown below.
For EDA workloads, Graviton4 delivers up to 20% better performance over Graviton3 for RTL simulation workloads as measured by production runs conducted by Arm’s engineering teams.
Ecosystem Adoption
Over the last few years, we have seen a continual ramp in adoption across the software ecosystem with end customers deploying a wide range of cloud workloads on AWS Graviton processors. Customers are saving money, seeing better performance, and improving their carbon and sustainability footprints. Here are a few examples:
Upcoming AWS re:Invent 2024
If you are visiting AWS re:Invent 2024, you can check out the following key sessions on a wide range of topics related to AWS Graviton processors. For a full list of more than 60+ sessions on AWS Graviton, check out the event’s official agenda.
Get ready to harness the power of Graviton
We believe the future of cloud computing is undoubtedly Arm-powered, and are proud to support AWS in placing Graviton at the forefront of this revolution. Arm continues to invest in further strengthening our software ecosystem and removing any friction for developers to build on Arm – and to access all the performance and efficiency benefits the Arm compute platform delivers.
Developer resources
Here are some key resources and avenues to engage directly with us and AWS Graviton teams:
- Learn.arm.com: Explore in-depth technical resources on Arm architecture.
- Arm Software Ecosystem Dashboard: Discover applications and tools optimized for Arm.
- AWS Graviton Dashboard: Access information and support for Graviton instances.
- AWS Getting Started GitHub: Explore Graviton-related code samples and tools.
- AWS Partner Program: Connect with industry leaders building solutions for Graviton.
To arrange a meet-up with an Arm representative at AWS re:Invent 2024, please contact sw-ecosystem@arm.com
Any re-use permitted for informational and non-commercial or personal use only.