KleidiAI Integration Brings AI Performance Uplifts to Google AI Edge’s MediaPipe
At the end of May 2024, we launched Arm Kleidi, a set of broad software deliverables and community engagements for accelerating AI across the developer ecosystem. The first of these deliverables is the Arm Kleidi Libraries for popular AI frameworks, featuring Arm KleidiAI for unleashing CPU performance across AI workloads.
Arm is working directly with leading AI frameworks on KleidiAI integrations, which are already proving successful, bringing significant performance improvements to today's generative AI workloads, including leading large language models (LLMs). For developers, the KleidiAI integrations are completely seamless and transparent: there are no additional tools or skills to learn, so developers can move faster and extract maximum performance for their AI-based applications.
The work has been well-received throughout the industry, notably from the world’s largest technology companies. As part of his COMPUTEX 2024 keynote, Arm CEO Rene Haas shared testimonial videos of executives from Google, Meta and Samsung Mobile talking about how KleidiAI will enable them to accelerate AI innovation across multiple markets.
KleidiAI integration with Google AI Edge
The impact of KleidiAI is becoming a reality: we have worked with Google AI Edge to integrate it into the MediaPipe framework, which is accelerated on the Arm CPU through XNNPACK and supports numerous LLMs, including the Gemma 2B LLM. Thanks to the KleidiAI integration, we have seen a 30 percent improvement in time-to-first-token when running our chatbot summarization demo on the Gemma 2B LLM on Samsung's Galaxy S24 smartphone (Exynos 2400), which is built on Arm CPU technologies.
The performance improvements relate to how many tokens are processed per second: with the KleidiAI integration, around 250 tokens are processed each second, making the demo far more responsive. These are exciting results with positive implications for AI and ML developers, and we invite them to test KleidiAI powering Google AI Edge's MediaPipe using our new Learning Path.
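To make the two figures above concrete, here is a minimal sketch of how the metrics mentioned in this article, time-to-first-token and tokens processed per second, can be derived from wall-clock timestamps of a generation run. All names here are hypothetical illustration helpers, not part of MediaPipe or KleidiAI.

```kotlin
// Hypothetical helper for the two metrics discussed in the article:
// time-to-first-token (prefill latency) and decode throughput.
data class GenerationMetrics(
    val timeToFirstTokenMs: Double, // latency until the first output token
    val tokensPerSecond: Double     // throughput after the first token
)

fun computeMetrics(
    startMs: Long,        // when the prompt was submitted
    firstTokenMs: Long,   // when the first output token arrived
    endMs: Long,          // when generation finished
    tokensGenerated: Int  // total output tokens produced
): GenerationMetrics {
    val ttft = (firstTokenMs - startMs).toDouble()
    val decodeSeconds = (endMs - firstTokenMs) / 1000.0
    // Tokens after the first one, divided by the decode window.
    val tps = if (decodeSeconds > 0) (tokensGenerated - 1) / decodeSeconds else 0.0
    return GenerationMetrics(ttft, tps)
}
```

A 30 percent improvement in time-to-first-token shows up directly in the first field; the "around 250 tokens in one second" figure corresponds to the second.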
Arm was one of the first to demonstrate that LLMs can run on the CPU. We expanded this work with Google through the chatbot demo described above: an application that uses the MediaPipe APIs and the XNNPACK CPU backend, which KleidiAI in turn accelerates. This performance shows what is possible for LLMs on the CPU, and how it can enable many real-world AI use cases, including chatbots, smart reply, and message summarization.
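For developers who want to try the same path, a sketch of what such an application looks like on Android using the MediaPipe LLM Inference API is shown below. The model path is illustrative, and the option setters reflect the public Tasks GenAI API as we understand it; check the current MediaPipe documentation before relying on them. Note that KleidiAI requires no code at all from the developer: it is picked up automatically inside XNNPACK.

```kotlin
import android.content.Context
import com.google.mediapipe.tasks.genai.llminference.LlmInference

// Sketch: run a prompt through an on-device LLM via MediaPipe.
// Requires an Android device and the MediaPipe Tasks GenAI dependency.
fun summarize(context: Context, prompt: String): String {
    val options = LlmInference.LlmInferenceOptions.builder()
        // Illustrative path to a Gemma 2B model bundle pushed to the device.
        .setModelPath("/data/local/tmp/llm/gemma-2b-it-cpu-int4.bin")
        .setMaxTokens(512)
        .build()

    // On the CPU, inference runs through the XNNPACK backend, where the
    // KleidiAI integration accelerates the underlying matrix operations.
    val llm = LlmInference.createFromOptions(context, options)
    return llm.generateResponse(prompt)
}
```

The application code never references KleidiAI directly, which is what the article means by the integration being seamless and transparent.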
A series of firsts
This demo is part of a series of firsts for Arm and the wider AI developer ecosystem.
It is the first demonstration of Arm’s partnership with Google AI Edge on MediaPipe and XNNPACK to accelerate AI workloads for developers on the CPU. This is just the start of our work to bring best-in-class performance to XNNPACK, which is an open-source library of highly optimized neural network operators. As XNNPACK has over 7 billion third-party installs, the KleidiAI integration brings the best in AI performance on the CPU to the widest possible market.
“We are excited to support KleidiAI in Google AI Edge’s XNNPACK to accelerate AI workloads on current and future Arm CPUs. This allows AI developers to access existing and new Arm architecture features to deliver outstanding performance that will only improve over time,” said Matthias Grundmann, Google AI Edge Lead.
It is also the first in a series of Kleidi integrations happening over the coming months, in which Arm will enable many more LLMs to run as effectively and efficiently as possible on-device on the Arm CPU. By focusing on Kleidi integrations across the software ecosystem, Arm is making AI performance available across the broadest range of hardware and accessible to the broadest community of software developers who are building AI-based applications on Arm, for Arm.
Easy access to AI performance
For developers, the KleidiAI integrations accelerate the development process and unlock AI performance on the pervasive Arm CPU to deliver the very best AI-based experiences on devices. KleidiAI also works across all tiers of CPU that utilize our industry-leading architecture features, like Neon, SVE2 and the Scalable Matrix Extension (SME), enabling application developers to build portable software solutions.
As Rene Haas said in his COMPUTEX 2024 keynote: “If you don’t have something developers can get access to, the hardware is not going to do you much good.” Stay tuned for more Kleidi integrations and optimizations as we continue to build the future of AI on Arm.
Learn more about the new KleidiAI technical demo here.
Any re-use permitted for informational and non-commercial or personal use only.