Accelerating AI Developer Innovation Everywhere with New Arm Kleidi
In the ever-evolving, fast-paced age of AI, we are steadfast in supporting the millions of developers worldwide, ensuring they have access to the performance, tools, and software libraries needed to seamlessly create the next wave of stunning AI-enabled experiences.
This is why we are launching Arm Kleidi, a broad program of software and software community engagements for accelerating AI. The first of these is the Arm Kleidi libraries for popular AI frameworks, which let developers transparently access the outstanding AI capabilities of the pervasive Arm CPU, where most of the world’s AI inference workloads from cloud to edge already run today. Developers can leverage over 20 years of Arm architectural innovation that has consistently improved AI capabilities and performance, from the Armv7 architecture, which first introduced the Advanced Single Instruction Multiple Data (SIMD) Extension for machine learning (ML) workloads, to today’s Armv9 architecture, which incorporates features that accelerate and protect advanced generative AI workloads on the Arm CPU.
Featuring KleidiAI for all AI workloads and KleidiCV for best-in-class computer vision (CV) workloads on Arm CPUs across all tiers, the Kleidi libraries will be embedded directly into popular AI frameworks, with no action needed by developers. Developers can therefore frictionlessly tap the AI capabilities of the Arm CPU to build their AI-based applications quickly, at the highest possible performance, and across the broadest range of devices.
Accelerating AI
KleidiAI is our answer to the explosion in device types, neural networks, and inference engines. It is a collection of highly optimized AI kernels that deliver high performance in use cases such as generative AI. The beauty of KleidiAI is that rather than giving developers extra work, we are working directly with leading AI frameworks, including MediaPipe (via XNNPACK), llama.cpp, PyTorch (via ExecuTorch), and TensorFlow Lite (via XNNPACK), to integrate KleidiAI. This accelerates the development process and unlocks AI performance, giving developers performance by default so they can seamlessly create the best possible AI experiences. KleidiAI also provides forward-looking compatibility to ensure developers can take full advantage of future AI acceleration opportunities as we bring additional technologies to market.
The integration of KleidiAI is already translating into significant performance improvements for generative AI workloads. It accelerates time-to-first-token for Meta’s Llama 3 and Microsoft’s Phi-3 LLMs running on llama.cpp by 190 percent on the new Arm Cortex-X925 CPU compared with the reference implementation (llama.cpp without our Kleidi software optimizations). KleidiAI is so easy to integrate that it took Arm’s engineering teams less than 24 hours to measure this optimized performance for Llama 3. In addition, through the KleidiAI integration with MediaPipe via XNNPACK, which supports the Gemma open LLM on mobile, time-to-first-token for Gemma 2B improves by 25 percent on the Google Pixel 8 Pro smartphone.
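To make the metric above concrete, the following minimal sketch shows how time-to-first-token is typically measured: the clock starts when the prompt is submitted and stops when the first token is decoded. The `generate_tokens` function here is a hypothetical stand-in for an inference engine such as llama.cpp, not Arm's or Meta's actual code.

```python
import time

def generate_tokens(prompt):
    """Hypothetical stand-in for an LLM inference engine (e.g. llama.cpp
    bindings); yields tokens one at a time as they are decoded."""
    for token in prompt.split():
        time.sleep(0.01)  # simulate per-token decode latency
        yield token

def time_to_first_token(prompt):
    """Latency from submitting the prompt until the first token arrives:
    the metric the performance figures above refer to."""
    start = time.perf_counter()
    first_token = next(generate_tokens(prompt))
    return first_token, time.perf_counter() - start

token, ttft = time_to_first_token("The quick brown fox")
print(f"first token: {token!r}, time-to-first-token: {ttft:.3f}s")
```

Time-to-first-token matters for interactive experiences because it is the delay the user actually perceives before a response begins to appear.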
Finally, we are working with Unity on Sentis, its on-device AI inference engine that empowers game developers to create innovative, AI-driven gameplay experiences on all devices that support the Unity Game Engine. After integrating KleidiAI, Unity Sentis was able to enable int4 quantization, reducing model memory utilization by 72.5 percent and improving performance by 660 percent when running the Phi-2 LLM.
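A back-of-envelope estimate shows why int4 quantization yields savings of roughly this size. The sketch below assumes a common block-wise scheme (4-bit weights plus one fp16 scale per group of 32) compared against fp16 weights; the exact scheme Unity Sentis uses is not described in this post, so the numbers are illustrative only.

```python
def int4_savings(params, group_size=32):
    """Estimate memory reduction from fp16 weights to block-wise int4
    (4-bit weights plus one fp16 scale per group). Illustrative only:
    the actual quantization scheme may differ."""
    fp16_bytes = params * 2                                 # 2 bytes per weight
    int4_bytes = params * 0.5 + (params / group_size) * 2   # weights + scales
    return 1 - int4_bytes / fp16_bytes

# Phi-2 has roughly 2.7 billion parameters
reduction = int4_savings(2.7e9)
print(f"estimated memory reduction: {reduction:.1%}")
```

Under these assumptions the estimate lands near 72 percent, in the same range as the 72.5 percent figure reported above; real savings depend on group size, scale precision, and which tensors are quantized.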
For more information about KleidiAI, read this blog.
Accelerating CV
KleidiCV accelerates the CV pipelines used in many camera use cases. OpenCV, the world’s largest CV library, containing over 2,500 algorithms and supporting hundreds of thousands of developers, has already measured a typical performance uplift of 75 percent across a variety of image processing tasks from its KleidiCV integration. As part of our strategic software partnership with OpenCV, we are also bringing OpenCV Android builds to Maven Central, a repository of open-source software components and libraries for Java development, for the very first time.
For more information about KleidiCV, read this blog.
The benefits of AI on the CPU
Arm Kleidi focuses on accelerating AI capabilities on the CPU because, in most cases, AI workloads start out running on the CPU, making it the easiest path for developers to target. The more performant we make this path, the more likely developers are to keep using and targeting the CPU throughout the development process. Moreover, as LLMs become smaller and more efficient, a growing number of AI workloads will make sense to process on the CPU. The end result is a smoother, more seamless development process that optimizes the performance of developers’ AI workloads.
Building the future of AI on Arm
The introduction of Arm Kleidi re-emphasizes Arm’s role as the leading compute platform for on-device generative AI. It enables developers to access the exceptional AI performance of the Arm CPU across the widest array of hardware possible without the need to learn additional tools and skills. As we continuously innovate our leading-edge architecture for the next generation of AI, developers will have access to even greater, more advanced AI capabilities in the future. For the end-user, this means exceptional AI experiences that are faster, more intelligent, more interactive, more immersive, and more secure.
Arm Kleidi is just the start, with more libraries, compute kernels, and engine integrations planned for the future. Watch this space for further updates as we continue to build the future of AI on Arm.
Any re-use permitted for informational and non-commercial or personal use only.