Blog

September 10, 2024

Unlocking New Real-world Generative AI Use Cases on the Mobile CPU

Armv9 CPU technologies are unleashing innovative generative AI use cases that run entirely on today’s AI-enabled flagship smartphone devices.

By Ronan Naughton, Director, Product Management, Client Line of Business, Arm

In 2022, the first example of generative AI emerged through text to image generation in the cloud. The text prompt was “a photograph of an astronaut riding a horse”, with the generative AI workload creating an image of just that. While there were some issues with the image, it showcased the awe-inspiring power and potential of generative AI workloads.

Rather than running this use case in the cloud, I remember thinking to myself at the time “this is great, but could it ever be processed entirely on a mobile device?”

Generative AI is (already) part of today’s smartphone experience

Fast-forward to today and you can. In fact, many generative AI workloads, like image generation and text summarization, that are now a common part of the modern smartphone experience are being processed at the edge – on the device. This is largely thanks to the computing capabilities of today’s AI-enabled flagship smartphones and the large language models (LLMs) behind generative AI becoming smaller and more efficient. And these trends will continue to evolve, with generative AI set to be a part of every single mobile application in the near future.

AI workloads start on the CPU

As we’ve talked about previously, AI on mobile starts on the CPU. It offers software flexibility and programmability for the world’s developers. On top of this, the ubiquity of the CPU, which features in every single digital consumer device on the planet, means developers can “write once, deploy everywhere” when creating their applications, ensuring they reach the widest number of users.

Earlier this year, we demonstrated a chatbot demo deployed to be a virtual teaching assistant for science and coding that runs on mobile on the CPU. The success of the demo meant we started exploring other practical generative AI mobile use cases that run on the Arm CPU and could be used every day by the average smartphone user. This led to the creation of two new demos – group chat summarization and voice note summarization. Like the chatbot demo, these process and run generative AI workloads entirely on the device, which provides privacy, latency and cost benefits compared to sending the data to the cloud to be processed.

The new generative AI demos

For me personally, group chat and voice note summarization are brilliant life hacks. Like most smartphone users, I can get inundated with various messages and voice notes from friends and family, so being able to use generative AI to summarize what’s been said is invaluable.

The group chat summarization demo quickly distills group chat messages with multiple participants down to the key points in an easily digestible format. Even though the demo itself is showcasing group chat messages, it can be used for other applications like summarizing emails. This use case could also be multimodal and even include pictures as part of the summarization.

The voice note summarization demo shows how an LLM and speech-to-text model can work together in a pipeline to summarize and transcribe voice notes sent to users, with the model converting the voice note to text and the LLM then summarizing the text. For me personally, this demo is a real time-saver!

For both of the demos, we use AI-enabled flagship smartphones that adopt Armv9 CPU technologies in their chipsets, including Google Pixel 8 and Pixel 8 Pro (Google Tensor G3 chipset), Xiaomi Redmi K60 Ultra and vivo X100 (MediaTek Dimensity 9300 chipset). The Armv9 CPU technologies integrate the latest architecture features for enhanced AI performance, including SVE2.

In the future, AI-enabled flagship smartphones built on Arm CPUs will utilize the Scalable Matrix Extension (SME) architectural feature, which accelerates AI workloads and enables improved performance, power efficiency and flexibility for AI-based applications running on the Arm CPU.

Looking to the future

While present day possibilities from generative AI are incredible, the future is likely to be even more exciting. In fact, I believe that we are scratching the surface of generative AI on mobile, particularly with image and video generation.

Recently, OpenAI showcased text to video generation, and Luna Labs demoed image to video generation. While both generative AI workloads are being processed in the cloud, if we follow the current trajectory, then there is no reason why they couldn’t be processed on mobile on the CPU in two years’ time – just like the astronaut riding the horse!

Generative AI on mobile runs on Arm

With so many different use cases and workloads that are possible, generative AI is consolidating the smartphone as the center of personal and professional compute. This makes it a hugely exciting time for generative AI in the mobile space.

Through our ubiquitous CPU technologies that feature in 99 percent of the world’s smartphones and industry-leading mobile ecosystem, Arm is the company which is enabling these amazing possibilities.

As we continue to add more capabilities and architecture features to the Arm CPU, alongside unlocking yet more AI performance for developers through Arm Kleidi, Arm is the mobile platform for the future of AI.

AI runs on Arm

Learn about how Arm is accelerating AI everywhere, from cloud to edge.

Read Now

By Ronan Naughton, Director, Product Management, Client Line of Business, Arm

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Arm Editorial

editorial@arm.com

Subscribe to Blogs and Podcasts

Get the latest blogs & podcasts direct from Arm

Blog

Jul 18, 2024

KleidiAI Integration Brings AI Performance Uplifts to Google AI Edge’s MediaPipe

Ronan Naughton, Director, Product Management, Client Line of Business, Arm

Blog

May 15, 2024

Generative AI is on Mobile and it’s Powered by Arm

James McNiven, Vice President of Product Management, Client Line of Business, Arm

Blog

May 29, 2024

Accelerating AI Developer Innovation Everywhere with New Arm Kleidi

Geraint North, Fellow, AI and Developer Platforms, Arm

Blog

May 23, 2024

Scalable Matrix Extension (SME) for Armv9 Architecture Enables AI Innovation on the Arm CPU

Arm Editorial Team

Blog

May 29, 2024

New Armv9 CPUs for Accelerating AI on Mobile and Beyond

Saurabh Pradhan, Director, CPU Product Management, Client Line of Business, Arm

Blog

Jun 03, 2024

Arm at Computex 2024: The path to 100+ Billion Arm Devices Ready for AI by 2025

Arm Editorial Team

Media Information

Latest on X

; Arm @Arm ·

6h 2014009922023809108

We're proud to be ranked #5 on @Glassdoor’s Best Places to Work UK 2026! 🎉

It’s down to our teams and the 10x Mindset, focusing on thinking bigger, simpler solutions, and smarter ways to deliver impact.

Thank you to everyone at Arm who keeps pushing boundaries and building the

Reply on Twitter 2014009922023809108 Retweet on Twitter 2014009922023809108 0 Like on Twitter 2014009922023809108 2 Twitter 2014009922023809108

; Arm @Arm ·

20h 2013793778016927767

At #wef26 in Davos, Rene Haas joined global leaders from @nscale_cloud, @SchneiderElec, @Bloom_Energy, Sweden’s Ministry of Enterprise (@BuschEbba), and @VVVijayEconBiz to discuss how AI’s rapid growth is reshaping the race for compute and clean energy.

Rene shared why the

Reply on Twitter 2013793778016927767 Retweet on Twitter 2013793778016927767 2 Like on Twitter 2013793778016927767 13 Twitter 2013793778016927767

; Arm @Arm ·

22h 2013761975998722287

In the rapidly evolving future of AI one thing is clear: performance-per-watt efficiency, architectural flexibility, and scalable AI infrastructure across Converged AI data centers are the driving force for continued innovation. And that's exactly why hyperscalers are shifting to

Reply on Twitter 2013761975998722287 Retweet on Twitter 2013761975998722287 2 Like on Twitter 2013761975998722287 9 Twitter 2013761975998722287

; Arm @Arm ·

23h 2013750421236392181

“It’s not an if question. It’s a when question.”

Speaking with @CNBC at #wef26 in Davos, Rene Haas shared his view on AI’s trajectory.

As AI accelerates, the challenge is enabling it efficiently at global scale. At Arm, we’re building the compute platform designed to make that

Reply on Twitter 2013750421236392181 Retweet on Twitter 2013750421236392181 4 Like on Twitter 2013750421236392181 15 Twitter 2013750421236392181

; Arm @Arm ·

20 Jan 2013711023505637552

Modular silicon ✅
Open standards ✅
System-level thinking ✅

It's the future of compute built on Arm.

Austin Lyons @theaustinlyons

I joined @Arm's podcast to talk chiplets, from early motivation to what comes next. Listen here:

https://newsroom.arm.com/podcasts/chiplets-trends-austin-lyons

Reply on Twitter 2013711023505637552 Retweet on Twitter 2013711023505637552 4 Like on Twitter 2013711023505637552 14 Twitter 2013711023505637552

; Arm @Arm ·

20 Jan 2013616528327938256

We’re in Davos for #wef26! ⛰️

AI is driving real decisions around infrastructure and energy, and those topics are front and center here.

Today, Rene Haas joins two live conversations:
🔋 Racing for Compute and its Endgame: https://okt.to/LG9xsV - 4:15pm CET
📰 FT Live:

Reply on Twitter 2013616528327938256 Retweet on Twitter 2013616528327938256 2 Like on Twitter 2013616528327938256 10 Twitter 2013616528327938256

; Arm @Arm ·

16 Jan 2012285999175352631

Secrets out! 🤭

The Arm compute platform is the technology behind many of your favorite devices - from gaming to wearables and beyond. We're enabling the future of edge AI by delivering high performance AND efficiency.

👀 @jennaezarik 👀:

The Hidden Tech Behind CES 2026’s Biggest Devices! (Arm Tech Tour)

#Ad I’m at CES with @arm showing you the tech quietly powering some of the smartest devices here. From gaming ...

okt.to

Reply on Twitter 2012285999175352631 Retweet on Twitter 2012285999175352631 10 Like on Twitter 2012285999175352631 21 Twitter 2012285999175352631

Unlocking New Real-world Generative AI Use Cases on the Mobile CPU

Generative AI is (already) part of today’s smartphone experience

AI workloads start on the CPU

The new generative AI demos

Looking to the future

Generative AI on mobile runs on Arm

AI runs on Arm

Editorial Contact

Related

KleidiAI Integration Brings AI Performance Uplifts to Google AI Edge’s MediaPipe

Generative AI is on Mobile and it’s Powered by Arm

Accelerating AI Developer Innovation Everywhere with New Arm Kleidi

Scalable Matrix Extension (SME) for Armv9 Architecture Enables AI Innovation on the Arm CPU

New Armv9 CPUs for Accelerating AI on Mobile and Beyond

Arm at Computex 2024: The path to 100+ Billion Arm Devices Ready for AI by 2025

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X