7 innovations from Arm that you shouldn't miss from November 2025
Innovations in artificial intelligence (AI) and advanced compute continued to accelerate in November 2025, and Arm remained at the centre of that momentum. From new ways to run large language models (LLMs) on everyday devices to deeper architectural insights that shape next-generation performance, the month highlighted how Arm and our ecosystem are driving progress across mobile, graphics, automotive, and cloud.
Rethinking the CPU across AI workflows
Retrieval-Augmented Generation (RAG) combines two key parts of an AI workflow: retrieving relevant information from a knowledge base and generating responses using an LLM. In modern systems, this workflow relies on both the CPU, which handles data retrieval, filtering, and orchestration, and the GPU, which manages model inference.
Odin Shen, Principal Solutions Architect, explored this hybrid workflow on the NVIDIA DGX Spark platform, which pairs the Arm-based Grace CPU with the Blackwell GPU. Odin breaks down how Grace manages retrieval and pipeline control while Blackwell handles model execution, all supported by a unified memory architecture that reduces unnecessary data movement.
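To make that division of labour concrete, here is a minimal RAG loop in Python. It is a sketch only: the embed, retrieve, and generate helpers are hypothetical stand-ins for a real embedding model, vector store, and LLM runtime, with the retrieval side representing the CPU's share of the work and the generation call representing what the GPU would run.

```python
# Minimal RAG loop sketch. All helpers are illustrative stand-ins,
# not Arm's or NVIDIA's actual pipeline code.
import numpy as np

def embed(text: str) -> np.ndarray:
    """Stand-in embedding: hash words into a fixed-size vector (CPU work)."""
    vec = np.zeros(64)
    for word in text.lower().split():
        vec[hash(word) % 64] += 1.0
    return vec / (np.linalg.norm(vec) or 1.0)

def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """CPU side: rank documents by cosine similarity to the query."""
    q = embed(query)
    scores = [float(q @ embed(doc)) for doc in corpus]
    top = sorted(range(len(corpus)), key=lambda i: scores[i], reverse=True)[:k]
    return [corpus[i] for i in top]

def generate(prompt: str) -> str:
    """GPU side in a real system: the LLM inference call. Stubbed here."""
    return f"<LLM answer conditioned on: {prompt[:80]}...>"

corpus = [
    "The CPU handles retrieval and orchestration.",
    "The GPU executes the transformer layers.",
    "Unified memory avoids host-device copies.",
]
context = "\n".join(retrieve("Which processor runs inference?", corpus))
print(generate(f"Context:\n{context}\n\nQuestion: Which processor runs inference?"))
```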
On-device audio generation with ExecuTorch and Stability AI
At October's PyTorch Conference 2025, Gian Marco Iodice, Principal Software Engineer, demonstrated how users can generate audio entirely on-device using Stability AI's Stable Audio Open Small model, running on ExecuTorch.
Running entirely on an Arm-powered Android device, this demo showcases how ExecuTorch enables efficient, offline generative AI workloads directly on-device, with no cloud connectivity required. The application converts text prompts into high-quality 44 kHz audio, highlighting the power and flexibility of Arm CPUs for edge AI and creativity at scale.
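For developers curious about the export side, the sketch below shows ExecuTorch's documented lowering path with a toy PyTorch module standing in for Stable Audio Open Small (which is far larger and uses its own pipeline). Exact flags and backend partitioners vary by ExecuTorch release, so treat this as an outline rather than the demo's actual build script.

```python
# Hedged sketch of the ExecuTorch export flow with a toy model.
import torch
from executorch.exir import to_edge

class TinyModel(torch.nn.Module):
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return torch.nn.functional.relu(x) * 2.0

model = TinyModel().eval()
example_inputs = (torch.randn(1, 16),)

# 1. Capture the model graph with torch.export.
exported = torch.export.export(model, example_inputs)

# 2. Lower to the Edge dialect, then to an ExecuTorch program.
edge = to_edge(exported)
et_program = edge.to_executorch()

# 3. Serialize to a .pte file that the on-device runtime loads.
with open("tiny_model.pte", "wb") as f:
    f.write(et_program.buffer)
```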
Exploring LLMs on Android and ChromeOS with AI Chat
LLMs are increasingly running directly on devices rather than relying on cloud connectivity, offering faster responses, stronger privacy, and more predictable performance since all processing stays local. The challenge is often finding a simple way to evaluate different models on real hardware without setting up complex environments or dependencies.

Arm introduced AI Chat, a lightweight app that lets users explore and evaluate multiple LLMs directly on Android and ChromeOS devices. As explained by Han Yin, Staff AI Technology Engineer, AI Chat makes on-device testing accessible to a broader audience and removes friction from evaluating LLM performance by offering a clear, consistent environment that works across diverse Arm-powered devices. The app also detects the device's hardware, recommends suitable models, and shows real-time performance metrics such as tokens per second and time to first token.
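As an illustration of what those metrics mean, the sketch below measures time-to-first-token and tokens-per-second around a stubbed streaming generator. It is not AI Chat's actual implementation; fake_stream is a hypothetical stand-in for a real on-device LLM.

```python
# Illustrative measurement of TTFT and tokens/s around a streaming LLM stub.
import time
from typing import Iterator

def fake_stream() -> Iterator[str]:
    """Stand-in for a streaming LLM; yields one token at a time."""
    for token in "on device inference keeps data local".split():
        time.sleep(0.05)  # simulate per-token decode latency
        yield token

start = time.perf_counter()
first_token_at = None
count = 0
for token in fake_stream():
    if first_token_at is None:
        first_token_at = time.perf_counter()
    count += 1
end = time.perf_counter()

ttft = first_token_at - start               # time to first token (seconds)
tps = (count - 1) / (end - first_token_at)  # steady-state decode tokens/s
print(f"TTFT: {ttft * 1000:.0f} ms, decode rate: {tps:.1f} tokens/s")
```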
Introducing Arm Virtual FAE
When teams evaluate IP for a new chip or product, they often need clear explanations, comparisons, and guidance. This normally involves speaking with Field Application Engineers (FAEs) who help interpret specifications, answer technical questions, and explain trade-offs.
Matt Rowley, Senior Product Manager, introduces the Virtual FAE, an AI-powered assistant built directly into Arm IP Explorer. Chip architects, engineers, and product teams now have a faster, more intuitive way to explore Arm IP, with immediate clarity on which IP fits their goals, whether they are designing edge devices, automotive platforms, or high-performance compute systems.
The next step in Arm CPU AI acceleration
Modern AI workloads rely heavily on matrix operations, the core mathematical building blocks behind neural networks, image processing, and signal analysis. Traditionally, these operations are handled by GPUs or dedicated accelerators, but CPUs also play a critical role. To keep pace with growing on-device AI demand, CPUs need built-in capabilities that make matrix math faster, more efficient, and easier for software to use.

Arm Scalable Matrix Extension 2 (SME2) is Arm's advanced matrix-processing extension, designed to accelerate AI and high-performance compute directly on the CPU. Zenon (Zhilong) Xiu, Senior Principal Applications Engineer, has now released a new technical breakdown that explains how SME2 works, why its design choices matter, and where it provides the biggest performance advantages. Xiu also covers LUT (lookup-table) enhancements, expanded vector handling, and the practical differences developers should expect in real workloads.
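To ground the terminology, the NumPy sketch below mimics, at a conceptual level, two things SME2 is designed to speed up: lookup-table dequantization of 4-bit weights and a tiled matrix multiply. It is purely illustrative; real SME2 code uses intrinsics or assembly and streams tiles through hardware accumulators.

```python
# Conceptual sketch of LUT dequantization plus a tiled matmul.
# Pure NumPy, purely illustrative; not SME2 code.
import numpy as np

rng = np.random.default_rng(0)

# 4-bit weight indices and a 16-entry lookup table of real values.
w_idx = rng.integers(0, 16, size=(64, 64), dtype=np.uint8)
lut = np.linspace(-1.0, 1.0, 16, dtype=np.float32)

x = rng.standard_normal((8, 64)).astype(np.float32)

# LUT step: expand 4-bit indices to float weights. SME2's LUT
# instructions let this expansion happen in-register.
w = lut[w_idx]

# Tiled matmul: accumulate the output block by block, mirroring how a
# matrix engine streams tiles through its accumulators.
tile = 16
out = np.zeros((8, 64), dtype=np.float32)
for j in range(0, 64, tile):
    for k in range(0, 64, tile):
        out[:, j:j + tile] += x[:, k:k + tile] @ w[k:k + tile, j:j + tile]

assert np.allclose(out, x @ w, atol=1e-3)
print("tiled result matches reference:", out.shape)
```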
A clearer look at the future of automotive compute
Software-defined vehicles (SDVs) shift core vehicle functions into software that can be updated, improved, and expanded over time. As AI becomes central to perception, decision-making, and in-cabin intelligence, the industry is now moving toward AI-defined vehicles (AIDVs), in which real-time AI inference, sensor fusion, and predictive behaviour shape how the vehicle responds to the world.
Prakash Mohapatra, Senior Product Manager, offers a deeper, more structured look at what AIDVs require from the compute platform. The breakdown explains why AIDVs need scalable heterogeneous compute, how domain and zonal architectures change workload distribution, and which safety and real-time constraints designers must consider.
Understanding Vulkan Subpasses
Vulkan is a low-level graphics API designed to give developers more control over how rendering workloads are scheduled and executed. One of its features, subpasses, allows multiple rendering operations to be combined within a single render pass. Used well, subpasses can reduce memory bandwidth, improve tiling efficiency, and boost performance on mobile GPUs. But used in the wrong situations, they can introduce unnecessary complexity, stall pipelines, or reduce clarity in the rendering flow.
Peter Harris, Distinguished Engineer, offers a practical, real-world explanation of subpasses: not just how they work, but when developers should or should not use them. This guidance goes further than standard API documentation by unpacking actual performance trade-offs, showing examples of optimal usage, and highlighting scenarios where subpasses deliver no benefit or even degrade performance.