February 10, 2021

Cambridge Consultants: How We’re Pushing the Endpoint AI Envelope

One year after Arm launched the Cortex-M55 CPU and Ethos-U55 microNPU, we've achieved a 7x power reduction and a 1,000x speed increase in endpoint AI

By Michal Gabrielczyk, Head of Edge AI, Cambridge Consultants

Artificial intelligence (AI) may have grown up in the cloud, but delivering transformational products and services means taking AI out of the data center and into the real world.

We are one of the world’s leading product development and technology consultancy firms, and our technologies can be found in homes and hospitals, in satellite networks and even inside the human body. Many of these applications now use endpoint AI, enabling us to turn raw sensor data into context and meaning on the device itself without sending it to the cloud.

But enabling this level of intelligence in endpoint devices with stringent size, cost, power and connectivity constraints is no small task. It requires two key things: robust silicon and a deep understanding of the design trade-offs in power and performance to maximize the latter while maintaining or even reducing the former.

When Arm announced the Cortex-M55 processor and Arm Ethos-U55 micro neural processing unit (NPU) exactly one year ago today, we jumped at the chance to see just how far we could push the power-performance envelope. The results of a year of socially-distanced, determined experimentation with the Cortex-M55 and Ethos-U55 aren’t just impressive—they’re game-changing.

Cortex-M55 + Ethos-U55: A step-change in what’s possible with endpoint AI

As an Arm Approved Design Partner, we were able to put this new AI duo through its paces not long after the launch last February.

Our initial research involved migrating our ultra-low power Voice Activity Detection (VAD) reference design from the Cortex-M3 to the Cortex-M55 and Ethos-U55. We wanted to compare the new platform against earlier ones we knew well in order to explore its capabilities.

We quickly achieved a remarkable 7x reduction in average power, yet a 1,000x increase in core speed. It was clear to us then that this wasn’t just the next generation of Cortex-M microcontroller: this was a step-change in what’s possible in endpoint AI.

On-device voice detection that doesn’t need to send all its data to the cloud has major benefits in latency, privacy and power, and this kind of voice detection is going to become increasingly important to the consumer market in the coming years.

The uplift in performance we experienced in porting our VAD reference design to the Cortex-M55 and Ethos-U55 opened up a number of new, previously impossible avenues, such as adding vision alongside voice detection.

But it also gave us the confidence to really see how far we could stretch the capabilities of these chips.

Pushing the limits of AI medical applications at the endpoint

Putting scepticism firmly to one side, we began to wonder if we could port something as large and complex as a cloud-based deep learning application to this microprocessor duo, and in doing so prove that with the right optimization and silicon IP, even complex neural networks can be deployed on very low power edge devices.

The application we chose centered on a concept system developed by Cambridge Consultants to improve treatment monitoring of tuberculosis (TB) in resource-limited countries by combining AI with a smartphone to capture images from a laboratory microscope. Stained sputum sample images were originally analyzed using a deep learning algorithm in the cloud to identify, count and classify infected cells to determine the disease state of the patient.

To give you an idea of scale, this treatment monitoring application is 350 times more computationally complex than a typical object detection application using the MobileNet V2 neural network, which is commonly used in industry. MobileNet V2 requires a single inference per image of around 0.8 billion multiply-accumulate operations (MACs), whereas this research required 70 inferences of around 4 billion MACs each per image, or around 280 billion MACs per image in total.
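The 350x figure follows from the numbers quoted above. Here is a minimal sketch of that arithmetic in Python, using only the figures given in this article (the variable names are illustrative):

```python
# Figures quoted in the article; MACs = multiply-accumulate operations.
MOBILENET_V2_MACS_PER_IMAGE = 0.8e9  # a single inference per image
TB_INFERENCES_PER_IMAGE = 70         # inferences needed per microscope image
TB_MACS_PER_INFERENCE = 4e9          # approximate MACs per inference

# Total work per image for the TB treatment monitoring application.
tb_macs_per_image = TB_INFERENCES_PER_IMAGE * TB_MACS_PER_INFERENCE

# How many times more work that is than one MobileNet V2 inference.
complexity_ratio = tb_macs_per_image / MOBILENET_V2_MACS_PER_IMAGE

print(f"TB application: {tb_macs_per_image:.2e} MACs per image")
print(f"Ratio to MobileNet V2: {complexity_ratio:.0f}x")
```

Because the inputs are themselves approximate ("around"), the ratio is best read as an order-of-magnitude comparison rather than a precise benchmark.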

The port was not only successful: we achieved similar run times and accuracy levels to the application’s former cloud deployment yet drew just a few watts in the process. These power reductions were achieved through understanding and optimizing the network implementation during the translation and quantization stages, which had a dramatic effect on the run-time, power consumption and accuracy during the cloud-to-endpoint migration.
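For readers unfamiliar with quantization, the general idea can be sketched in a few lines of Python. This article does not describe the scheme actually used, so the example below is only a generic illustration of 8-bit affine quantization, not Cambridge Consultants' method. The function names and the sample weights are hypothetical.

```python
def quantize_int8(values):
    """Map floats to int8 using a generic affine (scale + zero-point) scheme."""
    # The representable range must include 0.0 so that zero maps exactly.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / 255.0 or 1.0  # fall back to 1.0 for all-zero input
    zero_point = round(-128 - lo / scale)
    # Round to the nearest step, shift by the zero point, clamp to int8.
    q = [max(-128, min(127, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point


def dequantize(q, scale, zero_point):
    """Recover approximate float values from their int8 representation."""
    return [(x - zero_point) * scale for x in q]


weights = [-1.2, -0.3, 0.0, 0.45, 2.1]  # made-up example values
q, scale, zero_point = quantize_int8(weights)
restored = dequantize(q, scale, zero_point)
max_error = max(abs(a - b) for a, b in zip(weights, restored))

print("int8 values:", q)
print("max round-trip error:", max_error)
```

Each value is stored in one byte instead of four, at the cost of a small rounding error. That trade between accuracy on one side and memory, run-time and power on the other is the kind of balance the paragraph above refers to.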

Wide-ranging applications for endpoint AI

The applications for this research are huge: real-time medical AI can be deployed in low power endpoint devices and used in settings where Internet connectivity is unavailable, or bulky and power-hungry computing equipment would be impractical.

It also opens the door to the further benefits of processing AI data on endpoints, including lower latency and lower power, all whilst leaving room to improve privacy and security, since data does not leave the user’s device.

This research is directly applicable to many other applications and markets, enabling device manufacturers to move complex AI workloads into everyday consumer devices, factories, and even smart cities. This is a topic discussed further in Cambridge Consultants’ recent whitepaper.

From signal processing in billions of mobile phones to AI in smart inhalers, Cambridge Consultants has generated billions of dollars of value for our clients by creating and optimizing world-leading silicon platforms. As an Approved Design Partner and Functional Safety Partner, we consider it our duty to see how far we can push the latest Arm IP in order to demonstrate to Arm, our customers and the world just how powerful endpoint AI can be.

Unlock the Benefits of Artificial Intelligence for IoT Devices

Arm offers new compute technologies coupled with software and tools to help companies streamline the design, development, and support for AI-based IoT applications.

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Brian Fuller and Jack Melling

editorial@arm.com
