Blog

February 10, 2021

Cambridge Consultants: How We’re Pushing the Endpoint AI Envelope

One year since Arm launched the Cortex-M55 CPU and Arm Ethos-U55 microNPU, we've achieved an incredible 7x power reduction, 1,000x speed increase in endpoint AI

By Michal Gabrielczyk, Head of Edge AI, Cambridge Consultants

Artificial intelligence (AI) may have grown up in the cloud but delivering transformational products and services means taking AI out of the data center and into the real world.

As one of the world’s leading product development and technology consultancy firms, our technologies can be found in homes and hospitals, in satellite networks and even inside the human body. Many of these applications now use endpoint AI, enabling us to turn raw sensor data into context and meaning on the device itself without sending it to the cloud.

The results of a year of socially-distanced, determined experimentation with the Cortex-M55 and Ethos-U55 aren’t just impressive—they’re game-changing.

But enabling this level of intelligence in endpoint devices with stringent size, cost, power and connectivity constraints is no small task. It requires two key things: robust silicon and a deep understanding of the design trade-offs in power and performance to maximize the latter while maintaining or even reducing the former.

When Arm announced the Cortex-M55 processor and Arm Ethos-U55 micro neural processing unit (NPU) exactly one year ago today, we jumped at the chance to see just how far we could push the power-performance envelope. The results of a year of socially-distanced, determined experimentation with the Cortex-M55 and Ethos-U55 aren’t just impressive—they’re game-changing.

Cortex-M55 + Ethos-U55: A step-change in what’s possible with endpoint AI

As an Arm Approved Design Partner, it wasn’t long after the launch last February that we were able to put this new AI duo through its paces.

Our initial research involved migrating our ultra-low power Voice Activity Detection (VAD) reference design from the Cortex-M3 to the Cortex-M55 and Ethos-U55. We wanted to draw comparison with earlier platforms that we were familiar with to explore the capabilities.

We quickly achieved a remarkable 7 times reduction in average power, yet 1,000 times increase in core speed. It was clear to us then that this wasn’t just the next generation of Cortex-M microcontroller: this was a step-change in what’s possible in endpoint AI.

On-device voice detection that doesn’t need to send all its data to the cloud has major benefits in latency, privacy and power, and this kind of voice detection is going to become increasingly important to the consumer market in the coming years.

The incredible uplift in performance we experienced in porting our VAD reference design to the Cortex-M55 and Ethos-U55 opened up a number of new previously impossible avenues, such as including vision alongside voice detection.

But it also gave us the confidence to really see how far we could stretch the capabilities of these chips.

Pushing the limits of AI medical applications at the endpoint

Putting scepticism firmly to one side, we began to wonder if we could port something as large and complex as a cloud-based deep learning application to this microprocessor duo, and in doing so prove that with the right optimization and silicon IP, even complex neural networks can be deployed on very low power edge devices.

The application we chose centered on a concept system developed by Cambridge Consultants to improve treatment monitoring of tuberculosis (TB) in resource-limited countries by combining AI with a smartphone to capture images from a laboratory microscope. Stained sputum sample images were originally analyzed using a deep learning algorithm in the cloud to identify, count and classify infected cells to determine the disease state of the patient.

To give you an idea of scale, this treatment monitoring application is 350 times more computationally complex than a typical object detection application using the MobileNet V2 neural network, which is commonly used in industry. MobileNet V2 requires a single inference per image of around 0.8 billion multiplier-accumulators (MACs), whereas this research required 70 inferences of around 4 billion MACs each per image.

The port was not only successful: we achieved similar run times and accuracy levels to the application’s former cloud deployment yet drew just a few Watts in the process. These power reductions were achieved through understanding and optimizing the network implementation during the translation and quantization stages, which had a dramatic effect on the run-time, power consumption and accuracy during the cloud to endpoint migration.

Wide-ranging applications for endpoint AI

The applications for this research are huge: real-time medical AI can be deployed in low power endpoint devices and used in settings where Internet connectivity is unavailable, or bulky and power-hungry computing equipment would be impractical.

It also opens the door to combine further benefits of processing AI data on endpoints, including lower latency and lower power. All whilst leaving room to improve privacy and security, since data does not leave the users device.

This research is directly comparable to many other applications and markets, enabling device manufacturers to move complex AI workloads into everyday consumer devices, factories, and even smart cities. This is a topic discussed further in Cambridge Consultants’ recent whitepaper.

From signal processing in billions of mobile phones, to AI in smart inhalers, Cambridge Consultants has generated billions of dollars of value for our clients, by creating and optimizing world-leading silicon platforms. As an Approved Design Partner and Functional Safety Partner, we consider it our duty to see how far we can push the latest Arm IP in order to demonstrate to Arm, our customers and the world just how powerful endpoint AI can be.

Unlock the Benefits of Artificial Intelligence for IoT Devices

Arm offers new compute technologies coupled with software and tools to help companies streamline the design, development, and support for AI-based IoT applications.

Discover More

By Michal Gabrielczyk, Head of Edge AI, Cambridge Consultants

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Brian Fuller and Jack Melling

editorial@arm.com

Subscribe to Blogs and Podcasts

Get the latest blogs & podcasts direct from Arm

Media Information

Latest on X

; Arm @Arm ·

14h 1977892633335726574

We've been appointed to the @OpenComputePrj Board of Directors, marking a key step toward the next phase of AI infrastructure. 🎉

We're excited to play our part in advancing open, interoperable designs across the computing ecosystem, represented by Mohamed Awad:…

Reply on Twitter 1977892633335726574 Retweet on Twitter 1977892633335726574 2 Like on Twitter 1977892633335726574 15 Twitter 1977892633335726574

; Arm @Arm ·

15h 1977865453239181639

🗓️ Happening tomorrow at #OCPSummit25!

Mohamed Awad shares how energy-efficient compute, chiplet-based design, and open collaboration are transforming AI and cloud infrastructure — and shaping the next era of the data center.

📍 Tuesday | 09:05am https://okt.to/HBPtCK

Reply on Twitter 1977865453239181639 Retweet on Twitter 1977865453239181639 5 Like on Twitter 1977865453239181639 12 Twitter 1977865453239181639

; Arm @Arm ·

10 Oct 1976725756567789712

Confidence. Curiosity. Belonging.

That’s what fuels careers at Arm. Mamta Thangaraj, started in engineering, stepped into leadership, and now drives our data strategy. Her journey reminds us: when people feel supported, they thrive and innovation follows. https://okt.to/dUsbK4

Reply on Twitter 1976725756567789712 Retweet on Twitter 1976725756567789712 1 Like on Twitter 1976725756567789712 5 Twitter 1976725756567789712

; Arm @Arm ·

10 Oct 1976607298123596092

#OCPSummit25 is almost here!

See how we’re building open, efficient AI data centers with Arm Neoverse, open-chiplet design and production-ready software stacks - all happening at the show.

👉 Catch our speakers
📍 Visit us at booth B11

Reply on Twitter 1976607298123596092 Retweet on Twitter 1976607298123596092 1 Like on Twitter 1976607298123596092 11 Twitter 1976607298123596092

; Arm @Arm ·

9 Oct 1976347804252696812

As the industry evolves from software-defined to AI-defined vehicles, Arm Zena CSS provides the scalable, safety-certified foundation needed to accelerate development of next-generation automotive experiences. 🚗

Discover more in this Automotive Industries feature.…

Reply on Twitter 1976347804252696812 Retweet on Twitter 1976347804252696812 1 Like on Twitter 1976347804252696812 8 Twitter 1976347804252696812

; Arm @Arm ·

7 Oct 1975624690119926027

As AI demands grow, silicon architecture must evolve alongside it.

At #OCPSummit25, we’ll share how Arm Compute Subsystems and the Chiplet System Architecture make building custom silicon for AI faster, modular, and lower risk.

Sneak preview in our blog: https://okt.to/V8axsH

Reply on Twitter 1975624690119926027 Retweet on Twitter 1975624690119926027 4 Like on Twitter 1975624690119926027 14 Twitter 1975624690119926027

; Arm @Arm ·

6 Oct 1975248957899956317

We shared insights from the Tech Unheard podcast, hosted by our CEO Rene Haas, in this @HarvardBiz article.

Conversations with @Scale_AI, @Zoox, @nvidia & Chris Miller (Author of Chip War) reveal 4 signals shaping leadership in the AI era.

Four Strategic Signals Technology Leaders Are Tuning In To - SPONSOR CONTENT FROM ARM

Sponsor content from ARM.

okt.to

Reply on Twitter 1975248957899956317 Retweet on Twitter 1975248957899956317 2 Like on Twitter 1975248957899956317 10 Twitter 1975248957899956317

; Arm @Arm ·

4 Oct 1974267515258761379

Edge AI isn't just a vision, it's already changing industries with real-world impact. Here are 7 use cases demonstrating how Arm is bringing intelligence to devices everywhere.

https://okt.to/N4eLRv

Reply on Twitter 1974267515258761379 Retweet on Twitter 1974267515258761379 3 Like on Twitter 1974267515258761379 10 Twitter 1974267515258761379

; Arm @Arm ·

3 Oct 1974255926912635300

⚡Performance-per-watt leadership
⚡Scalable compute
⚡A vibrant, AI-ready software ecosystem

The biggest leaders in tech are building AI on Arm and our software ecosystem isn’t just keeping up, it’s meeting developers across the cloud-to-edge continuum.
https://okt.to/t0LVnl

Reply on Twitter 1974255926912635300 Retweet on Twitter 1974255926912635300 2 Like on Twitter 1974255926912635300 11 Twitter 1974255926912635300

; Arm @Arm ·

3 Oct 1974207395938324957

AI is forcing a new wave of silicon innovation.

On stage at @theallinpod Summit, Rene Haas shared how rapid advances in AI are accelerating silicon innovation, redefining manufacturing culture, and opening the door to physical AI at massive scale.

🎧:

Reply on Twitter 1974207395938324957 Retweet on Twitter 1974207395938324957 1 Like on Twitter 1974207395938324957 6 Twitter 1974207395938324957

; Arm @Arm ·

2 Oct 1973866600076820964

The AI era 🤝 Infrastructure that prioritizes performance and efficiency

We’re committed to delivering the foundation for AI everywhere - from hyperscale data centers to the edge - enabling innovation without compromise.

Arm’s AI Infrastructure Advantage: Efficiency Meets Innovation

AWS now ships 50% Arm-based compute, and other major cloud providers are following, as efficiency in the gigawatt ...

okt.to

Reply on Twitter 1973866600076820964 Retweet on Twitter 1973866600076820964 2 Like on Twitter 1973866600076820964 15 Twitter 1973866600076820964

; Arm @Arm ·

2 Oct 1973794875574272149

Unboxing the future of mobility 🚗

The @nvidia DRIVE AGX Thor Developer Kit, built on the NVIDIA Blackwell GPUs, Arm Neoverse V3AE CPUs and the NVIDIA DriveOS 7 software stack to deliver scalable, energy-efficient compute for real-time AI workloads.

Unboxing the NVIDIA DRIVE AGX Thor Developer Kit

The developer kit is built on the NVIDIA Blackwell architecture, next-generation Arm Neoverse V3AE CPUs and the NVID...

okt.to

Reply on Twitter 1973794875574272149 Retweet on Twitter 1973794875574272149 4 Like on Twitter 1973794875574272149 8 Twitter 1973794875574272149

; Arm @Arm ·

2 Oct 1973754944747421812

🗣️ “When you get an Arm SME2-enabled phone, it’s going to start feeling like a quality upgrade from what you’ve had before.” - Oliver Gaymond, @Google Android

Hear how SME2 and KleidiAI are powering everyday GenAI experiences across billions of devices:

https://okt.to/YfJZDx

Reply on Twitter 1973754944747421812 Retweet on Twitter 1973754944747421812 2 Like on Twitter 1973754944747421812 12 Twitter 1973754944747421812

; Arm @Arm ·

2 Oct 1973727186004689376

🏆 We’re proud to be named 2025 OIP Partner of the Year in the Processor IP category by TSMC!

This recognition highlights our shared commitment to partner success and innovation, as we work together to empower the ecosystem to bring new products to market faster and with…

Reply on Twitter 1973727186004689376 Retweet on Twitter 1973727186004689376 4 Like on Twitter 1973727186004689376 16 Twitter 1973727186004689376

; Arm @Arm ·

2 Oct 1973704031617827099

Only 29% of business leaders can scale compute to meet AI demand (Arm AI Readiness Index).

In this @FT article, we explore the structural mismatch between today’s infrastructure and tomorrow’s AI workloads and how smarter silicon design is key.

Why the world needs to rethink computing

As AI accelerates and proliferates, business decision-makers must prepare to rethink their organisations’ compute foundations

okt.to

Reply on Twitter 1973704031617827099 Retweet on Twitter 1973704031617827099 2 Like on Twitter 1973704031617827099 12 Twitter 1973704031617827099

; Arm @Arm ·

1 Oct 1973470731385098553

Running AI on billions of devices isn’t easy.

Compute, memory and energy demands are huge. But our power-efficient tech is built for it.

Rene Haas shares how we’re enabling AI to run locally, not just in the cloud. 🎙️

Full Tech Unheard episode: https://okt.to/TqmYZe

Reply on Twitter 1973470731385098553 Retweet on Twitter 1973470731385098553 2 Like on Twitter 1973470731385098553 12 Twitter 1973470731385098553

; Arm @Arm ·

1 Oct 1973468728021884936

What shapes a global tech leader?

Arm EVP and Chief Commercial Officer, Will Abbey recently joined David Savage on the Tech Talks podcast to explore exactly this and more.

If you're looking for a little motivation for your Wednesday give it a listen!

ARM's Will Abbey: From Ghana to Silicon Valley, a Global Leader's View on Tech

Tech Talks · Episode

okt.to

Reply on Twitter 1973468728021884936 Retweet on Twitter 1973468728021884936 2 Like on Twitter 1973468728021884936 9 Twitter 1973468728021884936

; Arm @Arm ·

1 Oct 1973455382568747309

Big congrats to @amazon on the next generation of Echo devices, built from the ground up for Alexa+.

With Alexa running on Arm, we’re proud to power the next generation of smart home experiences.

Amazon unveils the next generation of AI-powered Echo devices, purpose-built for Alexa+

Introducing our most advanced Echo devices yet, featuring premium audio, next-generation AI processing and Omnisense sen...

okt.to

Reply on Twitter 1973455382568747309 Retweet on Twitter 1973455382568747309 2 Like on Twitter 1973455382568747309 15 Twitter 1973455382568747309

; Arm @Arm ·

29 Sep 1972678003873452166

🌍 At Arm, we see sustainability as both a responsibility and a catalyst for progress.

From our power-efficient technology to empowering local communities, we’re laying the foundation for a more sustainable, intelligent world.

More in our report 👉

Reply on Twitter 1972678003873452166 Retweet on Twitter 1972678003873452166 2 Like on Twitter 1972678003873452166 10 Twitter 1972678003873452166

; Arm @Arm ·

24 Sep 1970963265837789390

📈“We’re on this AI demand curve that’s still climbing.”

Rene Haas joins @jimcramer on @SquawkStreet to explain why AI’s potential is far from realized, and how Stargate, with Arm as a technology partner, is opening new opportunities to unlock it📽️🔽

Arm CEO: We're still on this AI demand curve that's still climbing

Arm CEO Rene Haas joins CNBC’s Squawk on the Street to discuss the “Stargate” AI infrastructure project, next-...

www.cnbc.com

Reply on Twitter 1970963265837789390 Retweet on Twitter 1970963265837789390 3 Like on Twitter 1970963265837789390 14 Twitter 1970963265837789390

; Arm @Arm ·

24 Sep 1970868281667121396

"We need to talk. The chatbot is eating our runway."

Arm Software Developers @ArmSoftwareDev

Every. Dollar. Matters.

Vociply AI cut LLM inference costs by 35% migrating to Arm-based @awscloud Graviton. Allowing them to invest in additional AI features, enable more aggressive pricing strategies, and secure higher customer acquisition rates. https://okt.to/wQtbe0

Reply on Twitter 1970868281667121396 Retweet on Twitter 1970868281667121396 5 Like on Twitter 1970868281667121396 17 Twitter 1970868281667121396

; Arm @Arm ·

24 Sep 1970820540257518041

🎉 Congratulations to Richard Grisenthwaite, EVP & Chief Architect, on being elected as a Fellow at the @RAEngNews!

As one of the highest honors in UK engineering, this recognizes Richard's leadership and impact on the Arm compute platform 👏: https://okt.to/zte4Ch…

Reply on Twitter 1970820540257518041 Retweet on Twitter 1970820540257518041 3 Like on Twitter 1970820540257518041 18 Twitter 1970820540257518041

; Arm @Arm ·

23 Sep 1970602850355986841

As a Stargate partner, we’re celebrating 5 new U.S. data center sites.

Developed with @OpenAI, @Oracle & @SoftBank, the new sites will contribute to nearly 7 GW capacity in 3 years, building the infrastructure for the next generation of AI breakthroughs:

https://okt.to/Ahuor4

Reply on Twitter 1970602850355986841 Retweet on Twitter 1970602850355986841 3 Like on Twitter 1970602850355986841 15 Twitter 1970602850355986841

; Arm @Arm ·

23 Sep 1970557800192381008

Vehicles are becoming more intelligent, connected, and AI-defined. 🚗

In this exclusive @justauto op-ed, Dipti Vachani outlines the transformative implications of the rise of the AI-defined vehicle.

https://okt.to/2FZQaY

Reply on Twitter 1970557800192381008 Retweet on Twitter 1970557800192381008 2 Like on Twitter 1970557800192381008 12 Twitter 1970557800192381008

Cambridge Consultants: How We’re Pushing the Endpoint AI Envelope

Cortex-M55 + Ethos-U55: A step-change in what’s possible with endpoint AI

Pushing the limits of AI medical applications at the endpoint

Wide-ranging applications for endpoint AI

Unlock the Benefits of Artificial Intelligence for IoT Devices

Editorial Contact

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X