Blog

December 18, 2020

Cloud AI, Edge AI, Endpoint AI. What’s the Difference?

Arm technology enables AI throughout the compute spectrum. Here, we explain the benefits and limitations of AI compute in cloud, edge and endpoint devices.

By Arm Editorial Team

Today, the majority of what we call artificial intelligence (AI) is machine learning (ML), a subset of AI that involves machines learning from sets of data. In general, the greater the amount of data to learn from, the more the AI is able to infer meaning and the more useful it becomes.

Hundreds of thousands of gigabytes of data are generated every day in AI applications ranging from consumer devices to healthcare, logistics to smart manufacturing. With this much data generated, the key consideration is where that data should be processed.

Arm defines three categories within the compute spectrum: cloud, edge and endpoint. We can employ ML to process data within each of these but choosing the most suitable category isn’t as simple as going where the most compute performance is—as performance is only one governing factor in ensuring the learnings inferred from data remain useful.

What is Cloud AI?

Cloud AI refers to AI processing within powerful cloud data centers. For a long time, cloud AI was the obvious choice of compute platform to crunch enormous amounts of data. Were it not for the concept of shunting data from the edge and endpoint into cloud servers for hyper-efficient processing, AI would not be at the stage of maturity it enjoys today.

It’s likely the majority of AI heavy lifting will always be performed in the cloud due to its reliability, cost-effectiveness and concentration of compute—especially when it comes to training machine learning (ML) algorithms on historic data that doesn’t require an urgent response. Many consumer smart devices rely on the cloud for their ‘intelligence’: for example, today’s smart speakers give the illusion of on-device intelligence yet the only on-device AI they are capable of is to listen out for the trigger word (‘keyword spotting’).

Cloud AI is undisputed in its ability to solve complex problems using ML. Yet as ML’s use cases grow to include many mission-critical, real-time applications, these systems will live or die on how quickly decisions can be made. And when data has to travel thousands of miles from device to data center, there’s no guarantee that by the time it has been received, computed and responded to it will still be useful.

Applications such as safety-critical automation, vehicle autonomy, medical imaging and manufacturing all demand a near-instant response to data that’s mere milliseconds old. The latency introduced in asking the cloud to process that weight of data would in many cases reduce its value to zero.

It’s for this reason that many companies are now looking past the cloud to processing AI elsewhere in the compute infrastructure, moving compute nearer the data.

What is Edge AI?

In a world where data’s time to value or irrelevancy may be measured in milliseconds, the latency introduced in transferring data to the cloud threatens to undermine many of the Internet of Things (IoT’s) most compelling use cases.

Edge AI moves AI and ML processing from the cloud to powerful servers at the edge of the network such as offices, 5G base stations and other physical locations very near to their connected endpoint devices. By moving AI compute closer to the data, we eliminate latency and ensure that all of that data’s value is retained.

Basic devices such as network bridges and switches have given way to powerful edge servers that add data center-level hardware into the gateway between endpoint and cloud. These powerful new AI-enabled edge servers, driven by new platforms such as Arm Neoverse, are designed to increase compute while decreasing power consumption, creating massive opportunities to instrument our cities, factories, farms, and environment to improve efficiency, safety, and productivity.

Edge AI has the potential to benefit both the data and the network infrastructure itself. At a network level, it could be used to analyze the flow of data for network prediction and network function management, while enabling edge AI to make decisions over the data itself offers significantly reduced backhaul to the cloud, negligible latency and improved security, reliability and efficiency across the board.

Another key function of edge AI is sensor fusion: combining the data from multiple sensors to create complex pictures of a process, environment or situation. Consider an edge AI device in an industrial application, tasked with combining data from multiple sensors within a factory to predict when mechanical failure might occur. This edge AI device must learn the interplay between each sensor and how one might affect the other and apply this learning in real-time.

There’s also a key security and resilience benefit in moving sensitive data no further than the edge: The more data we move to a centralized location, the more opportunities arise for that data’s integrity to be compromised. As the nature of compute changes, the edge is playing an increasingly crucial role in supporting diverse systems with a range of power and performance requirements. To deliver on service level agreements at scale for enterprises, the edge must embrace cloud-native software principles.

Arm is enabling this through Project Cassini, an open, collaborative, standards-based initiative to deliver a cloud-native software experience across a secure Arm edge ecosystem.

What is Endpoint AI?

Arm defines endpoint devices as physical devices connected to the network edge, from sensors to smartphones and beyond. As so much data is generated at the endpoint, we can maximise the insight we gain from that data by empowering endpoint devices to think for themselves and process what they collect without moving that data anywhere.

Due to their powerful internal hardware, smartphones have long been a fertile test-bed for endpoint AI. A smartphone camera is a prime example: it’s gone from something that takes grainy selfies to being secure enough for biometric authentication and powerful enough for computational photography – adding background blur (or a pair of bunny ears) to selfies in real-time.

This technology is now finding its way into smaller IoT devices. You may hear it referred to as the ‘AIoT’. In February 2020, Arm announced its solution for adding AI into even the smallest Arm-powered IoT devices. The Arm Cortex-M55 CPU and Arm Ethos-U55 micro neural processing unit (microNPU) combine to boost the performance of Arm-based Internet of Things (IoT) solutions by nearly 500 times—while retaining the trademark energy-efficient, cost-effective benefits our technology is known for. This technology will help to bring the benefits of Arm-powered compute to the IoT’s most challenging environments.

TinyML is an emerging sub-field of Endpoint AI, or AIoT, that enables ML processing in some of the very smallest endpoint devices containing microcontrollers no bigger than a grain of rice and consuming mere milliwatts of power.

Of course, endpoint AI also has its limitations: these devices are far more constrained in terms of performance, power and storage than edge AI and cloud AI devices. Data collected by one endpoint AI sensor can also have limited value on its own, as without the ‘top-down’ view of other data streams that sensor fusion at the edge enables, it is harder to see the full picture.

A combined, secure approach

Cloud AI, Edge AI and Endpoint AI each have their strengths and limitations. Arm’s range of heterogeneous compute IP scales the complete compute spectrum, ensuring that whatever your AI workload, Arm has a solution to enable it to be processed efficiently by putting intelligent compute power where it makes the most sense.

Most importantly, Arm technology ensures that data used in AI processing remains secure, from cloud to edge to endpoint. The Arm Platform Security Architecture (PSA) provides a platform, based on industry best-practice, that enables security to be consistently designed in at both a hardware and firmware level, while PSA Certified assures device manufacturers that their IoT devices are built secure. Within Arm processors, Arm TrustZone security technology simplifies IoT security and offers the ideal platform on which to build a device that adheres to PSA principles.

Powering innovation through AI

AI is empowering change, driving innovation, and creating exciting new possibilities. Arm is forging a path to the future with solutions designed to support the rapid development of AI. Discover how Arm combines the hardware, software, tools, and strategic partners you need to accelerate development.

By Arm Editorial Team

Article Text

Copy Text

Any re-use permitted for informational and non-commercial or personal use only.

Editorial Contact

Brian Fuller and Jack Melling

editorial@arm.com

Subscribe to Blogs and Podcasts

Get the latest blogs & podcasts direct from Arm

Media Information

Latest on X

; Arm @Arm ·

8h 2022158315183120561

Arm is expanding in Austin with support from the Texas Semiconductor Innovation Fund grant!

This move will strengthen Texas’ role in global semiconductor innovation, creating 320+ jobs, new advanced lab capabilities, and building momentum for future compute and AI.👏

Reply on Twitter 2022158315183120561 Retweet on Twitter 2022158315183120561 4 Like on Twitter 2022158315183120561 13 Twitter 2022158315183120561

; Arm @Arm ·

15h 2022058409843900480

Arm’s share among top hyperscalers is expected to reach nearly 50% ⚡

That shift reflects a bigger reality: AI isn’t just about accelerators — CPUs power the data pipeline and system orchestration behind AI, while maximizing performance per watt at scale.

@TheFuturumGroup's

Reply on Twitter 2022058409843900480 Retweet on Twitter 2022058409843900480 3 Like on Twitter 2022058409843900480 26 Twitter 2022058409843900480

; Arm @Arm ·

11 Feb 2021615315126211052

This National Apprenticeship Week, we’re proud to highlight Ayo Giwa, who completed a apprenticeship with us and has since moved into a graduate role, a brilliant example of where this pathway can lead. 👏 https://okt.to/p7em6M

Reply on Twitter 2021615315126211052 Retweet on Twitter 2021615315126211052 3 Like on Twitter 2021615315126211052 25 Twitter 2021615315126211052

; Arm @Arm ·

11 Feb 2021386234736624104

Advancing on-device AI starts with extending the CPU architecture to better support machine learning workloads.

With Exynos 2600, @SamsungDSGlobal adopts Arm Scalable Matrix Extension 2 (SME2), accelerating matrix operations on CPUs and expanding the role of CPU-based AI for

Reply on Twitter 2021386234736624104 Retweet on Twitter 2021386234736624104 25 Like on Twitter 2021386234736624104 187 Twitter 2021386234736624104

; Arm @Arm ·

10 Feb 2021325825786757570

Commodity infrastructure was built for a different era. 🦖

As AI training and inference scale, efficiency and system level optimization are becoming critical. Purpose-built platforms designed end-to-end are emerging as the new foundation for AI infrastructure.

Reply on Twitter 2021325825786757570 Retweet on Twitter 2021325825786757570 3 Like on Twitter 2021325825786757570 23 Twitter 2021325825786757570

; Arm @Arm ·

6 Feb 2019884565108511226

AI hype or real transformation? 🫧

On the latest episode of The Master Investor podcast with @WilfredFrost - Arm CEO Rene Haas breaks down where Arm sits in the AI value chain, how data centers and edge computing are evolving, and what today’s AI surge gets right and wrong.

Reply on Twitter 2019884565108511226 Retweet on Twitter 2019884565108511226 8 Like on Twitter 2019884565108511226 22 Twitter 2019884565108511226

; Arm @Arm ·

6 Feb 2019837004851065241

Where is on-device AI delivering real value today?

At a @GSMAi panel, Steve Raphael, Senior Director, Smartphone at Arm joined industry leaders to explore how on-device AI is shaping smartphones, wearables, PCs and enterprise devices.

Watch the full conversation here

Reply on Twitter 2019837004851065241 Retweet on Twitter 2019837004851065241 3 Like on Twitter 2019837004851065241 16 Twitter 2019837004851065241

Cloud AI, Edge AI, Endpoint AI. What’s the Difference?

What is Cloud AI?

What is Edge AI?

What is Endpoint AI?

A combined, secure approach

Powering innovation through AI

Editorial Contact

Media Information

Company Overview & History

Arm Corporate Guidelines

Media Contacts

Latest on X