November 18, 2020
SiMa.ai Sets Sights on High Performance, Low Power Endpoint AI
SiMa.ai’s Kavitha Prasad explains the driving force behind the company’s endpoint AI SoC and the wide-ranging applications it enables
By Kavitha Prasad, VP Business Development and System Applications, SiMa.ai
The lifecycle of an endpoint AI device may span years, even decades. Those that go the distance will be capable of processing the machine learning (ML) algorithms of the future.
While we may not know what those algorithms will look like, we can be sure that they will be more complex and more demanding than the workloads we task endpoint AI devices with today.
Today's endpoint AI devices are capable of around 4 or 5 tera operations per second (TOPS) per watt. That's enough for basic ML routines, yet it pales next to the AI compute a data center can offer.
Reducing the power profile of endpoint AI
SiMa.ai began as an ambition to shrink this performance divide: to redefine the performance associated with endpoint AI today. Yet achieving anything close to cloud-like performance in an endpoint AI device would require a marked reduction in power consumption, or rather, a significant increase in TOPS per watt.
With this goal in mind, we developed the MLSoC™ (Machine Learning System on Chip) platform, targeting a peak of 10 TOPS per watt. Within an embedded power profile of 5 watts, our ML accelerator can achieve up to 50 TOPS. That's enough to run AI workloads that would traditionally require cloud performance on a passively cooled endpoint AI device.
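The arithmetic behind these figures is straightforward: the available compute budget is simply efficiency (TOPS per watt) multiplied by the power budget. A minimal sketch using the numbers quoted above (the function name is ours, for illustration only):

```python
def compute_budget_tops(efficiency_tops_per_watt: float, power_watts: float) -> float:
    """Peak ML compute available within a given power budget."""
    return efficiency_tops_per_watt * power_watts

# SiMa.ai's stated target: 10 TOPS/W within a 5 W embedded power profile
mlsoc_tops = compute_budget_tops(10, 5)      # 50 TOPS

# Typical endpoint AI device today: ~4 TOPS/W at the same 5 W
typical_tops = compute_budget_tops(4, 5)     # 20 TOPS
```

The same 5-watt envelope yields more than twice the compute, which is the divide the article describes.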
We designed our heterogeneous MLSoC to process the workloads our customers have already created, but also to be future-proofed for upcoming workloads none of us has identified yet. Unlike a data center, which can be upgraded as new iterations of components come to market, the hardware embedded within an endpoint AI device is fixed the day it's baked into silicon.
Our solution to this challenge combines traditional
compute IP from Arm with our own machine learning accelerator and dedicated
vision accelerator. As the market leader in low power compute, Arm IP was the
obvious choice as a secure platform upon which to build our MLSoC. We chose the
Arm Cortex-A65 CPU after working closely with our customers to define the
compute requirements for their applications: it was a decision very much based
on customer needs, from performance down to software toolchain.
While it’s capable of a wide range of ML workloads
such as natural language processing (NLP), SiMa.ai’s MLSoC is initially optimized
for computer vision applications. Computer vision is already central to many
endpoint AI use cases,
from traffic cameras to manipulating selfies—and we believe its use will only
increase in future applications such as high-end surveillance, crowd control
and thermal scanning.
Computer vision unlocks future complex use cases for endpoint AI
Combining the vision accelerator with the ML accelerator also ensures the MLSoC can handle complex workloads such as fusing data from multiple sensors. This enables it to play a role in autonomous systems, from consumer autonomous vehicles to autonomous robots in industrial IoT settings. We also foresee a role for the MLSoC in aerospace and defense.
Some of these complex autonomous workloads require more than 50 TOPS. That's why we've designed the MLSoC to be modular: by combining multiple machine learning accelerator mosaics via a proprietary interconnect, we can scale from 50 TOPS at 5 watts up to 400 TOPS at 40 watts.
Given that today's level 5 autonomous vehicle prototypes draw around 4 kilowatts, that's potentially a 100x reduction in power consumption, along with a much smaller physical hardware footprint and a reduced need for active cooling.
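The scaling claim can be checked with a few lines of arithmetic using only the figures quoted above (the mosaic count is our inference from those figures, not a disclosed specification):

```python
# Figures quoted in the text
BASE_TOPS, BASE_WATTS = 50, 5      # one ML accelerator mosaic at 5 W
MAX_TOPS, MAX_WATTS = 400, 40      # maximum modular configuration

# Inferred number of mosaics needed to reach the top configuration
mosaics = MAX_TOPS // BASE_TOPS            # 8

# Efficiency is preserved as the design scales
efficiency = MAX_TOPS / MAX_WATTS          # 10.0 TOPS/W

# Today's level 5 AV prototype vs. the maximum MLSoC configuration
prototype_watts = 4000
reduction = prototype_watts / MAX_WATTS    # 100.0x
```

Note that efficiency stays at 10 TOPS per watt at both ends of the range, so the modular scaling is linear in both compute and power.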
There's another good reason for reducing power consumption in devices that will soon fill our world in the millions. Many of the OEMs and customers we talk to are keenly focused on bringing down their power profile so they can become carbon neutral by 2030 or earlier. That's reason enough for us to want to design something low power.
I believe that MLSoCs will play a key role in enabling low-power AI in edge and endpoint devices. But I also know that it's not enough to simply license a solution benchmarked to achieve a certain number of TOPS.
Many of the solutions on the market today advertise their performance based on benchmarks such as ResNet-50. But quoting frames per second or TOPS per watt only matters if it is achievable under real-world conditions, that is, on our customers' workloads.
Our customers want one thing: development velocity, or how quickly they can get to market. They don't want to spend months in development cycles trying to achieve the performance they've been promised; they want to license a solution and then add their own secret sauce using simple, comprehensive tools.
We’re planning to tape out our MLSoC early next year, with a view to delivering engineering samples and potentially customer samples towards the end of next year. However, we’re already working very closely with customers to define and build their applications and map them to our hardware, and the software development kit (SDK) will be available to customers in advance.
This means they’ll be able to work through the flows, develop their applications and run simulations so that when the silicon becomes available it’s simply a case of compile-and-go.
And because MLSoC is grounded in Arm technology, our customers can be sure that they will have the software, tools and ongoing support they need to build not only the next generation but many subsequent generations of highly capable, low power AI devices.