Ambiq’s Low-Power AI Cancels Speech Noise Like Magic

The nice thing about magic is that you need not know how the magic works to apply it. For example, in Harry Potter’s world, Hogwarts students learned to use spells and incantations that invoked magic without knowing the underlying physics (metaphysics?) of that magic. Too science-fictiony for you? Then grok this. You don’t need to understand immersion or EUV lithography to design with the integrated circuits produced by those magical applications of real-world physics. For 99.99% of us, digital logic abstracts away almost everything happening in the real world and leaves us in a near-pristine Boolean universe without atoms, capacitance, inductance, or resistance. Those physical realities continue to exist, but in the world of digital ICs we can simply ignore them and get on with our work, because someone else has abstracted them out of our immediate engineering universe.

Ambiq, the low-power microcontroller company, recently announced new capabilities by combining two forms of engineering magic. The first bit of magic, long mastered by the company, is FET-based logic circuits running at subthreshold supply voltages. If you squint back at your first classes in logic circuits, you’ll recall that we like to use transistors as switches. We like them fully turned on or fully turned off because those are the two states where a transistor normally dissipates the least power when operating from supplies of a few volts. However, there’s another way to achieve low-power operation: use a power supply below the threshold voltage of the FET.
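To see why a lower supply voltage buys so much, recall that the dynamic (switching) power of CMOS logic scales with the square of the supply voltage:

$$P_{dyn} \approx \alpha \, C \, V_{DD}^{2} \, f$$

where $\alpha$ is the switching activity, $C$ is the switched capacitance, and $f$ is the clock frequency. Drop $V_{DD}$ from a nominal volt or so down into subthreshold territory of a few hundred millivolts (an illustrative figure, not an Ambiq spec), and the energy per transition falls by roughly an order of magnitude before you change anything else in the design.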

In a simplistic view of FETs, they ought not to work when operated below the threshold voltage. But they do. Or at least, they can. The FETs still switch, but more slowly. Much more slowly. That’s OK, because many digital ICs do not require multi-gigahertz operation but do need to draw minimal power, and Ambiq specializes in using subthreshold circuitry for one such application: low-power microcontrollers.
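The reason subthreshold gates are slow is the tiny drain current available to charge and discharge each node. Below threshold, that current falls off exponentially with gate voltage:

$$I_D \approx I_0 \, e^{(V_{GS} - V_{th}) / (n V_T)}, \qquad V_T = kT/q \approx 26\ \text{mV at room temperature}$$

where $n$ is a process-dependent slope factor, typically between 1 and 1.5. With orders of magnitude less drive current than a fully turned-on transistor delivers, each gate takes correspondingly longer to switch, which is why subthreshold parts top out in the tens to low hundreds of megahertz rather than gigahertz.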

Microcontrollers have always been slow creatures. Back in the 1970s, they ran at one or a few megahertz – five to ten times slower than the fastest microprocessors of the day. Even today, with all the nanometer lithographic legerdemain available to us, it’s unusual to see microcontrollers running faster than a few hundred megahertz, because they don’t need to be faster.

Ambiq’s latest subthreshold microcontrollers, the Apollo 3 and Apollo 4, made with TSMC’s 40ULP and 22ULL semiconductor processes respectively, come close to but can’t quite attain 100 and 200 MHz operation. Yet that’s plenty fast for these flea-power microcontrollers, which consume mere microwatts per megahertz thanks to their subthreshold circuit design. Even at 100 to 200 MHz, you can still do very useful things with these devices.
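For a sense of scale, take an assumed figure of 5 µW/MHz (an illustrative number, not Ambiq’s published specification):

$$5\ \mu\text{W/MHz} \times 200\ \text{MHz} = 1\ \text{mW}$$

so even running flat out, the core’s active power sits around a milliwatt, comfortably inside the budget of a small wearable battery.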

Which brings us to Ambiq’s second bit of magic. The company has noticed that AI is all the rage these days, so it has been packing some AI application magic into the software it offers for its Apollo 4 low-power microcontroller, hoping to make the device even more attractive to makers of battery-powered products. What kind of products? TechInsights recently extracted an Ambiq Apollo 4 microcontroller with 2 Mbytes of non-volatile, on-chip MRAM from a Fitbit Luxe fitness band and wrote a teardown report. The Fitbit is just the sort of end product that Ambiq has in mind for its low-power microcontrollers, and with its latest magic trick, AI-powered noise cancellation, the company expects to attract developers of more such battery-powered devices that are worn or carried every day.

Now, this isn’t the generative AI, like ChatGPT, that’s been snagging all the headlines. It’s functional AI of the machine-learning (ML) sort. Ambiq has implemented a noise-cancelling neural network (NN) model for speech enhancement as part of its growing library of NN models that run on the company’s own TinyML implementation. TinyML aims to bring real-time ML applications to systems at the extreme edge, where you’ll find battery-powered, microcontroller-based devices that may not even be connected to the Internet. TinyML applications stand in stark contrast to other sorts of ML applications running on GPUs in data centers, where power consumption is measured in kilowatts instead of milliwatts.

Ambiq has developed an ML model zoo that works in conjunction with the company’s neuralSPOT SDK and TinyML inference engine for its Apollo 4 microcontroller. Currently, the Ambiq model zoo contains three ML models:

• NN Speech: A collection of three speech-focused models for voice activity detection, keyword spotting, and speech-to-intent inference
• Arrhythmia Classification: Detects several types of heart conditions based on single-lead ECG sensors
• Speech Enhancement: A TinyLSTM-based audio model that removes noise from speech

The speech-enhancement model, which strips noise from speech, is the latest addition to the Ambiq model zoo. That capability is remarkably useful for many speech applications, including video conferencing and speech recording. In fact, BabbleLabs developed a strikingly effective ML-based speech-enhancement application and demonstrated it back in 2019. That demonstration ran on Nvidia V100 Tensor Core GPUs, which consume far more than one milliwatt, but it was impressive enough that Cisco acquired BabbleLabs in 2020 to add speech enhancement to its Webex offering. Just a few years later, Ambiq can perform the same ML-based magic trick with one of its micropower microcontrollers running in a battery-powered edge device, which is truly amazing.
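Ambiq doesn’t spell out the programming interface in its announcement, but frame-based speech enhancement on a microcontroller generally follows a simple pattern: pull in a short audio frame, run one inference, push the cleaned frame back out. The sketch below illustrates that pattern; every name in it (se_init, se_denoise_frame, the audio I/O stubs) is a hypothetical placeholder, not the actual neuralSPOT or model-zoo API.

```c
/* Hypothetical frame-based denoising loop for a battery-powered device.
 * All names here are illustrative placeholders, not Ambiq's neuralSPOT API. */
#include <stdint.h>
#include <string.h>

#define FRAME_LEN 160   /* 10 ms of 16 kHz, 16-bit PCM audio per inference */

/* Stand-ins for the model: a real port would load the TinyLSTM weights and
 * carry the recurrent state from frame to frame. */
static void se_init(void) { /* load weights, zero LSTM state */ }

static void se_denoise_frame(const int16_t *in, int16_t *out)
{
    /* Placeholder: a real implementation runs one LSTM inference per frame. */
    memcpy(out, in, FRAME_LEN * sizeof(int16_t));
}

/* Stand-ins for the microphone (PDM) input and codec (I2S) output paths. */
static void read_mic_frame(int16_t *buf)        { memset(buf, 0, FRAME_LEN * sizeof(int16_t)); }
static void write_out_frame(const int16_t *buf) { (void)buf; }

int main(void)
{
    int16_t noisy[FRAME_LEN], clean[FRAME_LEN];

    se_init();
    for (;;) {                          /* one inference every 10 ms audio frame */
        read_mic_frame(noisy);
        se_denoise_frame(noisy, clean);
        write_out_frame(clean);
    }
}
```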

Part of the trick is Ambiq’s home-grown version of the TinyML inference engine, TinyEngine, on which the ML models in the Ambiq zoo run. Because resources vary so widely across the microcontroller spectrum, the TinyML community originally conceived TinyEngine as a resource-lite application. Consequently, Ambiq’s TinyEngine implementation runs on just the Apollo 4 microcontroller’s CPU and leverages the Arm Cortex-M4F processor core’s vector math acceleration features. Although the Ambiq Apollo 4 has an on-chip GPU, Ambiq does not use it for the TinyEngine implementation, saying that “embedded GPUs tend to be purpose-built for popular features such as displaying graphics for IoT devices… Embedded GPUs don’t generally support the type of general-purpose compute that you see in data center and smartphone GPUs. Most, if not all, of our customers are using those GPUs to drive better user interfaces such as animated smartwatch displays.” Which is fine, considering that the CPU alone seems perfectly capable of de-noising speech without additional hardware acceleration.
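The article doesn’t open up TinyEngine’s internals, but the vector math acceleration in question comes from the Cortex-M4F’s DSP extensions: single-cycle dual 16-bit multiply-accumulate instructions that Arm exposes through its CMSIS-DSP library. A minimal sketch of the fixed-point dot product that dominates neural-network inner loops might look like this (illustrative only, not TinyEngine source code):

```c
/* Q15 fixed-point dot product, the core operation of most NN layers,
 * expressed with CMSIS-DSP so the Cortex-M4F's dual-MAC instructions do the work. */
#include "arm_math.h"   /* CMSIS-DSP */

#define N 256

q15_t weights[N];       /* Q15 fixed-point layer weights     */
q15_t activations[N];   /* Q15 fixed-point input activations */

q63_t layer_dot_product(void)
{
    q63_t acc;
    /* arm_dot_prod_q15 consumes two Q15 samples per dual multiply-accumulate
     * instruction on Cortex-M4F, accumulating into a 64-bit result. */
    arm_dot_prod_q15(weights, activations, N, &acc);
    return acc;
}
```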

One device that I think could really benefit from this sort of ML-based speech enhancement is my Zoom H1 Portable Digital Recorder. I’ve used this sub-$100 product for more than ten years to record excellent audio for video blogs. The small, handheld recorder runs for hours on one AA battery and can record many hours of sound on a microSD card, captured by a pair of superb electret microphones integrated into the unit. However, one thing that really mars the sound recorded by the Zoom H1 is wind noise, which you currently fight by fitting a little fuzzy cap – colloquially called a “dead cat” – over the microphone end of the recorder. It’s a pain to carry the dead cat around in a little plastic bag, and it’s not always effective.

I can easily envision a future version of the Zoom H1 recorder with a built-in, ML-based denoiser; it’s exactly the sort of product that could benefit from the latest member of Ambiq’s model zoo. Ambiq has posted its neuralSPOT SDK, the TinyEngine inference engine, and the model zoo on GitHub as an aid to development teams using the company’s Apollo 4 microcontroller. If you’ve got to deal with noisy speech, this could be the solution to your problem.
