Using a vision-inspired keyword detection framework, we propose an architecture with input-dependent dynamic depth capable of processing streaming audio. Specifically, we extend a Conformer encoder with trainable binary gates that allow the network to dynamically bypass modules depending on the input audio. Our approach improves detection and localization accuracy on continuous speech using the 1000 most frequent words from LibriSpeech, while maintaining a small memory footprint. The gates also reduce the average amount of processing without degrading overall performance. These benefits are even more pronounced on Google Speech Commands superimposed on background noise, where up to 97% of processing is skipped on non-speech inputs, making our method particularly attractive for an always-on keyword spotter.
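The gating mechanism described above can be sketched as follows. This is a minimal, hypothetical illustration in PyTorch, not the paper's exact architecture: the gate predictor (a single linear layer over a time-pooled representation), the 0.5 threshold, and the use of a straight-through estimator are all assumptions made for the sake of the example.

```python
import torch
import torch.nn as nn

class GatedBlock(nn.Module):
    """Wraps a sub-module with an input-dependent binary gate.

    When the gate evaluates to 0 for an input, the sub-module is bypassed
    via an identity skip; when it evaluates to 1, the sub-module runs.
    """

    def __init__(self, module: nn.Module, dim: int):
        super().__init__()
        self.module = module
        # Tiny gate predictor (hypothetical design choice).
        self.gate_proj = nn.Linear(dim, 1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, dim); pool over time for a per-utterance decision.
        logit = self.gate_proj(x.mean(dim=1))   # (batch, 1)
        soft = torch.sigmoid(logit)
        hard = (soft > 0.5).float()
        # Straight-through estimator: hard 0/1 decision in the forward pass,
        # gradient of the soft sigmoid in the backward pass.
        gate = hard + soft - soft.detach()
        gate = gate.unsqueeze(-1)               # (batch, 1, 1)
        return gate * self.module(x) + (1 - gate) * x

# Example: gate a small feed-forward block inside a 64-dim encoder.
block = GatedBlock(
    nn.Sequential(nn.Linear(64, 64), nn.ReLU(), nn.Linear(64, 64)), dim=64
)
y = block(torch.randn(2, 10, 64))
print(y.shape)
```

At inference time, an actual implementation would branch on the hard gate and skip computing `self.module(x)` entirely when the gate is 0, which is where the compute savings on non-speech inputs come from; the dense formulation above is only needed during training so that gradients flow through both paths.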