Improving Vision-Inspired Keyword Detection Using a Streaming Conformal Encoder with Input-Dependent Dynamic Depth
Using a vision-inspired keyword detection framework, we propose an architecture with input-dependent dynamic depth capable of processing audio streaming. Specifically, ...