While automatic speech recognition (ASR) systems are widely used in many real-world applications, they often generalize poorly to new domains and must be fine-tuned with data from those domains. In many scenarios, however, no data from the target domain is available. In this paper, we propose a new strategy for adapting ASR models to new target domains without any text or speech from those domains. To achieve this, we design a novel data synthesis pipeline that uses a large language model (LLM) to generate a target-domain text corpus and a state-of-the-art controllable speech synthesis model to generate the corresponding speech. We further introduce a simple yet effective instruction-in-context adjustment strategy that improves the LLM's effectiveness in generating text corpora for new domains. Experiments on the SLURP dataset show that the proposed method achieves an average relative word error rate improvement of 28% on unseen target domains without any performance degradation on source domains.