This article was accepted into the Industry Track at SIGIR 2024.
Virtual assistants (VAs) are important information retrieval platforms that help users perform various tasks using spoken commands. The speech recognition (speech-to-text) component relies on a language model, trained solely on text from previous queries, to distinguish between phonetically confusable alternatives. Therefore, generating synthetic queries that resemble existing VA usage can greatly improve VA capabilities, especially for use cases that do not (yet) occur in paired audio/text data.
In this article, we provide a preliminary exploration of using large language models (LLMs) to generate synthetic queries that are complementary to template-based methods. We investigate whether these methods (a) generate queries similar to those issued by users of a popular VA and (b) generate queries that are specific. We find that, compared to template-based methods, LLMs generate more detailed queries that reference specific aspects of the entity. The generated queries are similar to VA user queries and are specific enough to retrieve the relevant entity. We conclude that queries generated by LLMs and by templates are complementary.
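To make the contrast concrete, the sketch below illustrates the two generation strategies at a high level. The carrier phrases, prompt wording, and function names are illustrative assumptions, not the paper's actual implementation; the LLM call itself is omitted.

```python
def template_queries(entity: str) -> list[str]:
    """Template-based generation: expand an entity into queries
    using fixed carrier phrases (illustrative templates)."""
    templates = [
        "play {e}",
        "play songs by {e}",
        "what is the latest album by {e}",
    ]
    return [t.format(e=entity) for t in templates]


def llm_prompt(entity: str, description: str) -> str:
    """LLM-based generation: build a prompt asking for entity-specific
    queries. The returned string would be sent to an LLM, which can
    reference details from the description; the call is omitted here."""
    return (
        f"Generate 3 queries a virtual assistant user might ask about "
        f"the musician {entity}. Context: {description}"
    )


print(template_queries("Taylor Swift"))
print(llm_prompt("Taylor Swift", "released the album Midnights in 2022"))
```

Template expansion guarantees well-formed, in-domain queries but cannot mention entity-specific facts; the LLM prompt can surface such details (e.g. an album name from the context), which is why the two approaches complement each other.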