Human evaluation is a critical component in the development of machine translation systems and has received much attention in text translation research. However, there is little prior work on human evaluation for speech translation, which introduces additional challenges such as noisy data and segmentation discrepancies. We take the first steps to fill this gap by conducting a comprehensive human evaluation of the results of several shared tasks from the latest International Workshop on Spoken Language Translation (IWSLT 2023). We propose an effective evaluation strategy based on automatic resegmentation and direct assessment with segment context. Our analysis reveals that: 1) the proposed evaluation strategy is robust and correlates well with other types of human judgments; 2) automatic metrics are often, but not always, well correlated with direct assessment scores; and 3) COMET is a slightly stronger automatic metric than chrF, despite the segmentation noise introduced by the resegmentation step. We publish the collected human-annotated data to encourage further research.