StreamSpeech: A Simul-S2ST direct speech-to-speech translation model that jointly learns simultaneous translation and policies in a unified multi-task learning framework
Large language models (LLMs) have gained significant attention in the field of simultaneous speech-to-speech translation (SimulS2ST). This technology has become ...