Using Phonemes in cascaded S2S translation pipeline

2025-04-22Code Available0· sign in to hype

Rene Pilz, Johannes Schneider

Code Available — Be the first to reproduce this paper.

Code

github.com/fungus75/phonemes_s2s_pipeline
OfficialIn papernone★ 0

Abstract

This paper explores the idea of using phonemes as a textual representation within a conventional multilingual simultaneous speech-to-speech translation pipeline, as opposed to the traditional reliance on text-based language representations. To investigate this, we trained an open-source sequence-to-sequence model on the WMT17 dataset in two formats: one using standard textual representation and the other employing phonemic representation. The performance of both approaches was assessed using the BLEU metric. Our findings shows that the phonemic approach provides comparable quality but offers several advantages, including lower resource requirements or better suitability for low-resource languages.

Tasks

Simultaneous Speech-to-Speech Translation Speech-to-Speech Translation Translation

Using Phonemes in cascaded S2S translation pipeline

Code

Abstract

Tasks

Reproductions