| Using Phonemes in cascaded S2S translation pipeline | Apr 22, 2025 | Simultaneous Speech-to-Speech Translation, Speech-to-Speech Translation | Code Available | 0 |
| SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation | Apr 22, 2025 | Simultaneous Speech-to-Speech Translation, Speech-to-Speech Translation | Unverified | 0 |
| High-Fidelity Simultaneous Speech-To-Speech Translation | Feb 5, 2025 | Decoder, Simultaneous Speech-to-Speech Translation | Code Available | 5 |
| What does it take to get state of the art in simultaneous speech-to-speech translation? | Sep 2, 2024 | Hallucination, Management | Unverified | 0 |
| A Non-autoregressive Generation Framework for End-to-End Simultaneous Speech-to-Speech Translation | Jun 11, 2024 | Decoder, Simultaneous Speech-to-Speech Translation | Code Available | 2 |
| StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning | Jun 5, 2024 | Automatic Speech Recognition (ASR), de-en | Code Available | 5 |
| SimulTron: On-Device Simultaneous Speech to Speech Translation | Jun 4, 2024 | Simultaneous Speech-to-Speech Translation, Speech-to-Speech Translation | Unverified | 0 |
| Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models | Jun 1, 2023 | Simultaneous Speech-to-Speech Translation, Speech-to-Speech Translation | Code Available | 1 |
| From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation | Oct 15, 2021 | Data Augmentation, Simultaneous Speech-to-Speech Translation | Unverified | 0 |
| Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention | Oct 15, 2021 | Simultaneous Speech-to-Speech Translation, Speech Synthesis | Unverified | 0 |