NAIST Simultaneous Speech Translation System for IWSLT 2024 Jun 30, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling Sep 30, 2022 Language Modeling Language Modelling
— Unverified 0Phi-Omni-ST: A multimodal language model for direct speech-to-speech translation Jun 4, 2025 Language Modeling Language Modelling
— Unverified 0PolySinger: Singing-Voice to Singing-Voice Translation from English to Japanese Jul 19, 2024 Singing Voice Synthesis Speech-to-Speech Translation
— Unverified 0PolyVoice: Language Models for Speech to Speech Translation Jun 5, 2023 Language Modeling Language Modelling
— Unverified 0Portable Speech-to-Speech Translation on an Android Smartphone: The MFLTS System Mar 1, 2018 Speech Recognition Speech-to-Speech Translation
— Unverified 0Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems Jul 18, 2024 Speech-to-Speech Translation Voice Cloning
— Unverified 0What does it take to get state of the art in simultaneous speech-to-speech translation? Sep 2, 2024 Hallucination Management
— Unverified 0A Case Study on Filtering for End-to-End Speech Translation Feb 2, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation Jan 25, 2023 Speech-to-Speech Translation Translation
— Unverified 0Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation Jul 8, 2024 Automatic Speech Recognition Emotion Recognition
— Unverified 0Assessing Evaluation Metrics for Speech-to-Speech Translation Oct 26, 2021 Machine Translation Open-Ended Question Answering
— Unverified 0AudioPaLM: A Large Language Model That Can Speak and Listen Jun 22, 2023 Language Modeling Language Modelling
— Unverified 0A Unit-based System and Dataset for Expressive Direct Speech-to-Speech Translation Feb 1, 2025 Speech-to-Speech Translation Translation
— Unverified 0Automatic Extraction of Parallel Speech Corpora from Dubbed Movies Aug 1, 2017 Speech-to-Speech Translation Translation
— Unverified 0AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation May 24, 2023 Speech-to-Speech Translation Translation
— Unverified 0Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM Feb 24, 2025 Automatic Speech Recognition Language Modeling
— Unverified 0Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? Jun 11, 2024 Contrastive Learning Speech Synthesis
— Unverified 0Connecting Voices: LoReSpeech as a Low-Resource Speech Parallel Corpus Feb 25, 2025 Speech-to-Speech Translation Translation
— Unverified 0Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis Nov 4, 2020 Machine Translation speech-recognition
— Unverified 0CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning May 23, 2024 es-en fr-en
— Unverified 0DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Oct 26, 2023 Image Generation Speech-to-Speech Translation
— Unverified 0Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation Jun 14, 2024 Speech-to-Speech Translation Translation
— Unverified 0Direct Punjabi to English speech translation using discrete units Feb 25, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention Oct 15, 2021 Simultaneous Speech-to-Speech Translation Speech Synthesis
— Unverified 0Direct Speech-to-Speech Neural Machine Translation: A Survey Nov 13, 2024 Machine Translation Speech-to-Speech Translation
— Unverified 0Direct Speech to Speech Translation: A Review Mar 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features Dec 12, 2022 Speech-to-Speech Translation Translation
— Unverified 0Direct Text to Speech Translation System using Acoustic Units Sep 14, 2023 Decoder Speech-to-Speech Translation
— Unverified 0Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing Jun 4, 2024 Decoder Language Modeling
— Unverified 0Textless Speech-to-Speech Translation on Real Data Dec 15, 2021 Speech-to-Speech Translation Translation
— Unverified 0Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens Oct 4, 2024 Language Modeling Language Modelling
— Unverified 0The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation May 1, 2022 Machine Translation Reranking
— Unverified 0Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System May 1, 2014 Machine Translation speech-recognition
— Unverified 0TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data Jan 17, 2024 Sentence Speech-to-Speech Translation
— Unverified 0TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation Dec 23, 2023 es-en fr-en
— Unverified 0Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Jul 19, 2021 Data Augmentation Decoder
— Unverified 0Translatotron 3: Speech to Speech Translation with Monolingual Data May 27, 2023 Speech-to-Speech Translation Translation
— Unverified 0UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units Dec 15, 2022 Decoder Denoising
— Unverified 0UWSpeech: Speech to Speech Translation for Unwritten Languages Jun 14, 2020 speech-recognition Speech Recognition
— Unverified 0Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs Jun 12, 2025 Speech-to-Speech Translation text-to-speech
— Unverified 0Prosodic Alignment for off-screen automatic dubbing Apr 6, 2022 Speech-to-Speech Translation Translation
— Unverified 0Real-time Incremental Speech-to-Speech Translation of Dialogs Jun 1, 2012 Machine Translation Speech Recognition
— Unverified 0S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation Jun 11, 2025 Reading Comprehension Speech Synthesis
— Unverified 0SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation May 17, 2022 Representation Learning Retrieval
— Unverified 0SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought May 30, 2024 Language Modeling Language Modelling
— Unverified 0SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation Apr 22, 2025 Simultaneous Speech-to-Speech Translation Speech-to-Speech Translation
— Unverified 0Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SimulTron: On-Device Simultaneous Speech to Speech Translation Jun 4, 2024 Simultaneous Speech-to-Speech Translation Speech-to-Speech Translation
— Unverified 0SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations Nov 8, 2022 Mixture-of-Experts Speech-to-Speech Translation
— Unverified 0