AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation May 24, 2023 Speech-to-Speech Translation Translation
— Unverified 0Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM Feb 24, 2025 Automatic Speech Recognition Language Modeling
— Unverified 0Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data? Jun 11, 2024 Contrastive Learning Speech Synthesis
— Unverified 0Connecting Voices: LoReSpeech as a Low-Resource Speech Parallel Corpus Feb 25, 2025 Speech-to-Speech Translation Translation
— Unverified 0Cross-Lingual Machine Speech Chain for Javanese, Sundanese, Balinese, and Bataks Speech Recognition and Synthesis Nov 4, 2020 Machine Translation speech-recognition
— Unverified 0CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning May 23, 2024 es-en fr-en
— Unverified 0DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Oct 26, 2023 Image Generation Speech-to-Speech Translation
— Unverified 0Diffusion Synthesizer for Efficient Multilingual Speech to Speech Translation Jun 14, 2024 Speech-to-Speech Translation Translation
— Unverified 0Direct Punjabi to English speech translation using discrete units Feb 25, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0Direct Simultaneous Speech-to-Speech Translation with Variational Monotonic Multihead Attention Oct 15, 2021 Simultaneous Speech-to-Speech Translation Speech Synthesis
— Unverified 0Direct Speech-to-Speech Neural Machine Translation: A Survey Nov 13, 2024 Machine Translation Speech-to-Speech Translation
— Unverified 0Direct Speech to Speech Translation: A Review Mar 3, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features Dec 12, 2022 Speech-to-Speech Translation Translation
— Unverified 0Direct Text to Speech Translation System using Acoustic Units Sep 14, 2023 Decoder Speech-to-Speech Translation
— Unverified 0Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing Jun 4, 2024 Decoder Language Modeling
— Unverified 0Speech-to-speech Translation between Untranscribed Unknown Languages Oct 2, 2019 Speech-to-Speech Translation Translation
— Unverified 0Speech-to-Speech Translation For A Real-world Unwritten Language Nov 11, 2022 Speech-to-Speech Translation Translation
— Unverified 0Speech to Speech Translation with Translatotron: A State of the Art Review Feb 9, 2025 speech-recognition Speech Recognition
— Unverified 0Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer Sep 14, 2023 In-Context Learning Language Modeling
— Unverified 0Textless Acoustic Model with Self-Supervised Distillation for Noise-Robust Expressive Speech-to-Speech Translation Jun 4, 2024 Speech-to-Speech Translation Translation
— Unverified 0Textless Direct Speech-to-Speech Translation with Discrete Speech Representation Oct 31, 2022 Speech-to-Speech Translation Translation
— Unverified 0Textless Speech-to-Speech Translation on Real Data Dec 15, 2021 Speech-to-Speech Translation Translation
— Unverified 0Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens Oct 4, 2024 Language Modeling Language Modelling
— Unverified 0The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation May 1, 2022 Machine Translation Reranking
— Unverified 0Towards Multilingual Conversations in the Medical Domain: Development of Multilingual Medical Data and A Network-based ASR System May 1, 2014 Machine Translation speech-recognition
— Unverified 0TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data Jan 17, 2024 Sentence Speech-to-Speech Translation
— Unverified 0TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation Dec 23, 2023 es-en fr-en
— Unverified 0Translatotron 2: High-quality direct speech-to-speech translation with voice preservation Jul 19, 2021 Data Augmentation Decoder
— Unverified 0Translatotron 3: Speech to Speech Translation with Monolingual Data May 27, 2023 Speech-to-Speech Translation Translation
— Unverified 0UWSpeech: Speech to Speech Translation for Unwritten Languages Jun 14, 2020 speech-recognition Speech Recognition
— Unverified 0Scheduled Interleaved Speech-Text Training for Speech-to-Speech Translation with LLMs Jun 12, 2025 Speech-to-Speech Translation text-to-speech
— Unverified 0Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation Mar 24, 2022 Representation Learning Speech Representation Learning
— Unverified 0MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks May 1, 2022 Simultaneous Speech-to-Text Translation Speech-to-Speech Translation
— Unverified 0MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation Mar 19, 2024 Decoder Language Modeling
— Unverified 0Multilingual Speech-to-Speech Translation into Multiple Target Languages Jul 17, 2023 Language Identification Speech-to-Speech Translation
— Unverified 0NAIST Simultaneous Speech Translation System for IWSLT 2024 Jun 30, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling Sep 30, 2022 Language Modeling Language Modelling
— Unverified 0Phi-Omni-ST: A multimodal language model for direct speech-to-speech translation Jun 4, 2025 Language Modeling Language Modelling
— Unverified 0PolySinger: Singing-Voice to Singing-Voice Translation from English to Japanese Jul 19, 2024 Singing Voice Synthesis Speech-to-Speech Translation
— Unverified 0PolyVoice: Language Models for Speech to Speech Translation Jun 5, 2023 Language Modeling Language Modelling
— Unverified 0Portable Speech-to-Speech Translation on an Android Smartphone: The MFLTS System Mar 1, 2018 Speech Recognition Speech-to-Speech Translation
— Unverified 0Preset-Voice Matching for Privacy Regulated Speech-to-Speech Translation Systems Jul 18, 2024 Speech-to-Speech Translation Voice Cloning
— Unverified 0Prosodic Alignment for off-screen automatic dubbing Apr 6, 2022 Speech-to-Speech Translation Translation
— Unverified 0Real-time Incremental Speech-to-Speech Translation of Dialogs Jun 1, 2012 Machine Translation Speech Recognition
— Unverified 0S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamless Speech-Text Alignment and Streaming Speech Generation Jun 11, 2025 Reading Comprehension Speech Synthesis
— Unverified 0SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation May 17, 2022 Representation Learning Retrieval
— Unverified 0SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought May 30, 2024 Language Modeling Language Modelling
— Unverified 0SimulS2S-LLM: Unlocking Simultaneous Inference of Speech LLMs for Speech-to-Speech Translation Apr 22, 2025 Simultaneous Speech-to-Speech Translation Speech-to-Speech Translation
— Unverified 0Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS Nov 10, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SimulTron: On-Device Simultaneous Speech to Speech Translation Jun 4, 2024 Simultaneous Speech-to-Speech Translation Speech-to-Speech Translation
— Unverified 0