SimulTron: On-Device Simultaneous Speech to Speech Translation Jun 4, 2024 Simultaneous Speech-to-Speech Translation Speech-to-Speech Translation
— Unverified 0Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing Jun 4, 2024 Decoder Language Modeling
— Unverified 0SeamlessExpressiveLM: Speech Language Model for Expressive Speech-to-Speech Translation with Chain-of-Thought May 30, 2024 Language Modeling Language Modelling
— Unverified 0CrossVoice: Crosslingual Prosody Preserving Cascade-S2ST using Transfer Learning May 23, 2024 es-en fr-en
— Unverified 0DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation May 22, 2024 Denoising Noise Estimation
Code Code Available 0MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation Mar 19, 2024 Decoder Language Modeling
— Unverified 0Direct Punjabi to English speech translation using discrete units Feb 25, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0A Case Study on Filtering for End-to-End Speech Translation Feb 2, 2024 Speech-to-Speech Translation Speech-to-Text
— Unverified 0TranSentence: Speech-to-speech Translation via Language-agnostic Sentence-level Speech Encoding without Language-parallel Data Jan 17, 2024 Sentence Speech-to-Speech Translation
— Unverified 0TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation Dec 23, 2023 es-en fr-en
— Unverified 0DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation Oct 26, 2023 Image Generation Speech-to-Speech Translation
— Unverified 0Enhancing expressivity transfer in textless speech-to-speech translation Oct 11, 2023 Self-Supervised Learning Speech-to-Speech Translation
— Unverified 0Direct Text to Speech Translation System using Acoustic Units Sep 14, 2023 Decoder Speech-to-Speech Translation
— Unverified 0Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer Sep 14, 2023 In-Context Learning Language Modeling
— Unverified 0Multilingual Speech-to-Speech Translation into Multiple Target Languages Jul 17, 2023 Language Identification Speech-to-Speech Translation
— Unverified 0Towards cross-language prosody transfer for dialog Jul 9, 2023 Speech-to-Speech Translation Translation
Code Code Available 0AudioPaLM: A Large Language Model That Can Speak and Listen Jun 22, 2023 Language Modeling Language Modelling
— Unverified 0PolyVoice: Language Models for Speech to Speech Translation Jun 5, 2023 Language Modeling Language Modelling
— Unverified 0Translatotron 3: Speech to Speech Translation with Monolingual Data May 27, 2023 Speech-to-Speech Translation Translation
— Unverified 0Textless Speech-to-Speech Translation With Limited Parallel Data May 24, 2023 Automatic Speech Recognition Denoising
Code Code Available 0AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation May 24, 2023 Speech-to-Speech Translation Translation
— Unverified 0i-Code Studio: A Configurable and Composable Framework for Integrative AI May 23, 2023 Question Answering Retrieval
— Unverified 0Duplex Diffusion Models Improve Speech-to-Speech Translation May 22, 2023 Speech-to-Speech Translation Translation
— Unverified 0ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit Apr 10, 2023 Benchmarking Simultaneous Speech-to-Text Translation
Code Code Available 0Enhancing Speech-to-Speech Translation with Multiple TTS Targets Apr 10, 2023 Speech-to-Speech Translation Speech-to-Text
— Unverified 0A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation Jan 25, 2023 Speech-to-Speech Translation Translation
— Unverified 0UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units Dec 15, 2022 Decoder Denoising
Code Code Available 0Direct Speech-to-speech Translation without Textual Annotation using Bottleneck Features Dec 12, 2022 Speech-to-Speech Translation Translation
— Unverified 0Dialogs Re-enacted Across Languages Nov 18, 2022 Speech-to-Speech Translation Translation
Code Code Available 0Speech-to-Speech Translation For A Real-world Unwritten Language Nov 11, 2022 Speech-to-Speech Translation Translation
— Unverified 0SpeechMatrix: A Large-Scale Mined Corpus of Multilingual Speech-to-Speech Translations Nov 8, 2022 Mixture-of-Experts Speech-to-Speech Translation
— Unverified 0Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation Oct 31, 2022 Speech-to-Speech Translation Translation
Code Code Available 0Textless Direct Speech-to-Speech Translation with Discrete Speech Representation Oct 31, 2022 Speech-to-Speech Translation Translation
— Unverified 0Improving Speech-to-Speech Translation Through Unlabeled Text Oct 26, 2022 Machine Translation speech-recognition
— Unverified 0A Textless Metric for Speech-to-Speech Comparison Oct 21, 2022 Sentence Speech-to-Speech Translation
Code Code Available 0Augmentation Invariant Discrete Representation for Generative Spoken Language Modeling Sep 30, 2022 Language Modeling Language Modelling
— Unverified 0SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation May 17, 2022 Representation Learning Retrieval
— Unverified 0Findings of the IWSLT 2022 Evaluation Campaign May 1, 2022 Speech-to-Speech Translation Translation
— Unverified 0Pretrained Speech Encoders and Efficient Fine-tuning Methods for Speech Translation: UPC at IWSLT 2022 May 1, 2022 Decoder Knowledge Distillation
Code Code Available 0The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation May 1, 2022 Machine Translation Reranking
— Unverified 0MLLP-VRAIN UPV systems for the IWSLT 2022 Simultaneous Speech Translation and Speech-to-Speech Translation tasks May 1, 2022 Simultaneous Speech-to-Text Translation Speech-to-Speech Translation
— Unverified 0LibriS2S: A German-English Speech-to-Speech Translation Corpus Apr 22, 2022 Speech-to-Speech Translation Speech-to-Text
Code Code Available 0Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation Apr 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Prosodic Alignment for off-screen automatic dubbing Apr 6, 2022 Speech-to-Speech Translation Translation
— Unverified 0Leveraging unsupervised and weakly-supervised data to improve direct speech-to-speech translation Mar 24, 2022 Representation Learning Speech Representation Learning
— Unverified 0Evaluating MT Systems: A Theoretical Framework Feb 11, 2022 Machine Translation Speech-to-Speech Translation
— Unverified 0Textless Speech-to-Speech Translation on Real Data Dec 15, 2021 Speech-to-Speech Translation Translation
— Unverified 0Multimodal and Multilingual Embeddings for Large-Scale Speech Mining Dec 1, 2021 Speech-to-Speech Translation Translation
Code Code Available 0Assessing Evaluation Metrics for Speech-to-Speech Translation Oct 26, 2021 Machine Translation Open-Ended Question Answering
— Unverified 0From Start to Finish: Latency Reduction Strategies for Incremental Speech Synthesis in Simultaneous Speech-to-Speech Translation Oct 15, 2021 Data Augmentation Simultaneous Speech-to-Speech Translation
— Unverified 0