| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech Recognition, Robust Speech Recognition | Code Available | 2 |
| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | Feb 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction | Feb 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Characterizing Financial Market Coverage using Artificial Intelligence | Feb 7, 2023 | Speech-to-Text | Unverified | 0 |
| PSST! Prosodic Speech Segmentation with Transformers | Feb 3, 2023 | Segmentation, Speech-to-Text | Code Available | 1 |
| Pre-training for Speech Translation: CTC Meets Optimal Transport | Jan 27, 2023 | Multi-Task Learning, Speech-to-Text | Code Available | 1 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | Jan 6, 2023 | Domain Adaptation, GPU | Unverified | 0 |
| Pushing the performances of ASR models on English and Spanish accents | Dec 22, 2022 | Speech-to-Text | Unverified | 0 |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Dec 19, 2022 | Contrastive Learning, Speech-to-Text | Code Available | 0 |
| M3ST: Mix at Three Levels for Speech Translation | Dec 7, 2022 | Data Augmentation, Diversity | Unverified | 0 |
| MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition | Nov 29, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | Nov 28, 2022 | named-entity-recognition, Named Entity Recognition | Unverified | 0 |
| Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search | Oct 31, 2022 | Emotion Recognition, Neural Architecture Search | Unverified | 0 |
| Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | Oct 29, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Efficient Speech Translation with Dynamic Latent Perceivers | Oct 28, 2022 | Speech-to-Text, Speech-to-Text Translation | Code Available | 0 |
| Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation | Oct 24, 2022 | Segmentation, Speech-to-Text | Code Available | 0 |
| Information-Transport-based Policy for Simultaneous Translation | Oct 22, 2022 | Machine Translation, Speech-to-Text | Code Available | 1 |
| Named Entity Detection and Injection for Direct Speech Translation | Oct 21, 2022 | Sentence, Speech-to-Text | Unverified | 0 |
| Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses | Oct 20, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Simple and Effective Unsupervised Speech Translation | Oct 18, 2022 | Domain Adaptation, Machine Translation | Unverified | 0 |
| Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | Oct 13, 2022 | Generative Adversarial Network, Speaker anonymization | Code Available | 0 |
| CTC Alignments Improve Autoregressive Translation | Oct 11, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training | Oct 7, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT | Oct 5, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Speech-to-Text and Evaluation of Multiple Machine Translation Systems | Sep 1, 2022 | Machine Translation, Speech-to-Text | Unverified | 0 |