| Enhancing Speech-to-Speech Translation with Multiple TTS Targets | Apr 10, 2023 | Speech-to-Speech Translation, Speech-to-Text | Unverified | 0 |
| Natural Language Robot Programming: NLP integrated with autonomous robotic grasping | Apr 6, 2023 | Robotic Grasping, Speech-to-Text | Unverified | 0 |
| Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R | Mar 31, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| wav2vec and its current potential to Automatic Speech Recognition in German for the usage in Digital History: A comparative assessment of available ASR-technologies for the use in cultural heritage contexts | Mar 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | Mar 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech Recognition, Robust Speech Recognition | Code Available | 2 |
| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | Feb 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction | Feb 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Characterizing Financial Market Coverage using Artificial Intelligence | Feb 7, 2023 | Speech-to-Text | Unverified | 0 |
| PSST! Prosodic Speech Segmentation with Transformers | Feb 3, 2023 | Segmentation, Speech-to-Text | Code Available | 1 |