| Efficient Speech Translation with Dynamic Latent Perceivers | Oct 28, 2022 | Speech-to-TextSpeech-to-Text Translation | CodeCode Available | 0 |
| Careless Whisper: Speech-to-Text Hallucination Harms | Feb 12, 2024 | HallucinationLanguage Modeling | CodeCode Available | 0 |
| Re-Translation Strategies For Long Form, Simultaneous, Spoken Language Translation | Dec 6, 2019 | FormMachine Translation | CodeCode Available | 0 |
| Towards End-to-end Speech-to-text Summarization | Jun 6, 2023 | Abstractive Text SummarizationSpeech-to-Text | CodeCode Available | 0 |
| Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation | Oct 24, 2022 | SegmentationSpeech-to-Text | CodeCode Available | 0 |
| Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Synchronous Speech Recognition and Speech-to-Text Translation with Interactive Decoding | Dec 16, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Mar 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Dec 19, 2022 | Contrastive LearningSpeech-to-Text | CodeCode Available | 0 |
| Transformer-Based Named Entity Recognition for Automated Server Provisioning | Apr 1, 2025 | named-entity-recognitionNamed Entity Recognition | CodeCode Available | 0 |