| Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients | May 27, 2024 | Automatic Speech Recognition, Federated Learning | Code Available | 0 | 5 |
| Multi-Stage Speaker Diarization for Noisy Classrooms | May 16, 2025 | Action Detection, Activity Detection | Code Available | 0 | 5 |
| Fine-Grained Grounding for Multimodal Speech Recognition | Oct 5, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech Recognition, Classification | Code Available | 0 | 5 |
| Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR | May 29, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Exploring Generative Error Correction for Dysarthric Speech Recognition | May 26, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization | Feb 28, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization | Jun 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit | Oct 24, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks | May 5, 2023 | Automatic Speech Recognition, Cultural Vocal Bursts Intensity Prediction | Code Available | 0 | 5 |