| Deep Learning for Audio Signal Processing | Apr 30, 2019 | Audio Signal Processing, Automatic Speech Recognition | Code Available | 0 | 5 |
| Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain | Feb 23, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Data augmentation using prosody and false starts to recognize non-native children's speech | Aug 29, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition | Nov 19, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| DiaCorrect: End-to-end error correction for speaker diarization | Oct 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models | Mar 29, 2024 | Automatic Speech Recognition, speech-recognition | Code Available | 0 | 5 |
| A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AI | Dec 1, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Beyond Levenshtein: Leveraging Multiple Algorithms for Robust Word Error Rate Computations And Granular Error Classifications | Aug 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding | Jul 29, 2015 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube | Apr 29, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |