| Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech RecognitionClassification | CodeCode Available | 0 |
| Universal-2-TF: Robust All-Neural Text Formatting for ASR | Jan 10, 2025 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Benchmarking Rotary Position Embeddings for Automatic Speech Recognition | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deep Learning for Pathological Speech: A Survey | Jan 7, 2025 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection | Jan 7, 2025 | Action DetectionActivity Detection | —Unverified | 0 |
| Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models | Jan 6, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Jan 3, 2025 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 |
| Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | Jan 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |