| Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search | Jan 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | — Unverified | 0 |
| SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition | Jan 18, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | — Unverified | 0 |
| AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition | Jan 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | — Unverified | 0 |
| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | Jan 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | — Unverified | 0 |
| On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement | Jan 17, 2024 | Automatic Speech Recognition, Speech Enhancement | — Unverified | 0 |
| Improving ASR Contextual Biasing with Guided Attention | Jan 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | — Unverified | 0 |
| Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization | Jan 16, 2024 | Action Detection, Activity Detection | — Unverified | 0 |
| NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription | Jan 16, 2024 | Automatic Speech Recognition, Benchmarking | — Unverified | 0 |
| SeMaScore : a new evaluation metric for automatic speech recognition tasks | Jan 15, 2024 | Automatic Speech Recognition, speech-recognition | — Unverified | 0 |
| Cascaded Cross-Modal Transformer for Audio-Textual Classification | Jan 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |