| persoDA: Personalized Data Augmentation for Personalized ASR | Jan 15, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adapting Whisper for Regional Dialects: Enhancing Public Services for Vulnerable Populations in the United Kingdom | Jan 15, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Non-autoregressive Model for Joint STT and TTS | Jan 15, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Selective Attention Merging for low resource tasks: A case study of Child ASR | Jan 14, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications | Jan 14, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Jan 13, 2025 | Automatic Speech Recognitionintent-classification | CodeCode Available | 0 |
| AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR | Jan 13, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives | Jan 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Survey on Spoken Italian Datasets and Corpora | Jan 11, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Discrete Speech Unit Extraction via Independent Component Analysis | Jan 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech RecognitionClassification | CodeCode Available | 0 |
| Universal-2-TF: Robust All-Neural Text Formatting for ASR | Jan 10, 2025 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Benchmarking Rotary Position Embeddings for Automatic Speech Recognition | Jan 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deep Learning for Pathological Speech: A Survey | Jan 7, 2025 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection | Jan 7, 2025 | Action DetectionActivity Detection | —Unverified | 0 |
| Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models | Jan 6, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Jan 3, 2025 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 |
| Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | Jan 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models | Jan 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | Jan 1, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition | Jan 1, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing | Jan 1, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Jan 1, 2025 | Automatic Speech RecognitionDecoder | CodeCode Available | 1 |