| NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech | Jul 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine | Jul 17, 2025 | Audio Classification, Automatic Speech Recognition | Unverified | 0 |
| WhisperKit: On-device Real-time ASR with Billion-Scale Transformers | Jul 14, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech Recognition, Lip Reading | Unverified | 0 |
| MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement | Jul 1, 2025 | Automatic Speech Recognition, Mamba | Code Available | 2 |
| Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR | Jun 25, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-End Spoken Grammatical Error Correction | Jun 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| AI-Generated Song Detection via Lyrics Transcripts | Jun 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices | Jun 22, 2025 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization | Jun 20, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages | Jun 20, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition | Jun 20, 2025 | Automatic Speech Recognition, Diversity | Unverified | 0 |
| Automatic Speech Recognition Biases in Newcastle English: an Error Analysis | Jun 19, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Unifying Streaming and Non-streaming Zipformer-based ASR | Jun 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios | Jun 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025 | Jun 16, 2025 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| BUT System for the MLC-SLM Challenge | Jun 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models | Jun 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform | Jun 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enabling automatic transcription of child-centered audio recordings from real-world environments | Jun 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| (SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test | Jun 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms | Jun 12, 2025 | Automatic Speech Recognition, Keyword Spotting | Code Available | 0 |
| Joint ASR and Speaker Role Tagging with Serialized Output Training | Jun 12, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition | Jun 12, 2025 | Automatic Speech Recognition, Contrastive Learning | Unverified | 0 |