| NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech | Jul 17, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine | Jul 17, 2025 | Audio ClassificationAutomatic Speech Recognition | —Unverified | 0 |
| WhisperKit: On-device Real-time ASR with Billion-Scale Transformers | Jul 14, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech RecognitionLip Reading | —Unverified | 0 |
| MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement | Jul 1, 2025 | Automatic Speech RecognitionMamba | CodeCode Available | 2 |
| Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR | Jun 25, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AI-Generated Song Detection via Lyrics Transcripts | Jun 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| End-to-End Spoken Grammatical Error Correction | Jun 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices | Jun 22, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition | Jun 20, 2025 | Automatic Speech RecognitionDiversity | —Unverified | 0 |
| LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization | Jun 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages | Jun 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Speech Recognition Biases in Newcastle English: an Error Analysis | Jun 19, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unifying Streaming and Non-streaming Zipformer-based ASR | Jun 17, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios | Jun 17, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025 | Jun 16, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| BUT System for the MLC-SLM Challenge | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models | Jun 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform | Jun 13, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| (SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test | Jun 13, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enabling automatic transcription of child-centered audio recordings from real-world environments | Jun 13, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms | Jun 12, 2025 | Automatic Speech RecognitionKeyword Spotting | CodeCode Available | 0 |
| Joint ASR and Speaker Role Tagging with Serialized Output Training | Jun 12, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition | Jun 12, 2025 | Automatic Speech RecognitionContrastive Learning | —Unverified | 0 |
| Improving Named Entity Transcription with Contextual LLM-based Revision | Jun 12, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary | Jun 11, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Regularizing Learnable Feature Extraction for Automatic Speech Recognition | Jun 11, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia | Jun 10, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research | Jun 10, 2025 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech | Jun 9, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition | Jun 9, 2025 | Automatic Speech RecognitionMulti-Task Learning | —Unverified | 0 |
| Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation | Jun 9, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Unified Semi-Supervised Pipeline for Automatic Speech Recognition | Jun 9, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Speech Recognition on TV Series with Video-guided Post-Correction | Jun 8, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Speech Recognition of African American English: Lexical and Contextual Effects | Jun 7, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition | Jun 6, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models | Jun 6, 2025 | Automatic Speech Recognitionspeaker-diarization | —Unverified | 0 |
| AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition | Jun 6, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning | Jun 6, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems | Jun 6, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Customizing Speech Recognition Model with Large Language Model Feedback | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LLM-based phoneme-to-grapheme for phoneme-based speech recognition | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models | Jun 5, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR | Jun 4, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation | Jun 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning | Jun 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss | Jun 3, 2025 | Automatic Lyrics TranscriptionAutomatic Speech Recognition | —Unverified | 0 |
| DNCASR: End-to-End Training for Speaker-Attributed ASR | Jun 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |