| NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech | Jul 17, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine | Jul 17, 2025 | Audio ClassificationAutomatic Speech Recognition | —Unverified | 0 |
| WhisperKit: On-device Real-time ASR with Billion-Scale Transformers | Jul 14, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech RecognitionLip Reading | —Unverified | 0 |
| MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement | Jul 1, 2025 | Automatic Speech RecognitionMamba | CodeCode Available | 2 |
| Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR | Jun 25, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AI-Generated Song Detection via Lyrics Transcripts | Jun 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| End-to-End Spoken Grammatical Error Correction | Jun 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices | Jun 22, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages | Jun 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |