| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation | May 8, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition | Apr 18, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Mitigating the Impact of Speech Recognition Errors on Spoken Question Answering by Adversarial Domain Adaptation | Apr 16, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| How2: A Large-scale Dataset for Multimodal Language Understanding | Nov 1, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition | Sep 5, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Open Source Automatic Speech Recognition for German | Jul 26, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Zero-shot keyword spotting for visual speech recognition in-the-wild | Jul 23, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Word Error Rate Estimation for Speech Recognition: e-WER | Jul 1, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Snips Voice Platform: an embedded Spoken Language Understanding system for private-by-design voice interfaces | May 25, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Speech Commands: A Dataset for Limited-Vocabulary Speech Recognition | Apr 9, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text | Apr 3, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models | Dec 5, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| State-of-the-art Speech Recognition With Sequence-to-Sequence Models | Dec 5, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks | Oct 7, 2016 | Anomaly Detection, Automatic Speech Recognition | Code Available | 1 |
| Single-Channel Multi-Speaker Separation using Deep Clustering | Jul 7, 2016 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine | Jul 17, 2025 | Audio Classification, Automatic Speech Recognition | Unverified | 0 |
| NonverbalTTS: A Public English Corpus of Text-Aligned Nonverbal Vocalizations with Emotion Annotations for Text-to-Speech | Jul 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| WhisperKit: On-device Real-time ASR with Billion-Scale Transformers | Jul 14, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech Recognition, Lip Reading | Unverified | 0 |
| Lightweight Target-Speaker-Based Overlap Transcription for Practical Streaming ASR | Jun 25, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-End Spoken Grammatical Error Correction | Jun 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| AI-Generated Song Detection via Lyrics Transcripts | Jun 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Splitformer: An improved early-exit architecture for automatic speech recognition on edge devices | Jun 22, 2025 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| State-Space Models in Efficient Whispered and Multi-dialect Speech Recognition | Jun 20, 2025 | Automatic Speech Recognition, Diversity | Unverified | 0 |
| Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages | Jun 20, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LM-SPT: LM-Aligned Semantic Distillation for Speech Tokenization | Jun 20, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Automatic Speech Recognition Biases in Newcastle English: an Error Analysis | Jun 19, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Unifying Streaming and Non-streaming Zipformer-based ASR | Jun 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios | Jun 17, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Seewo's Submission to MLC-SLM: Lessons learned from Speech Reasoning Language Models | Jun 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | Jun 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| NTU Speechlab LLM-Based Multilingual ASR System for Interspeech MLC-SLM Challenge 2025 | Jun 16, 2025 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| BUT System for the MLC-SLM Challenge | Jun 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enabling automatic transcription of child-centered audio recordings from real-world environments | Jun 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Lightweight and Robust Multi-Channel End-to-End Speech Recognition with Spherical Harmonic Transform | Jun 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| (SimPhon Speech Test): A Data-Driven Method for In Silico Design and Validation of a Phonetically Balanced Speech Test | Jun 13, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Joint ASR and Speaker Role Tagging with Serialized Output Training | Jun 12, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Improving Named Entity Transcription with Contextual LLM-based Revision | Jun 12, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Advances in Small-Footprint Keyword Spotting: A Comprehensive Review of Efficient Models and Algorithms | Jun 12, 2025 | Automatic Speech Recognition, Keyword Spotting | Code Available | 0 |
| FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition | Jun 12, 2025 | Automatic Speech Recognition, Contrastive Learning | Unverified | 0 |
| OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary | Jun 11, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Regularizing Learnable Feature Extraction for Automatic Speech Recognition | Jun 11, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia | Jun 10, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research | Jun 10, 2025 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation | Jun 9, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech | Jun 9, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Speaker-Distinguishable CTC: Learning Speaker Distinction Using CTC for Multi-Talker Speech Recognition | Jun 9, 2025 | Automatic Speech Recognition, Multi-Task Learning | Unverified | 0 |
| Unified Semi-Supervised Pipeline for Automatic Speech Recognition | Jun 9, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Speech Recognition on TV Series with Video-guided Post-Correction | Jun 8, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |