| BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications | Oct 12, 2021 | Action Detection, Activity Detection | Code Available | 1 | 5 |
| Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction | Jul 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Automatic Speech Recognition for Speech Assessment of Persian Preschool Children | Mar 24, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | May 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Adaptation of Whisper models to child speech recognition | Jul 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| EnCodecMAE: Leveraging neural codecs for universal audio representation learning | Sep 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning | Oct 27, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Adapting End-to-End Speech Recognition for Readable Subtitles | May 25, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Oct 24, 2022 | Action Detection, Activity Detection | Code Available | 1 | 5 |
| Automatic Disfluency Detection from Untranscribed Speech | Nov 1, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Automatic Speech Recognition Benchmark for Air-Traffic Communications | Jun 18, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| CB-Conformer: Contextual biasing Conformer for biased word recognition | Apr 19, 2023 | Automatic Speech Recognition, Language Modeling | Code Available | 1 | 5 |
| AVATAR: Unconstrained Audiovisual Speech Recognition | Jun 15, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients | Nov 11, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| End-to-End Automatic Speech Recognition for Gujarati | Dec 1, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Apr 20, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish | Jul 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition | Jul 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Common Voice: A Massively-Multilingual Speech Corpus | Dec 13, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| HypR: A comprehensive study for ASR hypothesis revising with a reference corpus | Sep 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition | Feb 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Improved Noisy Student Training for Automatic Speech Recognition | May 19, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model | Jan 6, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Improving Mandarin Speech Recogntion with Block-augmented Transformer | Jul 24, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Audio-Visual Efficient Conformer for Robust Speech Recognition | Jan 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models | Feb 9, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 1 | 5 |
| DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities | Feb 16, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text | Apr 3, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition | Mar 28, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering | Mar 9, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Earnings-22: A Practical Benchmark for Accents in the Wild | Mar 29, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Accented Speech Recognition With Accent-specific Codebooks | Oct 24, 2023 | Accented Speech Recognition, Automatic Speech Recognition | Code Available | 1 | 5 |
| Dompteur: Taming Audio Adversarial Examples | Feb 10, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition | Sep 5, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| AVLnet: Learning Audio-Visual Language Representations from Instructional Videos | Jun 16, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation | Nov 2, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech Recognition | Feb 7, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| End-to-end Named Entity Recognition from English Speech | May 22, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applications | Apr 30, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement | Jun 22, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English | Aug 3, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond | Apr 20, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications | Nov 8, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Aug 9, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion | Mar 29, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One | Feb 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |