| A Theory of Unsupervised Speech Recognition | Jun 9, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action DetectionActivity Detection | CodeCode Available | 0 | 5 |
| Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Nov 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation | Sep 13, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Improving LSTM-CTC based ASR performance in domains with limited training data | Jul 3, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Improving CTC-based speech recognition via knowledge transferring from pre-trained language models | Feb 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Jul 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding | Feb 10, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Improving RNN Transducer Modeling for End-to-End Speech Recognition | Sep 26, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| HydraFormer: One Encoder For All Subsampling Rates | Aug 8, 2024 | AllAutomatic Speech Recognition | CodeCode Available | 0 | 5 |
| Hybrid phonetic-neural model for correction in speech recognition systems | Feb 12, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning Fusion | Sep 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling | Feb 7, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism | Mar 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition | Nov 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| How You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia Detection | May 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream Weights | Mar 14, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization | Jul 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition | Apr 13, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| How Phonotactics Affect Multilingual and Zero-shot ASR Performance | Oct 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Human Transcription Quality Improvement | Sep 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition | Mar 22, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | May 30, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Dec 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Jun 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition | May 19, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| A Simplified Fully Quantized Transformer for End-to-end Speech Recognition | Nov 9, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| AI-Generated Song Detection via Lyrics Transcripts | Jun 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR | May 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| A Unified Speaker Adaptation Approach for ASR | Oct 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech | May 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Assessing the Use of Prosody in Constituency Parsing of Imperfect Transcripts | Jun 14, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Measuring the Accuracy of Automatic Speech Recognition Solutions | Aug 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Finnish Parliament ASR corpus - Analysis, benchmarks and statistics | Mar 28, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Jan 10, 2025 | Automatic Speech RecognitionClassification | CodeCode Available | 0 | 5 |
| Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients | May 27, 2024 | Automatic Speech RecognitionFederated Learning | CodeCode Available | 0 | 5 |
| Fine-Grained Grounding for Multimodal Speech Recognition | Oct 5, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Multi-Stage Speaker Diarization for Noisy Classrooms | May 16, 2025 | Action DetectionActivity Detection | CodeCode Available | 0 | 5 |
| Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study | Mar 12, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation | Jun 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Improving Voice Separation by Incorporating End-to-end Speech Recognition | Nov 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Exploring Generative Error Correction for Dysarthric Speech Recognition | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization | Jun 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization | Feb 28, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | Jan 7, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 | 5 |
| Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks | May 5, 2023 | Automatic Speech RecognitionCultural Vocal Bursts Intensity Prediction | CodeCode Available | 0 | 5 |
| Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks | Jun 12, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit | Oct 24, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network | Jun 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |