| Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction | Sep 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq | May 25, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech RecognitionGPU | CodeCode Available | 0 |
| Boosting Cross-Domain Speech Recognition with Self-Supervision | Jun 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems | Mar 2, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems | Mar 19, 2021 | Adversarial AttackAutomatic Speech Recognition | CodeCode Available | 0 |
| Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems | Sep 13, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation | Feb 9, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Oct 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Coupled Training of Sequence-to-Sequence Models for Accented Speech Recognition | May 14, 2020 | Accented Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 |
| A Theory of Unsupervised Speech Recognition | Jun 9, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream Weights | Mar 14, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Sep 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MLS: A Large-Scale Multilingual Dataset for Speech Research | Dec 7, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Pansori: ASR Corpus Generation from Open Online Video Contents | Dec 23, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| When Is TTS Augmentation Through a Pivot Language Useful? | Jul 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Analysis of EEG frequency bands for Envisioned Speech Recognition | Mar 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| AfriHuBERT: A self-supervised speech representation model for African languages | Sep 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Streaming Sequence Transduction through Dynamic Compression | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Textless Speech-to-Speech Translation With Limited Parallel Data | May 24, 2023 | Automatic Speech RecognitionDenoising | CodeCode Available | 0 |
| Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks | Dec 1, 2016 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks | Jan 10, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models | Jan 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation | Sep 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Mar 27, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Comparative Study on Transformer vs RNN in Speech Applications | Sep 13, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech Recognition | Mar 30, 2022 | AllAutomatic Speech Recognition | CodeCode Available | 0 |
| Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition | Jun 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Continual Learning for Monolingual End-to-End Automatic Speech Recognition | Dec 17, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based Decoding | Jul 29, 2015 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Audio Adversarial Examples: Targeted Attacks on Speech-to-Text | Jan 5, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Jul 21, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Exploring Generative Error Correction for Dysarthric Speech Recognition | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study | Aug 18, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization | Feb 28, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Robust Unstructured Knowledge Access in Conversational Dialogue with ASR Errors | Nov 8, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization | Jun 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| An Automatic Speech Recognition System for Bengali Language based on Wav2Vec2 and Transfer Learning | Sep 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Whose Emotion Matters? Speaking Activity Localisation without Prior Knowledge | Nov 23, 2022 | Active Speaker DetectionAutomatic Speech Recognition | CodeCode Available | 0 |
| Big model only for hard audios: Sample dependent Whisper model selection for efficient inferences | Sep 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning | Dec 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition | Oct 24, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks | Jun 12, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks | May 5, 2023 | Automatic Speech RecognitionCultural Vocal Bursts Intensity Prediction | CodeCode Available | 0 |