| Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction | Sep 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization | Oct 21, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Mixed-Precision Training for NLP and Speech Recognition with OpenSeq2Seq | May 25, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Optimized Speculative Sampling for GPU Hardware Accelerators | Jun 16, 2024 | Automatic Speech RecognitionGPU | CodeCode Available | 0 |
| Boosting Cross-Domain Speech Recognition with Self-Supervision | Jun 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems | Mar 2, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| SoK: A Modularized Approach to Study the Security of Automatic Speech Recognition Systems | Mar 19, 2021 | Adversarial AttackAutomatic Speech Recognition | CodeCode Available | 0 |
| Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems | Sep 13, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation | Feb 9, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Oct 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Coupled Training of Sequence-to-Sequence Models for Accented Speech Recognition | May 14, 2020 | Accented Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 |
| A Theory of Unsupervised Speech Recognition | Jun 9, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks | Dec 16, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Audiovisual Speaker Tracking using Nonlinear Dynamical Systems with Dynamic Stream Weights | Mar 14, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers | Sep 5, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MLS: A Large-Scale Multilingual Dataset for Speech Research | Dec 7, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Pansori: ASR Corpus Generation from Open Online Video Contents | Dec 23, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| When Is TTS Augmentation Through a Pivot Language Useful? | Jul 20, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Analysis of EEG frequency bands for Envisioned Speech Recognition | Mar 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| AfriHuBERT: A self-supervised speech representation model for African languages | Sep 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Streaming Sequence Transduction through Dynamic Compression | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Textless Speech-to-Speech Translation With Limited Parallel Data | May 24, 2023 | Automatic Speech RecognitionDenoising | CodeCode Available | 0 |
| Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks | Dec 1, 2016 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |