| Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Zipformer: A faster and better encoder for automatic speech recognition | Oct 17, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities | May 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| 100,000 Podcasts: A Spoken English Document Corpus | Dec 1, 2020 | 3D Facial Landmark LocalizationAutomatic Speech Recognition | —Unverified | 0 | 0 |
| ZJU’s IWSLT 2021 Speech Translation System | Aug 1, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech | May 9, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain | Jun 3, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Regularization Advantages of Multilingual Neural Language Models for Low Resource Domains | May 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition | Aug 21, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR | Sep 22, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Transformer-based Cascaded Multimodal Speech Translation | Oct 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation | Oct 29, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| 1SPU: 1-step Speech Processing Unit | Nov 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses | Jul 26, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| Towards interfacing large language models with ASR systems using confidence measures and prompting | Jul 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition | Jul 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Handling Numeric Expressions in Automatic Speech Recognition | Jul 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation | Aug 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | Aug 1, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Self-Supervised Learning for Multi-Channel Neural Transducer | Aug 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval | Aug 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder | Sep 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio | May 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors | May 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| 2-bit Conformer quantization for automatic speech recognition | May 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| 3-D Feature and Acoustic Modeling for Far-Field Speech Recognition | Nov 13, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| 4-bit Quantization of LSTM-based Speech Recognition Models | Aug 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Joint Beam Search Integrating CTC, Attention, and Transducer Decoders | Jun 5, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 | 0 |
| 4D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders | Dec 21, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A bandit approach to curriculum generation for automatic speech recognition | Feb 6, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A baseline model for computationally inexpensive speech recognition for Kazakh using the Coqui STT framework | Jul 19, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition | Oct 11, 2013 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Benchmark of French ASR Systems Based on Error Severity | Jan 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions | Oct 7, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Accelerating Transducers through Adjacent Token Merging | Jun 28, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition | May 16, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Accented Speech Recognition: A Survey | Apr 21, 2021 | Accented Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Accented Speech Recognition Inspired by Human Perception | Apr 9, 2021 | Accented Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 | 0 |
| AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents | Feb 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Accent Recognition with Hybrid Phonetic Features | May 5, 2021 | Audio ClassificationAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings | Oct 7, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Accurate and Structured Pruning for Efficient Automatic Speech Recognition | May 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Accurate synthesis of Dysarthric Speech for ASR data augmentation | Aug 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A CLARIN Transcription Portal for Interview Data | May 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection | May 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement | Mar 3, 2024 | Automatic Speech RecognitionKeyword Spotting | —Unverified | 0 | 0 |
| AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup | Oct 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Comparative Analysis of Bilingual and Trilingual Wav2Vec Models for Automatic Speech Recognition in Multilingual Oral History Archives | Jul 24, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| A Comparative Analysis of Crowdsourced Natural Language Corpora for Spoken Dialog Systems | May 1, 2016 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario | Dec 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |