SOTAVerified

Automatic Speech Recognition

Papers

Showing 25512575 of 3174 papers

TitleStatusHype
Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora0
Zipformer: A faster and better encoder for automatic speech recognition0
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities0
100,000 Podcasts: A Spoken English Document Corpus0
ZJU’s IWSLT 2021 Speech Translation System0
Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech0
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain0
Regularization Advantages of Multilingual Neural Language Models for Low Resource Domains0
Towards Better Understanding of Spontaneous Conversations: Overcoming Automatic Speech Recognition Errors With Intent Recognition0
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR0
Transformer-based Cascaded Multimodal Speech Translation0
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation0
1SPU: 1-step Speech Processing Unit0
Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses0
Towards interfacing large language models with ASR systems using confidence measures and prompting0
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition0
Handling Numeric Expressions in Automatic Speech Recognition0
Sentence-wise Speech Summarization: Task, Datasets, and End-to-End Modeling with LM Knowledge Distillation0
SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data0
Self-Supervised Learning for Multi-Channel Neural Transducer0
ASR-enhanced Multimodal Representation Learning for Cross-Domain Product Retrieval0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder0
Survey of End-to-End Multi-Speaker Automatic Speech Recognition for Monaural Audio0
LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors0
2-bit Conformer quantization for automatic speech recognition0
Show:102550
← PrevPage 103 of 127Next →

No leaderboard results yet.