SOTAVerified

Automatic Speech Recognition

Papers

Showing 226250 of 3174 papers

TitleStatusHype
Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through GradientsCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
End-to-end Named Entity Recognition from English SpeechCode1
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control CommunicationsCode1
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech RecognitionCode1
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language ModelsCode1
Espresso: A Fast End-to-end Neural Speech Recognition ToolkitCode1
End-to-End Speech Recognition and Disfluency RemovalCode1
End-to-End Speech Recognition from Federated Acoustic ModelsCode1
Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An OverviewCode1
Enhancing Monotonic Multihead Attention for Streaming ASRCode1
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
Punctuation Restoration using Transformer Models for High-and Low-Resource LanguagesCode1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversionCode1
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of KaldiCode1
Quilt-1M: One Million Image-Text Pairs for HistopathologyCode1
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker OneCode1
Factorized Neural Transducer for Efficient Language Model AdaptationCode1
Fast Development of ASR in African Languages using Self Supervised Speech Representation LearningCode1
Regularizing End-to-End Speech Translation with Triangular Decomposition AgreementCode1
ASR Error Correction with Constrained Decoding on Operation PredictionCode1
How2: A Large-scale Dataset for Multimodal Language UnderstandingCode1
HypR: A comprehensive study for ASR hypothesis revising with a reference corpusCode1
ArTST: Arabic Text and Speech TransformerCode1
Show:102550
← PrevPage 10 of 127Next →

No leaderboard results yet.