SOTAVerified

Automatic Speech Recognition

Papers

Showing 176200 of 3174 papers

TitleStatusHype
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionCode1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic FeaturesCode1
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionCode1
End-to-end Named Entity Recognition from English SpeechCode1
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech TranslationCode1
AV Taris: Online Audio-Visual Speech RecognitionCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
Espresso: A Fast End-to-end Neural Speech Recognition ToolkitCode1
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of KaldiCode1
Extending Whisper with prompt tuning to target-speaker ASRCode1
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG dataCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Continuous speech separation: dataset and analysisCode1
Radically Old Way of Computing Spectra: Applications in End-to-End ASRCode1
Automatic Speech Recognition for Speech Assessment of Persian Preschool ChildrenCode1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control CommunicationsCode1
HowToCaption: Prompting LLMs to Transform Video Annotations at ScaleCode1
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer LearningCode1
Improved DeepFake Detection Using Whisper FeaturesCode1
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion EncoderCode1
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelCode1
Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data AugmentationCode1
Automatic Speech Recognition Benchmark for Air-Traffic CommunicationsCode1
Show:102550
← PrevPage 8 of 127Next →

No leaderboard results yet.