SOTAVerified

Automatic Speech Recognition

Papers

Showing 201–225 of 3174 papers

Title | Status | Hype
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text | Code | 1
Leveraging pre-trained representations to improve access to untranscribed speech from endangered languages | Code | 1
Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition | Code | 1
Deep Sparse Conformer for Speech Recognition | Code | 1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond | Code | 1
Lightweight Adapter Tuning for Multilingual Speech Translation | Code | 1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features | Code | 1
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities | Code | 1
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement | Code | 1
Automatic Speech Recognition for Speech Assessment of Persian Preschool Children | Code | 1
Mamba for Streaming ASR Combined with Unimodal Aggregation | Code | 1
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale | Code | 1
Adaptation of Whisper models to child speech recognition | Code | 1
MelHuBERT: A simplified HuBERT on Mel spectrograms | Code | 1
Improving Mandarin Speech Recogntion with Block-augmented Transformer | Code | 1
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction | Code | 1
Distilling the Knowledge of BERT for Sequence-to-Sequence ASR | Code | 1
Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts | Code | 1
Adapting End-to-End Speech Recognition for Readable Subtitles | Code | 1
From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition | Code | 1
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation | Code | 1
Dompteur: Taming Audio Adversarial Examples | Code | 1
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One | Code | 1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion | Code | 1
ASR Error Correction with Constrained Decoding on Operation Prediction | Code | 1
Page 9 of 127

No leaderboard results yet.