SOTAVerified

Automatic Speech Recognition

Papers

Showing 276-300 of 3174 papers

Title | Status | Hype
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One | Code | 1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion | Code | 1
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning | Code | 1
JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT | Code | 1
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet | Code | 1
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | Code | 1
AVATAR: Unconstrained Audiovisual Speech Recognition | Code | 1
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs | Code | 1
Back Translation for Speech-to-text Translation Without Transcripts | Code | 1
BembaSpeech: A Speech Recognition Corpus for the Bemba Language | Code | 1
Layer-wise Analysis of a Self-supervised Speech Representation Model | Code | 1
ArTST: Arabic Text and Speech Transformer | Code | 1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Code | 1
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition | Code | 1
Joint Masked CPC and CTC Training for ASR | Code | 1
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation | Code | 1
A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora | Code | 1
A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Code | 1
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 | Code | 1
Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Code | 1
Can Contextual Biasing Remain Effective with Whisper and GPT-2? | Code | 1
Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings | Code | 1
A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks | Code | 1
Dompteur: Taming Audio Adversarial Examples | Code | 1
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Code | 1

No leaderboard results yet.