SOTAVerified

Automatic Speech Recognition

Papers

Showing 11761200 of 3174 papers

TitleStatusHype
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition0
Coarse-To-Fine And Cross-Lingual ASR Transfer0
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments0
ASR-Aware End-to-end Neural Diarization0
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition0
Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond0
Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech0
AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder0
Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages0
Automatic Speech Recognition Advancements for Indigenous Languages of the Americas0
Clinical Dialogue Transcription Error Correction using Seq2Seq Models0
ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling0
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR0
Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings0
A Speech Test Set of Practice Business Presentations with Additional Relevant Texts0
Cleanformer: A multichannel array configuration-invariant neural enhancement frontend for ASR in smart speakers0
Classist Tools: Social Class Correlates with Performance in NLP0
AfriNames: Most ASR models "butcher" African Names0
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition0
Classification Error Bound for Low Bayes Error Conditions in Machine Learning0
Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition0
Chinese Medical Speech Recognition with Punctuated Hypothesis0
Ask2Mask: Guided Data Selection for Masked Speech Modeling0
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides0
Show:102550
← PrevPage 48 of 127Next →

No leaderboard results yet.