SOTAVerified

Automatic Speech Recognition

Papers

Showing 11511200 of 3174 papers

TitleStatusHype
A Genetic Programming Approach To Zero-Shot Neural Architecture Ranking0
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice0
CommanderSong: A Systematic Approach for Practical Adversarial Voice Recognition0
Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition0
ASR Error Correction using Large Language Models0
A Generative Model of a Pronunciation Lexicon for Hindi0
Acoustic Model Fusion for End-to-end Speech Recognition0
Accent Recognition with Hybrid Phonetic Features0
Combining Spectral and Self-Supervised Features for Low Resource Speech Recognition and Translation0
Combining Language Models For Specialized Domains: A Colorful Approach0
ASR Error Correction and Domain Adaptation Using Machine Translation0
ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction0
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks0
Collaborative Data Relabeling for Robust and Diverse Voice Apps Recommendation in Intelligent Personal Assistants0
Code-Switching Text Generation and Injection in Mandarin-English ASR0
ASR Bundestag: A Large-Scale political debate dataset in German0
Code-Switching Detection with Data-Augmented Acoustic and Language Models0
Code-Switching Detection Using ASR-Generated Language Posteriors0
A GEN AI Framework for Medical Note Generation0
Acoustic Data-Driven Subword Modeling for End-to-End Speech Recognition0
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences0
ASR-based Features for Emotion Recognition: A Transfer Learning Approach0
Code Switched and Code Mixed Speech Recognition for Indic languages0
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations0
ASR-based CALL systems and learner speech data: new resources and opportunities for research and development in second language learning0
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition0
Coarse-To-Fine And Cross-Lingual ASR Transfer0
CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments0
ASR-Aware End-to-end Neural Diarization0
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition0
Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond0
Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech0
AccentFold: A Journey through African Accents for Zero-Shot ASR Adaptation to Target Accents0
MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder0
Cloud-based Automatic Speech Recognition Systems for Southeast Asian Languages0
Automatic Speech Recognition Advancements for Indigenous Languages of the Americas0
Clinical Dialogue Transcription Error Correction using Seq2Seq Models0
ASR Adaptation for E-commerce Chatbots using Cross-Utterance Context and Multi-Task Language Modeling0
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR0
Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings0
A Speech Test Set of Practice Business Presentations with Additional Relevant Texts0
Cleanformer: A multichannel array configuration-invariant neural enhancement frontend for ASR in smart speakers0
Classist Tools: Social Class Correlates with Performance in NLP0
AfriNames: Most ASR models "butcher" African Names0
Acoustically Grounded Word Embeddings for Improved Acoustics-to-Word Speech Recognition0
Classification Error Bound for Low Bayes Error Conditions in Machine Learning0
Citrinet: Closing the Gap between Non-Autoregressive and Autoregressive End-to-End Models for Automatic Speech Recognition0
Chinese Medical Speech Recognition with Punctuated Hypothesis0
Ask2Mask: Guided Data Selection for Masked Speech Modeling0
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides0
Show:102550
← PrevPage 24 of 64Next →

No leaderboard results yet.