SOTAVerified

Automatic Speech Recognition

Papers

Showing 451–475 of 3174 papers

| Title | Status | Hype |
| Direct Speech to Speech Translation: A Review | | 0 |
| Unveiling Biases while Embracing Sustainability: Assessing the Dual Challenges of Automatic Speech Recognition Systems | | 0 |
| Adapting Automatic Speech Recognition for Accented Air Traffic Control Communications | | 0 |
| CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition | | 0 |
| Nexus: An Omni-Perceptive And -Interactive Model for Language, Audio, And Vision | | 0 |
| Exploring Gender Disparities in Automatic Speech Recognition Technology | | 0 |
| Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation | | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | | 0 |
| Understanding Zero-shot Rare Word Recognition Improvements Through LLM Integration | | 0 |
| The Esethu Framework: Reimagining Sustainable Dataset Governance and Curation for Low-Resource Languages | | 0 |
| Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders | | 0 |
| WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models | | 0 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Code | 0 |
| Adopting Whisper for Confidence Estimation | | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | | 0 |
| Speech-FT: Merging Pre-trained And Fine-Tuned Speech Representation Models For Cross-Task Generalization | | 0 |
| Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders | | 0 |
| Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models | | 0 |
| Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge | | 0 |
| MTLM: Incorporating Bidirectional Text Information to Enhance Language Model Training in Speech Recognition Systems | | 0 |
| Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors | | 0 |
| Evaluating Standard and Dialectal Frisian ASR: Multilingual Fine-tuning and Language Identification for Improved Low-resource Performance | | 0 |
| Koel-TTS: Enhancing LLM based Speech Generation with Preference Alignment and Classifier Free Guidance | | 0 |
| Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers | | 0 |
| Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond | | 0 |

No leaderboard results yet.