SOTAVerified

Automatic Speech Recognition

Papers

Showing 351400 of 3174 papers

TitleStatusHype
Automatic Speech Recognition of African American English: Lexical and Contextual Effects0
Low-Resource Domain Adaptation for Speech LLMs via Text-Only Fine-Tuning0
AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition0
Lightweight Prompt Biasing for Contextualized End-to-End ASR Systems0
Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition0
Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models0
LLM-based phoneme-to-grapheme for phoneme-based speech recognition0
Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM0
LESS: Large Language Model Enhanced Semi-Supervised Learning for Speech Foundational Models0
Customizing Speech Recognition Model with Large Language Model Feedback0
Effects of Speaker Count, Duration, and Accent Diversity on Zero-Shot Accent Robustness in Low-Resource ASR0
A Multi-Dialectal Dataset for German Dialect ASR and Dialect-to-Standard Speech Translation0
Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss0
Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning0
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation0
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric SpeechCode0
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation0
DNCASR: End-to-End Training for Speaker-Attributed ASR0
Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric0
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-trainingCode0
Towards Temporally Explainable Dysarthric Speech Clarity AssessmentCode0
DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition0
Causal Structure Discovery for Error Diagnostics of Children's ASR0
Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach0
MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction0
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization0
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit PoetryCode0
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation SystemCode0
Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection0
NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding0
Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis0
PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems0
Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use0
Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation0
Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection0
KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization0
Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence0
Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition0
Robust fine-tuning of speech recognition models via model merging: application to disordered speech0
In-context Language Learning for Endangered Languages in Speech Recognition0
The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages0
Exploring Generative Error Correction for Dysarthric Speech RecognitionCode0
CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASRCode0
LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic ContextCode0
VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining0
An Effective Training Framework for Light-Weight Automatic Speech Recognition Models0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
Show:102550
← PrevPage 8 of 64Next →

No leaderboard results yet.