SOTAVerified

Automatic Speech Recognition

Papers

Showing 5175 of 3174 papers

TitleStatusHype
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation0
Analyzing the Importance of Blank for CTC-Based Knowledge Distillation0
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric SpeechCode0
Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric0
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-trainingCode0
GigaAM: Efficient Self-Supervised Learner for Speech RecognitionCode4
Towards Temporally Explainable Dysarthric Speech Clarity AssessmentCode0
DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition0
Causal Structure Discovery for Error Diagnostics of Children's ASR0
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit PoetryCode0
MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR0
Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction0
Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation SystemCode0
Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection0
Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation0
NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding0
Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis0
Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation0
Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use0
PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems0
In-context Language Learning for Endangered Languages in Speech Recognition0
The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages0
Show:102550
← PrevPage 3 of 127Next →

No leaderboard results yet.