SOTAVerified

Automatic Speech Recognition

Papers

Showing 526550 of 3174 papers

TitleStatusHype
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related TasksCode0
Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition0
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn'tCode0
ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets0
Audio-conditioned phonemic and prosodic annotation for building text-to-speech models from unlabeled speech data0
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion0
Refining Self-Supervised Learnt Speech Representation using Brain Activations0
Speech Emotion Recognition with ASR Transcripts: A Comprehensive Study on Word Error Rate and Fusion TechniquesCode0
Transformer-based Model for ASR N-Best Rescoring and Rewriting0
Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR0
PRoDeliberation: Parallel Robust Deliberation for End-to-End Spoken Language Understanding0
Guiding Frame-Level CTC Alignments Using Self-knowledge DistillationCode0
Towards Unsupervised Speech Recognition Without Pronunciation ModelsCode0
Tag and correct: high precision post-editing approach to correction of speech recognition errors0
Reading Miscue Detection in Primary School through Automatic Speech Recognition0
Fast Context-Biasing for CTC and Transducer ASR models with CTC-based Word Spotter0
AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection0
ASTRA: Aligning Speech and Text Representations for Asr without Sampling0
MS-HuBERT: Mitigating Pre-training and Inference Mismatch in Masked Language Modelling methods for learning Speech Representations0
LoRA-Whisper: Parameter-Efficient and Extensible Multilingual ASR0
Pitch-Aware RNN-T for Mandarin Chinese Mispronunciation Detection and Diagnosis0
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend0
To Distill or Not to Distill? On the Robustness of Robust Knowledge DistillationCode0
Hypernetworks for Personalizing ASR to Atypical Speech0
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech RecognitionCode1
Show:102550
← PrevPage 22 of 127Next →

No leaderboard results yet.