SOTAVerified

Robust Speech Recognition

Papers

Showing 1-25 of 97 papers

Title | Status | Hype
SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research | - | 0
Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement | - | 0
Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Code | 0
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Code | 1
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition | - | 0
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Code | 3
Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition | - | 0
Privacy-Preserving Edge Speech Understanding with Tiny Foundation Models | - | 0
Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition | - | 0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Code | 0
Robust Audiovisual Speech Recognition Models with Mixture-of-Experts | - | 0
CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments | - | 0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | - | 0
Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer | - | 0
Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Code | 1
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Code | 2
KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods | - | 0
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Code | 1
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios | - | 0
Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition | - | 0
RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain | - | 0
Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition | - | 0
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR | - | 0
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Code | 2
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition | Code | 1
Page 1 of 4

No leaderboard results yet.