SOTAVerified

Robust Speech Recognition

Papers

Showing 150 of 97 papers

TitleStatusHype
SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research0
Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement0
Dysarthria Normalization via Local Lie Group Transformations for Robust ASRCode0
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech TokensCode1
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition0
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech RecognitionCode3
Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition0
Privacy-Preserving Edge Speech Understanding with Tiny Foundation Models0
Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition0
Robust Audiovisual Speech Recognition Models with Mixture-of-Experts0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech RecognitionCode0
CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception0
Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer0
Speech Robust Bench: A Robustness Benchmark For Speech RecognitionCode1
Large Language Models are Efficient Learners of Noise-Robust Speech RecognitionCode2
KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods0
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPTCode1
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios0
Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition0
RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain0
Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition0
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR0
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text TranslationCode2
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
Robust Speech Recognition via Large-Scale Weak SupervisionCode8
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representationsCode1
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognitionCode1
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and UnderstandingCode0
pMCT: Patched Multi-Condition Training for Robust Speech Recognition0
RUSAVIC Corpus: Russian Audio-Visual Speech in Cars0
On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training0
Multiple Confidence Gates For Joint Training Of SE And ASR0
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation0
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data0
Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionCode1
Speech-enhanced and Noise-aware Networks for Robust Speech RecognitionCode0
Chain-based Discriminative Autoencoders for Speech Recognition0
Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition0
Phone Based Keyword Spotting for Transcribing Very Low Resource Languages0
A comparison of streaming models and data augmentation methods for robust speech recognition0
Sequential Randomized Smoothing for Adversarially Robust Speech RecognitionCode0
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition0
Interactive Feature Fusion for End-to-End Noise-Robust Speech RecognitionCode1
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification0
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
HMM-based phoneme speech recognition system for the control and command of industrial robots0
Domain Adaptation Using Class Similarity for Robust Speech RecognitionCode0
Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.