SOTAVerified

Robust Speech Recognition

Papers

Showing 1-50 of 97 papers

Title | Status | Hype
Robust Speech Recognition via Large-Scale Weak Supervision | Code | 8
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Code | 3
Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Code | 2
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Code | 2
An Investigation of End-to-End Models for Robust Speech Recognition | Code | 1
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition | Code | 1
Multi-task self-supervised learning for Robust Speech Recognition | Code | 1
Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Code | 1
Audio-Visual Efficient Conformer for Robust Speech Recognition | Code | 1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition | Code | 1
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Code | 1
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Code | 1
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition | Code | 1
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Code | 1
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition | Code | 1
Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Code | 0
Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks | Code | 0
Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition | Code | 0
Sequential Randomized Smoothing for Adversarially Robust Speech Recognition | Code | 0
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding | Code | 0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Code | 0
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Code | 0
Domain Adaptation Using Class Similarity for Robust Speech Recognition | Code | 0
Scalable Factorized Hierarchical Variational Autoencoder Training | Code | 0
Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition | Code | 0
Very Deep Convolutional Neural Networks for Robust Speech Recognition | Code | 0
Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition | | 0
調變頻譜分解技術於強健語音辨識之研究 (Investigating Modulation Spectrum Factorization Techniques for Robust Speech Recognition) [In Chinese] | | 0
Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition | | 0
KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods | | 0
Learning Noise-Invariant Representations for Robust Speech Recognition | | 0
Learning Noise-Invariant Representations for Robust Speech Recognition | | 0
Modality Attention for End-to-End Audio-visual Speech Recognition | | 0
Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition | | 0
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition | | 0
Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer | | 0
Multiple Confidence Gates For Joint Training Of SE And ASR | | 0
Multi-scale Octave Convolutions for Robust Speech Recognition | | 0
Multi-Staged Cross-Lingual Acoustic Model Adaption for Robust Speech Recognition in Real-World Applications - A Case Study on German Oral History Interviews | | 0
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data | | 0
On combining features for single-channel robust speech recognition in reverberant environments | | 0
On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training | | 0
On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels | | 0
Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification | | 0
Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition | | 0
Phone Based Keyword Spotting for Transcribing Very Low Resource Languages | | 0
pMCT: Patched Multi-Condition Training for Robust Speech Recognition | | 0
Privacy-Preserving Edge Speech Understanding with Tiny Foundation Models | | 0
Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition | | 0
Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition | | 0
Page 1 of 2

No leaderboard results yet.