SOTAVerified

Robust Speech Recognition

Papers

Showing 125 of 97 papers

TitleStatusHype
Robust Speech Recognition via Large-Scale Weak SupervisionCode8
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech RecognitionCode3
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text TranslationCode2
Large Language Models are Efficient Learners of Noise-Robust Speech RecognitionCode2
Interactive Feature Fusion for End-to-End Noise-Robust Speech RecognitionCode1
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representationsCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPTCode1
Multi-task self-supervised learning for Robust Speech RecognitionCode1
Speech Robust Bench: A Robustness Benchmark For Speech RecognitionCode1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech TokensCode1
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognitionCode1
Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionCode1
Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition0
Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition0
Cumulative Adaptation for BLSTM Acoustic Models0
CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments0
類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese]0
Environmental Noise Embeddings for Robust Speech Recognition0
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition0
Attentive Fusion Enhanced Audio-Visual Encoding for Transformer Based Robust Speech Recognition0
雜訊環境下應用線性估測編碼於特徵時序列之強健性語音辨識 (Employing Linear Prediction Coding in Feature Time Sequences for Robust Speech Recognition in Noisy Environments) [In Chinese]0
Chain-based Discriminative Autoencoders for Speech Recognition0
Show:102550
← PrevPage 1 of 4Next →

No leaderboard results yet.