SOTAVerified

Robust Speech Recognition

Papers

Showing 150 of 97 papers

TitleStatusHype
Robust Speech Recognition via Large-Scale Weak SupervisionCode8
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech RecognitionCode3
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text TranslationCode2
Large Language Models are Efficient Learners of Noise-Robust Speech RecognitionCode2
Speech Robust Bench: A Robustness Benchmark For Speech RecognitionCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPTCode1
MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech TokensCode1
Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionCode1
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representationsCode1
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognitionCode1
Interactive Feature Fusion for End-to-End Noise-Robust Speech RecognitionCode1
Multi-task self-supervised learning for Robust Speech RecognitionCode1
Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition0
雜訊環境下應用線性估測編碼於特徵時序列之強健性語音辨識 (Employing Linear Prediction Coding in Feature Time Sequences for Robust Speech Recognition in Noisy Environments) [In Chinese]0
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation0
Environmental Noise Embeddings for Robust Speech Recognition0
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition0
Feature Learning with Gaussian Restricted Boltzmann Machine for Robust Speech Recognition0
Feature Normalisation for Robust Speech Recognition0
HMM-based phoneme speech recognition system for the control and command of industrial robots0
改良式統計圖等化法強鍵性語音辨識之研究 (Improved Histogram Equalization Methods for Robust Speech Recognition) [In Chinese]0
改良調變頻譜統計圖等化法於強健性語音辨識之研究 (Improved Modulation Spectrum Histogram Equalization for Robust Speech Recognition) [In Chinese]0
Improved Transcription and Indexing of Oral History Interviews for Digital Humanities Research0
Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition0
調變頻譜分解技術於強健語音辨識之研究 (Investigating Modulation Spectrum Factorization Techniques for Robust Speech Recognition) [In Chinese]0
Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition0
KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods0
Learning Noise-Invariant Representations for Robust Speech Recognition0
Learning Noise-Invariant Representations for Robust Speech Recognition0
Modality Attention for End-to-End Audio-visual Speech Recognition0
Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition0
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception0
A comparison of streaming models and data augmentation methods for robust speech recognition0
Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts0
Attentive Fusion Enhanced Audio-Visual Encoding for Transformer Based Robust Speech Recognition0
類神經網路訓練結合環境群集及專家混合系統於強健性語音辨識(Automatic Speech Recognition using Neural Network based Acoustic Model with the Environment Clustering and Mixture of Experts Algorithms) [In Chinese]0
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR0
Chain-based Discriminative Autoencoders for Speech Recognition0
CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments0
Cumulative Adaptation for BLSTM Acoustic Models0
Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition0
Data-Driven Pronunciation Modeling of Swiss German Dialectal Speech for Automatic Speech Recognition0
Deep Learning Based Dereverberation of Temporal Envelopesfor Robust Speech Recognition0
Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments0
Deep Long Short-Term Memory Adaptive Beamforming Networks For Multichannel Robust Speech Recognition0
Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.