SOTAVerified

Automatic Speech Recognition

Papers

Showing 151175 of 3174 papers

TitleStatusHype
MelHuBERT: A simplified HuBERT on Mel spectrogramsCode1
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple TargetsCode1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control CommunicationsCode1
Towards Improved Room Impulse Response Estimation for Speech RecognitionCode1
Multi-blank Transducers for Speech RecognitionCode1
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech ProcessingCode1
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive LearningCode1
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task LearningCode1
There is more than one kind of robustness: Fooling Whisper with adversarial examplesCode1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
Towards Relation Extraction From SpeechCode1
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMTCode1
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LMCode1
Deep Sparse Conformer for Speech RecognitionCode1
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languagesCode1
ASR Error Correction with Constrained Decoding on Operation PredictionCode1
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognitionCode1
Improving Mandarin Speech Recogntion with Block-augmented TransformerCode1
Transfer Learning of wav2vec 2.0 for Automatic Lyric TranscriptionCode1
MM-ALT: A Multimodal Automatic Lyric Transcription SystemCode1
Distilling a Pretrained Language Model to a Multilingual ASR ModelCode1
A Systematic Comparison of Phonetic Aware Techniques for Speech EnhancementCode1
Show:102550
← PrevPage 7 of 127Next →

No leaderboard results yet.