SOTAVerified

Automatic Speech Recognition

Papers

Showing 176200 of 3174 papers

TitleStatusHype
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech TranslationCode1
Radically Old Way of Computing Spectra: Applications in End-to-End ASRCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
CL-MASR: A Continual Learning Benchmark for Multilingual ASRCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
Attention-based Contextual Language Model Adaptation for Speech RecognitionCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
Combining Frame-Synchronous and Label-Synchronous Systems for Speech RecognitionCode1
A Cross-Modal Approach to Silent Speech with LLM-Enhanced RecognitionCode1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent ClassificationCode1
ASR Error Correction with Constrained Decoding on Operation PredictionCode1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic FeaturesCode1
A Systematic Comparison of Phonetic Aware Techniques for Speech EnhancementCode1
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global ContextCode1
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task LearningCode1
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation ModelsCode1
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control CommunicationsCode1
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian PortugueseCode1
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of KaldiCode1
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker OneCode1
Show:102550
← PrevPage 8 of 127Next →

No leaderboard results yet.