SOTAVerified

Automatic Speech Recognition

Papers

Showing 76-100 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| D4AM: A General Denoising Framework for Downstream Acoustic Models | Code | 1 |
| Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition | Code | 1 |
| CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese | Code | 1 |
| Cross Attention Augmented Transducer Networks for Simultaneous Translation | Code | 1 |
| Continuous speech separation: dataset and analysis | Code | 1 |
| Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI | Code | 1 |
| Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models | Code | 1 |
| Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition | Code | 1 |
| Deep Sparse Conformer for Speech Recognition | Code | 1 |
| Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition | Code | 1 |
| A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Code | 1 |
| ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Code | 1 |
| Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition | Code | 1 |
| Can Contextual Biasing Remain Effective with Whisper and GPT-2? | Code | 1 |
| Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation | Code | 1 |
| Can we use Common Voice to train a Multi-Speaker TTS system? | Code | 1 |
| CL-MASR: A Continual Learning Benchmark for Multilingual ASR | Code | 1 |
| BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications | Code | 1 |
| Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 | Code | 1 |
| Adapting End-to-End Speech Recognition for Readable Subtitles | Code | 1 |
| CB-Conformer: Contextual biasing Conformer for biased word recognition | Code | 1 |
| ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context | Code | 1 |
| Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Code | 1 |
| CopyNE: Better Contextual ASR by Copying Named Entities | Code | 1 |
| Common Voice: A Massively-Multilingual Speech Corpus | Code | 1 |
Page 4 of 127

No leaderboard results yet.