SOTAVerified

Automatic Speech Recognition

Papers

Showing 126150 of 3174 papers

TitleStatusHype
Improving Mandarin Speech Recogntion with Block-augmented TransformerCode1
DiaCorrect: Error Correction Back-end For Speaker DiarizationCode1
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy SpeechCode1
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global ContextCode1
Continuous speech separation: dataset and analysisCode1
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech RecognitionCode1
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMICode1
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation ModelsCode1
Combining Frame-Synchronous and Label-Synchronous Systems for Speech RecognitionCode1
Accented Speech Recognition With Accent-specific CodebooksCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech RecognitionCode1
CopyNE: Better Contextual ASR by Copying Named EntitiesCode1
A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applicationsCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Can Contextual Biasing Remain Effective with Whisper and GPT-2?Code1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG dataCode1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithmCode1
Advancing Test-Time Adaptation in Wild Acoustic Test SettingsCode1
BembaSpeech: A Speech Recognition Corpus for the Bemba LanguageCode1
AVLnet: Learning Audio-Visual Language Representations from Instructional VideosCode1
Show:102550
← PrevPage 6 of 127Next →

No leaderboard results yet.