SOTAVerified

Automatic Speech Recognition

Papers

Showing 251300 of 3174 papers

TitleStatusHype
Common Voice: A Massively-Multilingual Speech CorpusCode1
Unified Multimodal Punctuation Restoration Framework for Mixed-Modality CorpusCode1
Unimodal Aggregation for CTC-based Speech RecognitionCode1
Unsupervised pretraining transfers well across languagesCode1
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech RecognitionCode1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across ModalitiesCode1
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithmCode1
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and RecognitionCode1
AVLnet: Learning Audio-Visual Language Representations from Instructional VideosCode1
BembaSpeech: A Speech Recognition Corpus for the Bemba LanguageCode1
Automatic Speech Recognition for Speech Assessment of Persian Preschool ChildrenCode1
Automatic Speech Recognition Benchmark for Air-Traffic CommunicationsCode1
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNetCode1
Automatic Disfluency Detection from Untranscribed SpeechCode1
AVATAR: Unconstrained Audiovisual Speech RecognitionCode1
AV Taris: Online Audio-Visual Speech RecognitionCode1
Back Translation for Speech-to-text Translation Without TranscriptsCode1
A Comparison of Methods for OOV-word Recognition on a New Public DatasetCode1
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0Code1
An Investigation of End-to-End Models for Robust Speech RecognitionCode1
Can Contextual Biasing Remain Effective with Whisper and GPT-2?Code1
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task LearningCode1
CL-MASR: A Continual Learning Benchmark for Multilingual ASRCode1
Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling InsightsCode1
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG dataCode1
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech RecognitionCode1
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language TextCode1
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global ContextCode1
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy SpeechCode1
Attention-based Contextual Language Model Adaptation for Speech RecognitionCode1
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian PortugueseCode1
Cross Attention Augmented Transducer Networks for Simultaneous TranslationCode1
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech RecognitionCode1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic FeaturesCode1
Deep Contextualized Acoustic Representations For Semi-Supervised Speech RecognitionCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
A transfer learning based approach for pronunciation scoringCode1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control CommunicationsCode1
Attention-based Audio-Visual Fusion for Robust Automatic Speech RecognitionCode1
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation ModelsCode1
Distilling a Pretrained Language Model to a Multilingual ASR ModelCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
ASR Error Correction with Constrained Decoding on Operation PredictionCode1
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech TranslationCode1
Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionCode1
Earnings-22: A Practical Benchmark for Accents in the WildCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
Show:102550
← PrevPage 6 of 64Next →

No leaderboard results yet.