SOTAVerified

Automatic Speech Recognition

Papers

Showing 201250 of 3174 papers

TitleStatusHype
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR ErrorsCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech RecognitionCode1
Streaming Multi-Talker ASR with Token-Level Serialized Output TrainingCode1
Unified Multimodal Punctuation Restoration Framework for Mixed-Modality CorpusCode1
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelCode1
Regularizing End-to-End Speech Translation with Triangular Decomposition AgreementCode1
X-Vector based voice activity detection for multi-genre broadcast speech-to-textCode1
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMICode1
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural SpeechCode1
MT3: Multi-Task Multitrack Music TranscriptionCode1
Cross Attention Augmented Transducer Networks for Simultaneous TranslationCode1
A transfer learning based approach for pronunciation scoringCode1
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language ProcessingCode1
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian PortugueseCode1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
Interactive Feature Fusion for End-to-End Noise-Robust Speech RecognitionCode1
K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and SyllablesCode1
FAST-RIR: Fast neural diffuse room impulse response generatorCode1
Factorized Neural Transducer for Efficient Language Model AdaptationCode1
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech RecognitionCode1
Vietnamese end-to-end speech recognition using wav2vec 2.0Code1
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionCode1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent ClassificationCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition ExperimentsCode1
The History of Speech Recognition to the Year 2030Code1
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0Code1
Token-Level Supervised Contrastive Learning for Punctuation RestorationCode1
STRODE: Stochastic Boundary Ordinary Differential EquationCode1
A Comparison of Methods for OOV-word Recognition on a New Public DatasetCode1
Layer-wise Analysis of a Self-supervised Speech Representation ModelCode1
TENET: A Time-reversal Enhancement Network for Noise-robust ASRCode1
Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech RecognitionCode1
Combining Frame-Synchronous and Label-Synchronous Systems for Speech RecognitionCode1
Learning Audio-Visual DereverberationCode1
Incorporating External POS Tagger for Punctuation RestorationCode1
Lightweight Adapter Tuning for Multilingual Speech TranslationCode1
Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling InsightsCode1
Attention-based Contextual Language Model Adaptation for Speech RecognitionCode1
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech TranslationCode1
End-to-End Speech Recognition from Federated Acoustic ModelsCode1
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from SpeechCode1
A Toolbox for Construction and Analysis of Speech DatasetsCode1
RNN Transducer Models For Spoken Language UnderstandingCode1
Speak or Chat with Me: End-to-End Spoken Language Understanding System with Flexible InputsCode1
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of KaldiCode1
Integer-only Zero-shot Quantization for Efficient Speech RecognitionCode1
Leveraging pre-trained representations to improve access to untranscribed speech from endangered languagesCode1
Radically Old Way of Computing Spectra: Applications in End-to-End ASRCode1
Show:102550
← PrevPage 5 of 64Next →

No leaderboard results yet.