SOTAVerified

Automatic Speech Recognition

Papers

Showing 201225 of 3174 papers

TitleStatusHype
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR ErrorsCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech RecognitionCode1
Streaming Multi-Talker ASR with Token-Level Serialized Output TrainingCode1
Unified Multimodal Punctuation Restoration Framework for Mixed-Modality CorpusCode1
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelCode1
Regularizing End-to-End Speech Translation with Triangular Decomposition AgreementCode1
X-Vector based voice activity detection for multi-genre broadcast speech-to-textCode1
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMICode1
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural SpeechCode1
MT3: Multi-Task Multitrack Music TranscriptionCode1
A transfer learning based approach for pronunciation scoringCode1
Cross Attention Augmented Transducer Networks for Simultaneous TranslationCode1
CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian PortugueseCode1
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language ProcessingCode1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and SyllablesCode1
Interactive Feature Fusion for End-to-End Noise-Robust Speech RecognitionCode1
FAST-RIR: Fast neural diffuse room impulse response generatorCode1
Factorized Neural Transducer for Efficient Language Model AdaptationCode1
Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech RecognitionCode1
Vietnamese end-to-end speech recognition using wav2vec 2.0Code1
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionCode1
Knowledge Distillation from BERT Transformer to Speech Transformer for Intent ClassificationCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
Show:102550
← PrevPage 9 of 127Next →

No leaderboard results yet.