SOTAVerified

Automatic Speech Recognition

Papers

Showing 29012925 of 3174 papers

TitleStatusHype
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for ConversationsCode0
Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text CorpusCode0
Multimodal Speech Recognition for Language-Guided Embodied AgentsCode0
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech RecognitionCode0
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task LearningCode0
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin ChineseCode0
Syllable Subword Tokens for Open Vocabulary Speech Recognition in MalayalamCode0
Segmentation-Free Streaming Machine TranslationCode0
Adapting the adapters for code-switching in multilingual ASRCode0
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech RecognitionCode0
Selective Attention Merging for low resource tasks: A case study of Child ASRCode0
Preserving spoken content in voice anonymisation with character-level vocoder conditioningCode0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive DecodingCode0
Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition SystemsCode0
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related TasksCode0
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search DegradationCode0
DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationCode0
Voices Unheard: NLP Resources and Models for Yorùbá Regional DialectsCode0
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker ChainCode0
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case StudyCode0
Pre-training on high-resource speech recognition improves low-resource speech-to-text translationCode0
Discrete Speech Unit Extraction via Independent Component AnalysisCode0
Towards Temporally Explainable Dysarthric Speech Clarity AssessmentCode0
Discrete Cross-Modal Alignment Enables Zero-Shot Speech TranslationCode0
Show:102550
← PrevPage 117 of 127Next →

No leaderboard results yet.