SOTAVerified

Automatic Speech Recognition

Papers

Showing 29012950 of 3174 papers

TitleStatusHype
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for ConversationsCode0
Swiss Parliaments Corpus, an Automatically Aligned Swiss German Speech to Standard German Text CorpusCode0
Multimodal Speech Recognition for Language-Guided Embodied AgentsCode0
Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech RecognitionCode0
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task LearningCode0
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin ChineseCode0
Syllable Subword Tokens for Open Vocabulary Speech Recognition in MalayalamCode0
Segmentation-Free Streaming Machine TranslationCode0
Adapting the adapters for code-switching in multilingual ASRCode0
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech RecognitionCode0
Selective Attention Merging for low resource tasks: A case study of Child ASRCode0
Preserving spoken content in voice anonymisation with character-level vocoder conditioningCode0
Pretext Tasks selection for multitask self-supervised speech representation learningCode0
Synchronous Speech Recognition and Speech-to-Text Translation with Interactive DecodingCode0
Whispering Under the Eaves: Protecting User Privacy Against Commercial and LLM-powered Automatic Speech Recognition SystemsCode0
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related TasksCode0
Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search DegradationCode0
DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationCode0
Voices Unheard: NLP Resources and Models for Yorùbá Regional DialectsCode0
Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker ChainCode0
Unsupervised Data Selection for TTS: Using Arabic Broadcast News as a Case StudyCode0
Pre-training on high-resource speech recognition improves low-resource speech-to-text translationCode0
Discrete Speech Unit Extraction via Independent Component AnalysisCode0
Towards Temporally Explainable Dysarthric Speech Clarity AssessmentCode0
Discrete Cross-Modal Alignment Enables Zero-Shot Speech TranslationCode0
Improving Voice Separation by Incorporating End-to-end Speech RecognitionCode0
Self-Powered LLM Modality Expansion for Large Speech-Text ModelsCode0
Improving RNN Transducer Modeling for End-to-End Speech RecognitionCode0
ADIMA: Abuse Detection In Multilingual AudioCode0
WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and DereverberationCode0
Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based AugmentationCode0
Advancing African-Accented Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR ModelsCode0
Leveraging Self-Supervised Models for Automatic Whispered Speech RecognitionCode0
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASRCode0
Improving LSTM-CTC based ASR performance in domains with limited training dataCode0
Improving CTC-based speech recognition via knowledge transferring from pre-trained language modelsCode0
Targeted Adversarial Examples for Black Box Audio SystemsCode0
A Comprehensive Evaluation of Incremental Speech Recognition and Diarization for Conversational AICode0
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context ModelingCode0
Towards Unsupervised Speech Recognition Without Pronunciation ModelsCode0
A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature ExtractorsCode0
Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model DecodingCode0
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech RecognitionCode0
Self-supervised Speech Representations Still Struggle with African American Vernacular EnglishCode0
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics ProcessingCode0
Discovering Phonetic Inventories with Crosslingual Automatic Speech RecognitionCode0
Learning to adapt: a meta-learning approach for speaker adaptationCode0
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech RecognitionCode0
Writer adaptation for offline text recognition: An exploration of neural network-based methodsCode0
Semantically Corrected Amharic Automatic Speech RecognitionCode0
Show:102550
← PrevPage 59 of 64Next →

No leaderboard results yet.