SOTAVerified

Automatic Speech Recognition

Papers

Showing 1926-1950 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Research Advances and New Paradigms for Biology-inspired Spiking Neural Networks | | 0 |
| Research Challenges in Building a Voice-based Artificial Personal Shopper - Position Paper | | 0 |
| Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech | | 0 |
| Residual Convolutional CTC Networks for Automatic Speech Recognition | | 0 |
| Residual Energy-Based Models for End-to-End Speech Recognition | | 0 |
| Residual Language Model for End-to-end Speech Recognition | | 0 |
| Resilience of Large Language Models for Noisy Instructions | | 0 |
| Resolving Transcription Ambiguity in Spanish: A Hybrid Acoustic-Lexical System for Punctuation Restoration | | 0 |
| Resource aware design of a deep convolutional-recurrent neural network for speech recognition through audio-visual sensor fusion | | 0 |
| Resource-Efficient Adaptation of Speech Foundation Models for Multi-Speaker ASR | | 0 |
| Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion | | 0 |
| Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding | | 0 |
| Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance | | 0 |
| Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation | | 0 |
| Retrieval Augmented Correction of Named Entity Speech Recognition Errors | | 0 |
| Retrieval-Enhanced Few-Shot Prompting for Speech Event Extraction | | 0 |
| Retrieve and Copy: Scaling ASR Personalization to Large Catalogs | | 0 |
| Revisiting Acoustic Features for Robust ASR | | 0 |
| Revisiting the Boundary between ASR and NLU in the Age of Conversational Dialog Systems | | 0 |
| r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation | | 0 |
| RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios | | 0 |
| RNN-T For Latency Controlled ASR With Improved Beam Search | | 0 |
| RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions | | 0 |
| Roadmap towards Superhuman Speech Understanding using Large Language Models | | 0 |
| ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR | | 0 |

No leaderboard results yet.