SOTAVerified

Automatic Speech Recognition

Papers

Showing 1026-1050 of 3174 papers

Title | Status | Hype
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation | - | 0
A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment | - | 0
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | - | 0
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target | - | 0
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice | - | 0
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction | - | 0
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | Code | 0
2-bit Conformer quantization for automatic speech recognition | - | 0
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | - | 0
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator | - | 0
Improving Scheduled Sampling for Neural Transducer-based ASR | - | 0
Mixture-of-Expert Conformer for Streaming Multilingual ASR | - | 0
Svarah: Evaluating English ASR Systems on Indian Accents | - | 0
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition | - | 0
InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition | - | 0
Textless Speech-to-Speech Translation With Limited Parallel Data | Code | 0
Iteratively Improving Speech Recognition and Voice Conversion | - | 0
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation | - | 0
Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding | - | 0
Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | - | 0
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | - | 0
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR | - | 0
Personalized Predictive ASR for Latency Reduction in Voice Assistants | - | 0
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers | - | 0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | - | 0
Page 42 of 127

No leaderboard results yet.