SOTAVerified

Automatic Speech Recognition

Papers

Showing 1226–1250 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | Code | 0 |
| A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment | | 0 |
| CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice | | 0 |
| Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | | 0 |
| Building Accurate Low Latency ASR for Streaming Voice Search | | 0 |
| Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target | | 0 |
| Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation | | 0 |
| DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | Code | 0 |
| DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction | | 0 |
| 2-bit Conformer quantization for automatic speech recognition | | 0 |
| INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | | 0 |
| Mixture-of-Expert Conformer for Streaming Multilingual ASR | | 0 |
| ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition | | 0 |
| Improving Scheduled Sampling for Neural Transducer-based ASR | | 0 |
| Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator | | 0 |
| Svarah: Evaluating English ASR Systems on Indian Accents | | 0 |
| InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition | | 0 |
| Textless Speech-to-Speech Translation With Limited Parallel Data | Code | 0 |
| Iteratively Improving Speech Recognition and Voice Conversion | | 0 |
| Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation | | 0 |
| BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR | | 0 |
| Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers | | 0 |
| TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | | 0 |
| Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | | 0 |
| On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | | 0 |

No leaderboard results yet.