SOTAVerified

Automatic Speech Recognition

Papers

Showing 1201–1250 of 3174 papers

Title | Status | Hype
OTF: Optimal Transport based Fusion of Supervised and Self-Supervised Learning Models for Automatic Speech Recognition | | 0
Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition | | 0
SpellMapper: A non-autoregressive neural spellchecker for ASR customization with candidate retrieval based on n-gram mappings | | 0
End-to-End Joint Target and Non-Target Speakers ASR | | 0
Advancing African-Accented Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models | Code | 0
Streaming Speech-to-Confusion Network Speech Recognition | | 0
Audio-Visual Speech Enhancement with Score-Based Generative Models | | 0
Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation | | 0
Explainability of Speech Recognition Transformers via Gradient-based Attention Visualization | Code | 0
SlothSpeech: Denial-of-service Attack Against Speech Recognition Models | Code | 0
Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home | | 0
Adaptation and Optimization of Automatic Speech Recognition (ASR) for the Maritime Domain in the Field of VHF Communication | | 0
Towards hate speech detection in low-resource languages: Comparing ASR to acoustic word embeddings on Wolof and Swahili | | 0
Some voices are too common: Building fair speech recognition systems using the Common Voice dataset | | 0
AfriNames: Most ASR models "butcher" African Names | | 0
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts | | 0
Encoder-decoder multimodal speaker change detection | | 0
Strategies for improving low resource speech to text translation relying on pre-trained ASR models | | 0
Zero-Shot Automatic Pronunciation Assessment | | 0
Accurate and Structured Pruning for Efficient Automatic Speech Recognition | | 0
Simple yet Effective Code-Switching Language Identification with Multitask Pre-Training and Transfer Learning | | 0
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition | | 0
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers | | 0
STT4SG-350: A Speech Corpus for All Swiss German Dialect Regions | | 0
Towards Selection of Text-to-speech Data to Augment ASR Training | | 0
Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | Code | 0
A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment | | 0
CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice | | 0
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | | 0
Building Accurate Low Latency ASR for Streaming Voice Search | | 0
Improving Textless Spoken Language Understanding with Discrete Units as Intermediate Target | | 0
Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation | | 0
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | Code | 0
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction | | 0
2-bit Conformer quantization for automatic speech recognition | | 0
INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition | | 0
Mixture-of-Expert Conformer for Streaming Multilingual ASR | | 0
ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition | | 0
Improving Scheduled Sampling for Neural Transducer-based ASR | | 0
Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator | | 0
Svarah: Evaluating English ASR Systems on Indian Accents | | 0
InterFormer: Interactive Local and Global Features Fusion for Automatic Speech Recognition | | 0
Textless Speech-to-Speech Translation With Limited Parallel Data | Code | 0
Iteratively Improving Speech Recognition and Voice Conversion | | 0
Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation | | 0
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR | | 0
Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers | | 0
TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | | 0
Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | | 0
On the Transferability of Whisper-based Representations for "In-the-Wild" Cross-Task Downstream Speech Applications | | 0
Page 25 of 64

No leaderboard results yet.