SOTAVerified

Automatic Speech Recognition

Papers

Showing 9761000 of 3174 papers

TitleStatusHype
Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies0
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study0
Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation0
Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters0
Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?0
Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio0
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement0
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR0
Driving ROVER with Segment-based ASR Quality Estimation0
An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions0
Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition0
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR0
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition0
Dual Language Models for Code Switched Speech Recognition0
Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion0
Self-Supervised Learning for Multi-Channel Neural Transducer0
Effective Sentence Scoring Method using Bidirectional Language Model for Speech Recognition0
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?0
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion0
DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice Input0
Cantonese Automatic Speech Recognition Using Transfer Learning from Mandarin0
A Wav2vec2-Based Experimental Study on Self-Supervised Learning Methods to Improve Child Speech Recognition0
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting0
Are disentangled representations all you need to build speaker anonymization systems?0
Show:102550
← PrevPage 40 of 127Next →

No leaderboard results yet.