SOTAVerified

Automatic Speech Recognition

Papers

Showing 9511000 of 3174 papers

TitleStatusHype
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification0
DisfluencyFixer: A tool to enhance Language Learning through Speech To Speech Disfluency Correction0
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition0
Capturing Multi-Resolution Context by Dilated Self-Attention0
Distilling HuBERT with LSTMs via Decoupled Knowledge Distillation0
A review of on-device fully neural end-to-end automatic speech recognition algorithms0
Distilling the Knowledge of BERT for CTC-based ASR0
Capitalization and Punctuation Restoration: a Survey0
Can You Hear It? Backdoor Attacks via Ultrasonic Triggers0
A Review of Deep Learning Techniques for Speech Processing0
Distributed Deep Learning Strategies For Automatic Speech Recognition0
Distributed Training of Deep Neural Network Acoustic Models for Automatic Speech Recognition0
DNCASR: End-to-End Training for Speaker-Attributed ASR0
DNN-Based Multilingual Automatic Speech Recognition for Wolaytta using Oromo Speech0
DNN-Based Semantic Model for Rescoring N-best Speech Recognition List0
Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding0
Automatic Speech Recognition And Limited Vocabulary: A Survey0
An Approach to Improve Robustness of NLP Systems against ASR Errors0
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study0
Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?0
Does Visual Self-Supervision Improve Learning of Speech Representations for Emotion Recognition?0
Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation0
Adversarial synthesis based data-augmentation for code-switched spoken language identification0
Domain Adaptation of low-resource Target-Domain models using well-trained ASR Conformer Models0
Can Whisper perform speech-based in-context learning?0
Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies0
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study0
Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation0
Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters0
Do We Still Need Automatic Speech Recognition for Spoken Language Understanding?0
Do You Listen with One or Two Microphones? A Unified ASR Model for Single and Multi-Channel Audio0
DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement0
DRAFT: A Novel Framework to Reduce Domain Shifting in Self-supervised Learning and Its Application to Children's ASR0
Driving ROVER with Segment-based ASR Quality Estimation0
An Investigation of End-to-End Multichannel Speech Recognition for Reverberant and Mismatch Conditions0
Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition0
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR0
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition0
Dual Language Models for Code Switched Speech Recognition0
Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion0
Self-Supervised Learning for Multi-Channel Neural Transducer0
Effective Sentence Scoring Method using Bidirectional Language Model for Speech Recognition0
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?0
DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion0
DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice Input0
Cantonese Automatic Speech Recognition Using Transfer Learning from Mandarin0
A Wav2vec2-Based Experimental Study on Self-Supervised Learning Methods to Improve Child Speech Recognition0
DuTongChuan: Context-aware Translation Model for Simultaneous Interpreting0
Are disentangled representations all you need to build speaker anonymization systems?0
Show:102550
← PrevPage 20 of 64Next →

No leaderboard results yet.