SOTAVerified

Automatic Speech Recognition

Papers

Showing 13011325 of 3174 papers

TitleStatusHype
Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition0
Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling0
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion0
Accented Speech Recognition: A Survey0
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend0
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios0
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning0
Extracting Biomedical Entities from Noisy Audio Transcripts0
FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator0
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?0
Fotheidil: an Automatic Transcription System for the Irish Language0
Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition0
Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license0
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin0
Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition0
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR0
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition0
Cantonese Automatic Speech Recognition Using Transfer Learning from Mandarin0
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition0
Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection0
Exploring Transfer Learning For End-to-End Spoken Language Understanding0
A Wav2vec2-Based Experimental Study on Self-Supervised Learning Methods to Improve Child Speech Recognition0
From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data0
Exploring the Role of Audio in Video Captioning0
Show:102550
← PrevPage 53 of 127Next →

No leaderboard results yet.