SOTAVerified

Automatic Speech Recognition

Papers

Showing 2301-2325 of 3174 papers

Title | Status | Hype
Cascaded encoders for unifying streaming and non-streaming ASR | | 0
Emotion recognition by fusing time synchronous and time asynchronous representations | | 0
Improved Mask-CTC for Non-Autoregressive End-to-End ASR | | 0
Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition | Code | 1
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning | Code | 0
Two-stage Textual Knowledge Distillation for End-to-End Spoken Language Understanding | Code | 0
Improving Noise Robustness of an End-to-End Neural Model for Automatic Speech Recognition | | 0
Rethinking Evaluation in ASR: Are Our Models Robust Enough? | Code | 0
Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data | | 0
Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition | Code | 1
MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation | | 0
How Phonotactics Affect Multilingual and Zero-shot ASR Performance | Code | 0
SlimIPL: Language-Model-Free Iterative Pseudo-Labeling | | 0
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks | | 0
Sentence Boundary Augmentation For Neural Machine Translation Robustness | | 0
VenoMave: Targeted Poisoning Against Speech Recognition | Code | 0
Cascaded Models With Cyclic Feedback For Direct Speech Translation | | 0
Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin | Code | 0
Knowledge Distillation for Improved Accuracy in Spoken Question Answering | | 0
FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization | Code | 0
Replacing Human Audio with Synthetic Audio for On-device Unspoken Punctuation Prediction | | 0
Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition | Code | 1
Knowledge Transfer for Efficient On-device False Trigger Mitigation | | 0
Ensemble Chinese End-to-End Spoken Language Understanding for Abnormal Event Detection from audio stream | | 0
Towards Data Distillation for End-to-end Spoken Conversational Question Answering | | 0
Page 93 of 127

No leaderboard results yet.