SOTAVerified

Automatic Speech Recognition

Papers

Showing 301-325 of 3174 papers

Title | Status | Hype
Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition | Code | 1
Common Voice: A Massively-Multilingual Speech Corpus | Code | 1
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Code | 1
LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech | Code | 1
D4AM: A General Denoising Framework for Downstream Acoustic Models | Code | 1
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Code | 1
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI | Code | 1
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Code | 1
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context | Code | 1
A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Code | 1
A context-aware knowledge transferring strategy for CTC-based ASR | Code | 1
MelHuBERT: A simplified HuBERT on Mel spectrograms | Code | 1
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models | Code | 1
CopyNE: Better Contextual ASR by Copying Named Entities | Code | 1
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech | Code | 1
Cross Attention Augmented Transducer Networks for Simultaneous Translation | Code | 1
A Comparison of Adaptation Techniques and Recurrent Neural Network Architectures | Code | 0
A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos | Code | 0
Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Code | 0
Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Code | 0
Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia | Code | 0
A Comparative Study on Transformer vs RNN in Speech Applications | Code | 0
A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Code | 0
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition | Code | 0
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Code | 0
Page 13 of 127

No leaderboard results yet.