SOTAVerified

Automatic Speech Recognition

Papers

Showing 551-575 of 3174 papers

Title | Status | Hype
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores | - | 0
Joint Beam Search Integrating CTC, Attention, and Transducer Decoders | - | 0
Text Injection for Neural Contextual Biasing | - | 0
Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | - | 0
Enhancing CTC-based speech recognition with diverse modeling units | - | 0
Error-preserving Automatic Speech Recognition of Young English Learners' Language | Code | 0
Keyword-Guided Adaptation of Automatic Speech Recognition | - | 0
Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping | - | 0
Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision | - | 0
Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach | - | 0
Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning | - | 0
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities | - | 0
Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation | - | 0
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | Code | 1
Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients | Code | 0
Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition | - | 0
Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | Code | 0
Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models | Code | 3
Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition | Code | 2
Contextualized Automatic Speech Recognition with Dynamic Vocabulary | - | 0
You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish | - | 0
Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation | - | 0
FairLENS: Assessing Fairness in Law Enforcement Speech Recognition | - | 0
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models | - | 0
Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings | - | 0

No leaderboard results yet.