SOTAVerified

Automatic Speech Recognition

Papers

Showing 851875 of 3174 papers

TitleStatusHype
Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition0
HTEC: Human Transcription Error Correction0
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models0
HypR: A comprehensive study for ASR hypothesis revising with a reference corpusCode1
Distilling HuBERT with LSTMs via Decoupled Knowledge Distillation0
Investigating End-to-End ASR Architectures for Long Form Audio Transcription0
Training dynamic models using early exits for automatic speech recognition on resource-constrained devicesCode0
Instruction-Following Speech Recognition0
A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting0
Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning0
Enhancing Quantised End-to-End ASR Models via PersonalisationCode0
Improving Speech Recognition for African American English With Audio Classification0
Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation0
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints0
Transformer Based Punctuation Restoration for TurkishCode0
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability0
Unimodal Aggregation for CTC-based Speech RecognitionCode1
DiaCorrect: Error Correction Back-end For Speaker DiarizationCode1
Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription0
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network0
Hybrid Attention-based Encoder-decoder Model for Efficient Language Model Adaptation0
CPPF: A contextual and post-processing-free model for automatic speech recognition0
PromptASR for contextualized ASR with controllable styleCode2
Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks0
EnCodecMAE: Leveraging neural codecs for universal audio representation learningCode1
Show:102550
← PrevPage 35 of 127Next →

No leaderboard results yet.