SOTAVerified

Automatic Speech Recognition

Papers

Showing 101–125 of 3174 papers

Title | Status | Hype
Extending Whisper with prompt tuning to target-speaker ASR | Code | 1
D4AM: A General Denoising Framework for Downstream Acoustic Models | Code | 1
Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation | Code | 1
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning | Code | 1
Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts | Code | 1
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation | Code | 1
Automatic Disfluency Detection from Untranscribed Speech | Code | 1
Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics | Code | 1
ArTST: Arabic Text and Speech Transformer | Code | 1
CL-MASR: A Continual Learning Benchmark for Multilingual ASR | Code | 1
Accented Speech Recognition With Accent-specific Codebooks | Code | 1
Advancing Test-Time Adaptation in Wild Acoustic Test Settings | Code | 1
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale | Code | 1
Speech collage: code-switched audio generation by collaging monolingual corpora | Code | 1
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models | Code | 1
Memory-augmented conformer for improved end-to-end long-form ASR | Code | 1
HypR: A comprehensive study for ASR hypothesis revising with a reference corpus | Code | 1
DiaCorrect: Error Correction Back-end For Speaker Diarization | Code | 1
Unimodal Aggregation for CTC-based Speech Recognition | Code | 1
EnCodecMAE: Leveraging neural codecs for universal audio representation learning | Code | 1
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder | Code | 1
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Code | 1
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Code | 1
Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures | Code | 1
Adaptation of Whisper models to child speech recognition | Code | 1

No leaderboard results yet.