SOTAVerified

Automatic Speech Recognition

Papers

Showing 751775 of 3174 papers

TitleStatusHype
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs0
On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition0
Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data AugmentationCode1
1SPU: 1-step Speech Processing Unit0
A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognitionCode0
Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer LearningCode1
Fine-tuning convergence model in Bengali speech recognition0
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech RecognitionCode0
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning0
Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants0
Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific ExpertsCode1
Automatic Disfluency Detection from Untranscribed SpeechCode1
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech TranslationCode1
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios0
Combining Language Models For Specialized Domains: A Colorful Approach0
Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual DynamicsCode1
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge0
DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European LanguagesCode0
CL-MASR: A Continual Learning Benchmark for Multilingual ASRCode1
ArTST: Arabic Text and Speech TransformerCode1
Accented Speech Recognition With Accent-specific CodebooksCode1
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features0
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation0
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech RecognitionCode0
Quantifying the Dialect Gap and its Correlates Across Languages0
Show:102550
← PrevPage 31 of 127Next →

No leaderboard results yet.