SOTAVerified

Automatic Speech Recognition

Papers

Showing 951–975 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture | | 0 |
| Transcribing Educational Videos Using Whisper: A preliminary study on using AI for transcribing educational videos | | 0 |
| Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework | | 0 |
| Boosting Norwegian Automatic Speech Recognition | | 0 |
| Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data | | 0 |
| Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource Languages | | 0 |
| Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters | | 0 |
| Conformer LLMs -- Convolution Augmented Large Language Models | | 0 |
| Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications | | 0 |
| Accelerating Transducers through Adjacent Token Merging | | 0 |
| Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning | | 0 |
| The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios | | 0 |
| Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection | | 0 |
| Learning When to Trust Which Teacher for Weakly Supervised ASR | | 0 |
| Exploring the Role of Audio in Video Captioning | | 0 |
| A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Code | 1 |
| NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning | Code | 1 |
| Federated Self-Learning with Weak Supervision for Speech Recognition | | 0 |
| Mixture Encoder for Joint Speech Separation and Recognition | | 0 |
| Quilt-1M: One Million Image-Text Pairs for Histopathology | Code | 1 |
| Rehearsal-Free Online Continual Learning for Automatic Speech Recognition | Code | 0 |
| MobileASR: A resource-aware on-device learning framework for user voice personalization applications on mobile phones | | 0 |
| Pushing the Limits of Unsupervised Unit Discovery for SSL Speech Representation | Code | 1 |
| Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction | | 0 |
| Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer | | 0 |

No leaderboard results yet.