SOTAVerified

Automatic Speech Recognition

Papers

Showing 901–925 of 3174 papers

Title | Status | Hype
Convoifilter: A case study of doing cocktail party speech recognition | | 0
SeamlessM4T: Massively Multilingual & Multimodal Machine Translation | Code | 2
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition | | 0
Indonesian Automatic Speech Recognition with XLSR-53 | | 0
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | | 0
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals | | 0
Accurate synthesis of Dysarthric Speech for ASR data augmentation | | 0
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations | Code | 0
Improving CTC-AED model with integrated-CTC and auxiliary loss regularization | | 0
Using Text Injection to Improve Recognition of Personal Identifiers in Speech | | 0
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models | | 0
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder | Code | 1
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Code | 0
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | | 0
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | | 0
A Novel Self-training Approach for Low-resource Speech Recognition | | 0
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Code | 0
Comparative Analysis of the wav2vec 2.0 Feature Extractor | | 0
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Code | 1
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | | 0
ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging | | 0
Federated Representation Learning for Automatic Speech Recognition | | 0
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification | | 0
Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | | 0
ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus | Code | 1
Page 37 of 127

No leaderboard results yet.