SOTAVerified

Automatic Speech Recognition

Papers

Showing 201–225 of 3174 papers

Title | Status | Hype
persoDA: Personalized Data Augmentation for Personalized ASR | - | 0
Adapting Whisper for Regional Dialects: Enhancing Public Services for Vulnerable Populations in the United Kingdom | - | 0
A Non-autoregressive Model for Joint STT and TTS | - | 0
Selective Attention Merging for low resource tasks: A case study of Child ASR | Code | 0
Loudspeaker Beamforming to Enhance Speech Recognition Performance of Voice Driven Applications | - | 0
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Code | 0
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR | Code | 0
Speech Recognition for Automatically Assessing Afrikaans and isiXhosa Preschool Oral Narratives | - | 0
A Survey on Spoken Italian Datasets and Corpora | - | 0
Discrete Speech Unit Extraction via Independent Component Analysis | Code | 0
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing | Code | 0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Code | 0
Universal-2-TF: Robust All-Neural Text Formatting for ASR | - | 0
Contextual ASR Error Handling with LLMs Augmentation for Goal-Oriented Conversational AI | - | 0
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition | - | 0
Deep Learning for Pathological Speech: A Survey | - | 0
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection | - | 0
Samba-ASR: State-Of-The-Art Speech Recognition Leveraging Structured State-Space Models | - | 0
Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Code | 0
Improving Transducer-Based Spoken Language Understanding with Self-Conditioned CTC and Knowledge Transfer | - | 0
Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models | Code | 0
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | - | 0
Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition | - | 0
Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing | - | 0
Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Code | 1

No leaderboard results yet.