SOTAVerified

Automatic Speech Recognition

Papers

Showing 76–100 of 3174 papers

Title | Status | Hype
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition | Code | 1
LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition | Code | 1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features | Code | 1
Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction | Code | 1
Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish | Code | 1
Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models | Code | 1
Improving Self-supervised Pre-training using Accent-Specific Codebooks | Code | 1
Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models | Code | 1
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs | Code | 1
Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model | Code | 1
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet | Code | 1
Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech | Code | 1
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition | Code | 1
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | Code | 1
SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset | Code | 1
Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Code | 1
Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets | Code | 1
Less Peaky and More Accurate CTC Forced Alignment by Label Priors | Code | 1
Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal | Code | 1
Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Code | 1
Language and Speech Technology for Central Kurdish Varieties | Code | 1
A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Code | 1
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition | Code | 1
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR | Code | 1
Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric | Code | 1
Page 4 of 127

No leaderboard results yet.