SOTAVerified

Automatic Speech Recognition

Papers

Showing 31–40 of 3174 papers

| Title | Status | Hype |
|---|---|---|
| MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Code | 2 |
| Recent Advances in Speech Language Models: A Survey | Code | 2 |
| Large Language Models are Strong Audio-Visual Speech Recognition Learners | Code | 2 |
| Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions | Code | 2 |
| wav2graph: A Framework for Supervised Learning Knowledge Graph from Speech | Code | 2 |
| Pretraining End-to-End Keyword Search with Automatically Discovered Acoustic Units | Code | 2 |
| Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition | Code | 2 |
| PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings | Code | 2 |
| An Embarrassingly Simple Approach for LLM with Strong ASR Capacity | Code | 2 |
| AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension | Code | 2 |

No leaderboard results yet.