SOTAVerified

Automatic Speech Recognition

Papers

Showing 3140 of 3174 papers

TitleStatusHype
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPTCode2
Large Language Models are Strong Audio-Visual Speech Recognition LearnersCode2
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionCode2
LiteASR: Efficient Automatic Speech Recognition with Low-Rank ApproximationCode2
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker RecordingsCode2
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech CodecCode2
Fast Transformers with Clustered AttentionCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
An Embarrassingly Simple Approach for LLM with Strong ASR CapacityCode2
AIR-Bench: Benchmarking Large Audio-Language Models via Generative ComprehensionCode2
Show:102550
← PrevPage 4 of 318Next →

No leaderboard results yet.