Automatic Speech Recognition (ASR)
Automatic Speech Recognition (ASR) converts spoken language into written text. ASR systems often transcribe speech in real time, allowing people to interact with computers, mobile devices, and other technology by voice. The goal is to transcribe speech accurately despite variations in accent, pronunciation, and speaking style, as well as background noise and other factors that degrade speech quality.
Papers
Showing 1–10 of 3012 papers
Datasets: LRS2, RealMAN, Sagalee, HUI speech corpus, LRS3-TED, M-AILabs speech dataset, The Spoken Wikipedia Corpora, Voxforge German, VoxPopuli
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Conformer Transducer | WER (%) | 8.04 | — | Unverified |
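
The table reports word error rate (WER), which is conventionally defined as the word-level edit distance between a reference transcript and the system output, divided by the number of reference words. A minimal sketch of that calculation follows. It assumes simple whitespace tokenization and no text normalization (casing, punctuation), which real benchmarks usually specify separately; the function name `word_error_rate` is illustrative and not taken from any benchmark's scoring tool.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER as a percentage, using word-level Levenshtein distance.

    Assumption: words are separated by whitespace and compared exactly.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from output
                curr[j - 1] + 1,     # insertion: extra word in output
                prev[j - 1] + cost,  # substitution, or match when cost is 0
            )
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("turn the lights off", "turn the light off"))  # 25.0
```

Because insertions count as errors, WER computed this way can exceed 100% when the hypothesis is much longer than the reference.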