Optical Character Recognition (OCR)
Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image (for example: from a television broadcast)
Papers
Showing 1–10 of 1209 papers
All datasetsBenchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical StudyVideoDB's OCR Benchmark Public CollectionFSNS - TestI2L-140KSUTim2latex-100k
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4o | Average Accuracy | 76.22 | — | Unverified |
| 2 | Gemini-1.5 Pro | Average Accuracy | 76.13 | — | Unverified |
| 3 | Claude-3 Sonnet | Average Accuracy | 67.71 | — | Unverified |
| 4 | RapidOCR | Average Accuracy | 56.98 | — | Unverified |
| 5 | EasyOCR | Average Accuracy | 49.3 | — | Unverified |