SOTAVerified

Handwritten Text Recognition

Handwritten Text Recognition (HTR) is the task of automatically identifying and transcribing handwritten text from images or scanned documents into machine-readable text. The goal is to develop a system capable of accurately interpreting diverse handwriting styles, accounting for variations in alignment, stroke, spacing, and noise. This task involves detecting handwritten regions within an image, extracting the text content, and converting it into a structured digital format, enabling further search, indexing, or data analysis.

Papers

Showing 125 of 139 papers

TitleStatusHype
Unlocking the Archives: Using Large Language Models to Transcribe Handwritten Historical DocumentsCode2
HTR-VT: Handwritten Text Recognition with Vision TransformerCode2
DTrOCR: Decoder-only Transformer for Optical Character RecognitionCode2
Full Page Handwriting Recognition via Image to Sequence ExtractionCode2
Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognitionCode1
Hespi: A pipeline for automatically detecting information from hebarium specimen sheetsCode1
DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documentsCode1
Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text RecognitionCode1
Best Practices for a Handwritten Text Recognition SystemCode1
Data Generation for Post-OCR correction of Cyrillic handwritingCode1
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth EvaluationCode1
BN-DRISHTI: Bangla Document Recognition through Instance-level Segmentation of Handwritten Text ImagesCode1
Enhancing Indic Handwritten Text Recognition Using Global Semantic InformationCode1
Towards End-to-end Handwritten Document RecognitionCode1
Easter2.0: Improving convolutional models for handwritten text recognitionCode1
DAN: a Segmentation-free Document Attention Network for Handwritten Document RecognitionCode1
AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder NetworksCode1
Continuous Offline Handwriting Recognition using Deep Learning ModelsCode1
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask ArchitectureCode1
Rescoring Sequence-to-Sequence Models for Text Line Recognition with CTC-PrefixesCode1
KOHTD: Kazakh Offline Handwritten Text DatasetCode1
TrOCR: Transformer-based Optical Character Recognition with Pre-trained ModelsCode1
StackMix and Blot Augmentations for Handwritten Text RecognitionCode1
Few Shots Are All You Need: A Progressive Few Shot Learning Approach for Low Resource Handwritten Text RecognitionCode1
LineCounter: Learning Handwritten Text Line Segmentation by CountingCode1
Show:102550
← PrevPage 1 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Transformer w/ CNNCER7.62Unverified
2FPHR Paragraph Level (~145 dpi)CER6.7Unverified
3Leaky LP CellCER6.6Unverified
4FPHR+Aug Line Level (~145 dpi)CER6.5Unverified
5Decouple Attention NetworkCER6.4Unverified
6Start, Follow, ReadCER6.4Unverified
7FPHR+Aug Paragraph Level (~145 dpi)CER6.3Unverified
8Easter2.0CER6.21Unverified
9HTR-VT(line-level)CER4.7Unverified
10Transformer w/ CNN (+synth)CER4.67Unverified
#ModelMetricClaimedVerifiedStatus
1GFCNTest CER5.2Unverified
2TrOCRTest CER3.6Unverified
3OrigamiNet-18Test CER3.1Unverified
4OrigamiNet-12Test CER3.1Unverified
5OrigamiNet-24Test CER3Unverified
6HTR-VTTest CER2.8Unverified
#ModelMetricClaimedVerifiedStatus
1GFCNTest CER8Unverified
2OrigamiNet-12Test CER6Unverified
3VANTest CER5Unverified
4HTR-VTTest CER4.7Unverified
5TrOCRTest CER3.4Unverified
#ModelMetricClaimedVerifiedStatus
1CNN + BLSTMTest CER4.7Unverified
2SpanTest CER4.6Unverified
3VANTest CER4.1Unverified
4DANTest CER4.1Unverified
5HTR-VTTest CER3.9Unverified
#ModelMetricClaimedVerifiedStatus
1PyLaia (human transcriptions + random split)CER (%)10.54Unverified
2PyLaia (human transcriptions + agreement-based split)CER (%)5.57Unverified
3PyLaia (rover consensus + agreement-based split)CER (%)4.95Unverified
4PyLaia (all transcriptions + agreement-based split)CER (%)4.34Unverified
#ModelMetricClaimedVerifiedStatus
1HTR-VT(line-level)CER (%)3.9Unverified
2DANCER (%)3.22Unverified
#ModelMetricClaimedVerifiedStatus
1StackMix+BlotsCER1.73Unverified
#ModelMetricClaimedVerifiedStatus
1StackMix+BlotsCER2.5Unverified
#ModelMetricClaimedVerifiedStatus
1StackMix+BlotsCER3.49Unverified
#ModelMetricClaimedVerifiedStatus
1StackMix+BlotsCER3.77Unverified
#ModelMetricClaimedVerifiedStatus
1StackMix+BlotsCER3.01Unverified
#ModelMetricClaimedVerifiedStatus
1StackMix+BlotsCER3.65Unverified
#ModelMetricClaimedVerifiedStatus
1DANCER (%)6.46Unverified