SOTAVerified

Handwritten Text Recognition

Handwritten Text Recognition (HTR) is the task of automatically identifying and transcribing handwritten text from images or scanned documents into machine-readable text. The goal is to develop a system capable of accurately interpreting diverse handwriting styles, accounting for variations in alignment, stroke, spacing, and noise. This task involves detecting handwritten regions within an image, extracting the text content, and converting it into a structured digital format, enabling further search, indexing, or data analysis.

Papers

Showing 1-25 of 139 papers

| Title | Status | Hype |
|---|---|---|
| Advancing Offline Handwritten Text Recognition: A Systematic Review of Data Augmentation and Generation Techniques | | 0 |
| Learning to Align: Addressing Character Frequency Distribution Shifts in Handwritten Text Recognition | Code | 0 |
| MetaWriter: Personalized Handwritten Text Recognition Using Meta-Learned Prompt Tuning | | 0 |
| Preserving Privacy Without Compromising Accuracy: Machine Unlearning for Handwritten Text Recognition | | 0 |
| Meta-DAN: towards an efficient prediction strategy for page-level handwritten text recognition | Code | 1 |
| TRIDIS: A Comprehensive Medieval and Early Modern Corpus for HTR and NER | | 0 |
| Benchmarking Large Language Models for Handwritten Text Recognition | | 0 |
| Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription | Code | 0 |
| Handwritten Text Recognition: A Survey | | 0 |
| Col-OLHTR: A Novel Framework for Multimodal Online Handwritten Text Recognition | | 0 |
| HAND: Hierarchical Attention Network for Multi-Scale Handwritten Document Recognition and Layout Analysis | Code | 0 |
| HTR-JAND: Handwritten Text Recognition with Joint Attention Network and Knowledge Distillation | Code | 0 |
| On the Generalization of Handwritten Text Recognition Models | | 0 |
| Nuremberg Letterbooks: A Multi-Transcriptional Dataset of Early 15th Century Manuscripts for Document Analysis | | 0 |
| Unlocking the Archives: Using Large Language Models to Transcribe Handwritten Historical Documents | Code | 2 |
| Integrating Canonical Neural Units and Multi-Scale Training for Handwritten Text Recognition | | 0 |
| Hespi: A pipeline for automatically detecting information from hebarium specimen sheets | Code | 1 |
| HATFormer: Historic Handwritten Arabic Text Recognition with Transformers | | 0 |
| HTR-VT: Handwritten Text Recognition with Vision Transformer | Code | 2 |
| Platypus: A Generalized Specialist Model for Reading Text in Various Forms | Code | 0 |
| DANIEL: A fast Document Attention Network for Information Extraction and Labelling of handwritten documents | Code | 1 |
| Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition | Code | 1 |
| Arabic Handwritten Text for Person Biometric Identification: A Deep Learning Approach | | 0 |
| Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition | | 0 |
| End-to-end information extraction in handwritten documents: Understanding Paris marriage records from 1880 to 1940 | | 0 |

Benchmark Results

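| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Transformer w/ CNN | CER | 7.62 | | Unverified |
| 2 | FPHR Paragraph Level (~145 dpi) | CER | 6.7 | | Unverified |
| 3 | Leaky LP Cell | CER | 6.6 | | Unverified |
| 4 | FPHR+Aug Line Level (~145 dpi) | CER | 6.5 | | Unverified |
| 5 | Decouple Attention Network | CER | 6.4 | | Unverified |
| 6 | Start, Follow, Read | CER | 6.4 | | Unverified |
| 7 | FPHR+Aug Paragraph Level (~145 dpi) | CER | 6.3 | | Unverified |
| 8 | Easter2.0 | CER | 6.21 | | Unverified |
| 9 | HTR-VT(line-level) | CER | 4.7 | | Unverified |
| 10 | Transformer w/ CNN (+synth) | CER | 4.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GFCN | Test CER | 5.2 | | Unverified |
| 2 | TrOCR | Test CER | 3.6 | | Unverified |
| 3 | OrigamiNet-18 | Test CER | 3.1 | | Unverified |
| 4 | OrigamiNet-12 | Test CER | 3.1 | | Unverified |
| 5 | OrigamiNet-24 | Test CER | 3 | | Unverified |
| 6 | HTR-VT | Test CER | 2.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GFCN | Test CER | 8 | | Unverified |
| 2 | OrigamiNet-12 | Test CER | 6 | | Unverified |
| 3 | VAN | Test CER | 5 | | Unverified |
| 4 | HTR-VT | Test CER | 4.7 | | Unverified |
| 5 | TrOCR | Test CER | 3.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CNN + BLSTM | Test CER | 4.7 | | Unverified |
| 2 | Span | Test CER | 4.6 | | Unverified |
| 3 | VAN | Test CER | 4.1 | | Unverified |
| 4 | DAN | Test CER | 4.1 | | Unverified |
| 5 | HTR-VT | Test CER | 3.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PyLaia (human transcriptions + random split) | CER (%) | 10.54 | | Unverified |
| 2 | PyLaia (human transcriptions + agreement-based split) | CER (%) | 5.57 | | Unverified |
| 3 | PyLaia (rover consensus + agreement-based split) | CER (%) | 4.95 | | Unverified |
| 4 | PyLaia (all transcriptions + agreement-based split) | CER (%) | 4.34 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HTR-VT(line-level) | CER (%) | 3.9 | | Unverified |
| 2 | DAN | CER (%) | 3.22 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StackMix+Blots | CER | 1.73 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StackMix+Blots | CER | 2.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StackMix+Blots | CER | 3.49 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StackMix+Blots | CER | 3.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StackMix+Blots | CER | 3.01 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | StackMix+Blots | CER | 3.65 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DAN | CER (%) | 6.46 | | Unverified |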
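
Every benchmark entry above is reported as a character error rate (CER). As a rough illustration of what that number usually means, the sketch below computes CER as the Levenshtein (edit) distance between the predicted and reference transcriptions, divided by the number of reference characters and expressed in percent. This is the common textbook definition, not necessarily the exact protocol behind any listed result: the papers may differ in text normalization (case, punctuation, whitespace) and in whether they average per line or over the whole test set. The function names `edit_distance` and `cer` are illustrative choices, not taken from any listed codebase.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum number of character insertions,
    deletions, and substitutions needed to turn `hyp` into `ref`."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # character of ref missing from hyp
                curr[j - 1] + 1,     # extra character in hyp
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level CER in percent (assumed convention):
    total edit operations divided by total reference characters."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    return 100.0 * total_edits / total_chars


if __name__ == "__main__":
    # One dropped character out of 11 reference characters.
    print(round(cer(["handwritten"], ["handwriten"]), 2))  # prints 9.09
```

Under this convention a lower CER is better, and a value can exceed 100 when the hypothesis contains many spurious characters.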