SOTAVerified

Optical Character Recognition

Papers

Showing 151200 of 526 papers

TitleStatusHype
Segmentation-free Connectionist Temporal Classification loss based OCR Model for Text Captcha Classification0
ExTTNet: A Deep Learning Algorithm for Extracting Table Texts from Invoice Images0
From Training-Free to Adaptive: Empirical Insights into MLLMs' Understanding of Detection Information0
Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual AdaptersCode0
An Empirical Study of Scaling Law for OCRCode1
Chaurah: A Smart Raspberry Pi based Parking System0
Segmenting Messy Text: Detecting Boundaries in Text Derived from Historical Newspaper Images0
Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey0
memorAIs: an Optical Character Recognition and Rule-Based Medication Intake Reminder-Generating SolutionCode0
UPOCR: Towards Unified Pixel-Level OCR Interface0
IDPL-PFOD2: A New Large-Scale Dataset for Printed Farsi Optical Character RecognitionCode0
Pipeline Enabling Zero-shot Classification for Bangla Handwritten Grapheme0
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks0
Optimization of Image Processing Algorithms for Character Recognition in Cultural Typewritten DocumentsCode0
Data Generation for Post-OCR correction of Cyrillic handwritingCode1
ChemScraper: Leveraging PDF Graphics Instructions for Molecular Diagram ParsingCode0
Efficient End-to-End Visual Document Understanding with Rationale Distillation0
DECDM: Document Enhancement using Cycle-Consistent Diffusion Models0
Reading Between the Mud: A Challenging Motorcycle Racer Number DatasetCode0
DONUT-hole: DONUT Sparsification by Harnessing Knowledge and Optimizing Learning Efficiency0
On Manipulating Scene Text in the Wild with Diffusion ModelsCode0
Exploring OCR Capabilities of GPT-4V(ision) : A Quantitative and In-depth EvaluationCode1
GenKIE: Robust Generative Multimodal Document Key Information ExtractionCode1
Towards reducing hallucination in extracting information from financial reports using Large Language Models0
EfficientOCR: An Extensible, Open-Source Package for Efficiently Digitizing World Knowledge0
Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA0
Invisible Threats: Backdoor Attack in OCR Systems0
Persis: A Persian Font Recognition Pipeline Using Convolutional Neural NetworksCode1
Comprehensive Overview of Named Entity Recognition: Models, Domain-Specific Applications and Challenges0
DTrOCR: Decoder-only Transformer for Optical Character RecognitionCode2
Handwritten image augmentation0
Bengali Document Layout Analysis with Detectron20
Nougat: Neural Optical Understanding for Academic DocumentsCode5
DISGO: Automatic End-to-End Evaluation for Scene Text OCR0
bbOCR: An Open-source Multi-domain OCR Pipeline for Bengali DocumentsCode1
Extraction of Text from Optic Nerve Optical Coherence Tomography Reports0
OCR Language Models with Custom Vocabularies0
Multimodal Analysis Of Google Bard And GPT-Vision: Experiments In Visual Reasoning0
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data GenerationCode1
Universal Defensive Underpainting Patch: Making Your Text Invisible to Optical Character RecognitionCode1
CTP-Net: Character Texture Perception Network for Document Image Forgery Localization0
Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations0
Optimizing the Neural Network Training for OCR Error Correction of Historical Hebrew Texts0
Toward a Period-Specific Optimized Neural Network for OCR Error Correction of Historical Hebrew Texts0
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math TextbooksCode0
Validation of a Zero-Shot Learning Natural Language Processing Tool for Data Abstraction from Unstructured Healthcare DataCode1
Handwritten and Printed Text Segmentation: A Signature Case Study0
Handwritten Text Recognition Using Convolutional Neural Network0
A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing0
Artificial Eye for the Blind0
Show:102550
← PrevPage 4 of 11Next →

No leaderboard results yet.