Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks | Jul 30, 2023 | Math, Optical Character Recognition | Code Available
Optimizing the Neural Network Training for OCR Error Correction of Historical Hebrew Texts | Jul 30, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified
Multi-Granularity Prediction with Learnable Fusion for Scene Text Recognition | Jul 25, 2023 | Language Modelling, Optical Character Recognition (OCR) | Unverified
MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary | Jul 24, 2023 | document understanding, Optical Character Recognition (OCR) | Unverified
A comparative analysis of SRGAN models | Jul 18, 2023 | Generative Adversarial Network, Image Super-Resolution | Unverified
Handwritten and Printed Text Segmentation: A Signature Case Study | Jul 15, 2023 | Binary Classification, Optical Character Recognition | Unverified
Handwritten Text Recognition Using Convolutional Neural Network | Jul 11, 2023 | Handwritten Text Recognition, Optical Character Recognition | Unverified
A Novel Pipeline for Improving Optical Character Recognition through Post-processing Using Natural Language Processing | Jul 9, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified
Artificial Eye for the Blind | Jul 7, 2023 | Object, object-detection | Unverified
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding | Jul 4, 2023 | document understanding, Language Modeling | Unverified
Estimating Post-OCR Denoising Complexity on Numerical Texts | Jul 3, 2023 | Denoising, Optical Character Recognition (OCR) | Unverified
Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets | Jul 2, 2023 | Fact Checking, Optical Character Recognition (OCR) | Unverified
Resume Information Extraction via Post-OCR Text Processing | Jun 23, 2023 | Object Recognition, Optical Character Recognition | Unverified
A Survey on Multimodal Large Language Models | Jun 23, 2023 | Hallucination, In-Context Learning | Unverified
Document Image Cleaning using Budget-Aware Black-Box Approximation | Jun 22, 2023 | Optical Character Recognition (OCR) | Code Available
When Vision Fails: Text Attacks Against ViT and OCR | Jun 12, 2023 | Optical Character Recognition (OCR) | Code Available
Weakly supervised information extraction from inscrutable handwritten document images | Jun 12, 2023 | Language Modeling, Language Modelling | Unverified
SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning | Jun 6, 2023 | Caption Generation, Image Captioning | Code Available
Transformer-Based UNet with Multi-Headed Cross-Attention Skip Connections to Eliminate Artifacts in Scanned Documents | Jun 5, 2023 | Denoising, Document Classification | Unverified
Improving Handwritten OCR with Training Samples Generated by Glyph Conditional Denoising Diffusion Probabilistic Model | May 31, 2023 | Denoising, Optical Character Recognition (OCR) | Unverified
DuoSearch: A Novel Search Engine for Bulgarian Historical Documents | May 30, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available
A template-independent approach for information extraction in real estate documents | May 30, 2023 | Information Retrieval, Natural Language Understanding | Code Available
People and Places of Historical Europe: Bootstrapping Annotation Pipeline and a New Corpus of Named Entities in Late Medieval Texts | May 26, 2023 | Information Retrieval, named-entity-recognition | Unverified
Quantifying Character Similarity with Vision Transformers | May 24, 2023 | Optical Character Recognition (OCR) | Code Available
DUBLIN -- Document Understanding By Language-Image Network | May 23, 2023 | Document Classification, document understanding | Unverified
Measuring Intersectional Biases in Historical Documents | May 21, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available
TextDiffuser: Diffusion Models as Text Painters | May 18, 2023 | Optical Character Recognition (OCR) | Unverified
Mobile User Interface Element Detection Via Adaptively Prompt Tuning | May 16, 2023 | object-detection, Object Detection | Code Available
Sequence-to-Sequence Pre-training with Unified Modality Masking for Visual Document Understanding | May 16, 2023 | Decoder, document understanding | Unverified
Combining OCR Models for Reading Early Modern Printed Books | May 11, 2023 | Font Recognition, Optical Character Recognition (OCR) | Code Available
E2TIMT: Efficient and Effective Modal Adapter for Text Image Machine Translation | May 9, 2023 | Decoder, Machine Translation | Code Available
Text Reading Order in Uncontrolled Conditions by Sparse Graph Segmentation | May 4, 2023 | Optical Character Recognition (OCR) | Unverified
Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections | May 3, 2023 | graph construction, Optical Character Recognition | Unverified
ICDAR 2023 Competition on Reading the Seal Title | Apr 24, 2023 | Optical Character Recognition (OCR), Task 2 | Unverified
Multimodal Short Video Rumor Detection System Based on Contrastive Learning | Apr 17, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
TransDocs: Optical Character Recognition with word to word translation | Apr 15, 2023 | Deep Learning, Document Translation | Code Available
Cleansing Jewel: A Neural Spelling Correction Model Built On Google OCR-ed Tibetan Manuscripts | Apr 7, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified
Linking Representations with Multimodal Contrastive Learning | Apr 7, 2023 | Contrastive Learning, Optical Character Recognition | Unverified
A semi-automatic method for document classification in the shipping industry | Mar 29, 2023 | Classification, Document Classification | Unverified
OVeNet: Offset Vector Network for Semantic Segmentation | Mar 25, 2023 | Optical Character Recognition (OCR), Scene Understanding | Code Available
CLIP-ReIdent: Contrastive Training for Player Re-Identification | Mar 21, 2023 | Optical Character Recognition (OCR), Sports Analytics | Unverified
Optical Character Recognition and Transcription of Berber Signs from Images in a Low-Resource Language Amazigh | Mar 21, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified
The System Description of dun_oscar team for The ICPR MSR Challenge | Mar 13, 2023 | Optical Character Recognition (OCR) | Unverified
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis Dataset | Mar 9, 2023 | Benchmarking, Deep Learning | Code Available
Meme Sentiment Analysis Enhanced with Multimodal Spatial Encoding and Facial Embedding | Mar 3, 2023 | Optical Character Recognition (OCR), Position | Unverified
StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classification, image-classification | Code Available
Language Is Not All You Need: Aligning Perception with Language Models | Feb 27, 2023 | All, Image Captioning | Unverified
User-Centric Evaluation of OCR Systems for Kwak'wala | Feb 26, 2023 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified
An Investigation into Pre-Training Object-Centric Representations for Reinforcement Learning | Feb 9, 2023 | Object, Optical Character Recognition (OCR) | Unverified
SPARLING: Learning Latent Representations with Extremely Sparse Activations | Feb 3, 2023 | Optical Character Recognition (OCR) | Unverified